A visual benchmark comparing how different Large Language Models (LLMs) handle complex coding prompts, particularly for games and interactive UI. This project serves as a "context-window-in-action" gallery.
The project is a static site that aggregates benchmark results from various models (Gemini, Claude, GPT, Grok). Each benchmark is a directory containing:
- `prompt.txt`: The exact prompt given to the models.
- Sub-directories for each model (e.g., `gemini`, `claude`), each containing the generated `index.html`.
- `modelnames.json`: Mapping internal IDs to human-readable names.
```
.
├── create_config.sh      # Script to regenerate the gallery index
├── index.html            # Main gallery UI
├── flappy/               # Benchmark: Flappy Bird clone
│   ├── prompt.txt        # The prompt used
│   ├── gemini/           # Result from Gemini
│   │   └── index.html
│   └── claude/           # Result from Claude
│       └── index.html
└── platformer/           # Benchmark: Platformer game
    └── ...
```
- Side-by-Side Comparison: View model outputs for the same prompt in one interface.
- Dynamic Config Generation: Just drop in a new result folder and run `create_config.sh`.
- Vanilla Implementation: No heavy frameworks, just fast, static HTML/JS.
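To make the "drop a folder, regenerate" idea concrete, here is a hypothetical sketch of the kind of scan a script like `create_config.sh` could perform: treat every directory holding a `prompt.txt` as a benchmark and its subdirectories as model results. The real script's logic and output format may differ; this runs against a throwaway fixture.

```shell
#!/usr/bin/env sh
set -e
# Sketch only: discover benchmarks and model results by directory layout.
cd "$(mktemp -d)"                       # scratch fixture standing in for the repo root
mkdir -p flappy/gemini flappy/claude
: > flappy/prompt.txt

for p in */prompt.txt; do
  bench=${p%/prompt.txt}                # e.g. "flappy"
  for m in "$bench"/*/; do
    m=${m%/}
    echo "$bench: ${m#*/}" >> config.txt   # one line per (benchmark, model) pair
  done
done
cat config.txt
```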
We welcome contributions of new benchmarks or new model results for existing benchmarks!
If you want to add a result for a model (e.g., "DeepSeek") to an existing benchmark (e.g., `flappy`):
- Create a folder named `deepseek` inside `flappy/`.
- Add the generated `index.html` file into `flappy/deepseek/`.
- (Optional) Add the model name to `flappy/modelnames.json`.
- Run `./create_config.sh` to update the site.
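These steps can be sketched as shell commands. The snippet below runs in a scratch directory so it is self-contained; the model name, HTML content, and display name are placeholders, and in a real contribution you would run the equivalent commands from the repository root.

```shell
#!/usr/bin/env sh
set -e
cd "$(mktemp -d)"                                             # scratch stand-in for the repo root
mkdir -p flappy/deepseek                                      # 1. folder for the new model
printf '<!doctype html>\n' > flappy/deepseek/index.html       # 2. the model-generated page (placeholder)
printf '{"deepseek": "DeepSeek"}\n' > flappy/modelnames.json  # 3. optional display name (created fresh here)
# 4. ./create_config.sh    (run from the real repo root to update the site)
```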
- Create a new root folder (e.g., `tetris/`).
- Add a `prompt.txt` with the prompt you used.
- Add folders for each model you tested.
- Run `./create_config.sh`.
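Scaffolding a new benchmark looks much the same. Again this is a sketch in a scratch directory; the benchmark name, model folders, and prompt text are placeholders.

```shell
#!/usr/bin/env sh
set -e
cd "$(mktemp -d)"                               # scratch stand-in for the repo root
mkdir -p tetris/gemini tetris/claude            # one folder per model tested
printf 'Build a Tetris clone in a single HTML file.\n' > tetris/prompt.txt
# then: ./create_config.sh   (from the real repo root)
```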
- Clone the repository.
- To view the site, use any static server, such as `npx http-server` or `python -m http.server`.
- After adding new folders or files, run:

```bash
bash create_config.sh
```
See LICENSE.md for details.