Creative Showcases
Standardized creative HTML generation tasks. Each model receives the same prompt, and a judge model evaluates the output on visual quality, interactivity, code completeness, and creativity. These results are separate from the main public-benchmark leaderboard scores.
4 Completed results
3 Models with results
Creative Scoreboard
Average across completed creative showcases

| # | Model | Creative Avg | Showcases |
|---|---|---|---|
| 1 | OpenAI: gpt-oss-120b (OpenAI) | 7.8/10 | 1 |
| 2 | Llama 3.3 70B (Meta) | 7.7/10 | 2 |
| 3 | Llama 4 Scout (Meta) | 3.6/10 | 1 |
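
As a rough illustration of how a scoreboard like this could be derived, the sketch below averages per-showcase judge scores for each model and ranks the models by that average. The function name `creative_scoreboard`, the tuple layout, and the sample scores are all assumptions made for illustration; the page does not publish its aggregation code or per-showcase scores.

```python
from collections import defaultdict


def creative_scoreboard(results):
    """Rank models by mean judge score over their completed showcases.

    `results` is an iterable of (model, showcase, score) tuples; this
    layout is a hypothetical one, not the site's actual data format.
    """
    by_model = defaultdict(list)
    for model, _showcase, score in results:
        by_model[model].append(score)

    # One row per model: (name, average rounded to one decimal, showcase count)
    rows = [
        (model, round(sum(scores) / len(scores), 1), len(scores))
        for model, scores in by_model.items()
    ]
    # Highest average first
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows


# Hypothetical scores for placeholder models, not taken from the page
sample = [
    ("model-a", "showcase-1", 8.0),
    ("model-a", "showcase-2", 7.0),
    ("model-b", "showcase-1", 6.0),
]

for rank, (model, avg, count) in enumerate(creative_scoreboard(sample), start=1):
    print(f"| {rank} | {model} | {avg}/10 | {count} |")
```

Under this scheme a model with a single completed showcase is ranked on that one score alone, so averages backed by different showcase counts are not strictly comparable.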
3D Safari Simulation
Creative-html: Generate an interactive 3D safari experience in a single HTML file using Three.js or pure CSS 3D.
Web3 Landing Page
Creative-html: Generate a complete, visually striking single-file HTML landing page for a fictional Web3 crypto project.