Same prompt · many models · raced side by side
Benchmarks
Little races I run most weeks: give every model the same prompt, the same machine, the same tools — then watch them build. The club gets them first.

A blank desktop → an interactive multi-page space site
GLM-5.2 vs MiniMax-M3 vs Kimi 2.7, each on its own fresh Linux desktop: build a home page with an animated starfield hero plus a /destinations page, linked by a real nav. The fastest wrote the least code — and still shipped both pages.
Watch the race →
A blank desktop → a cyberpunk landing page
Kimi 2.7 vs MiniMax-M3 vs GLM-5.2, each on its own fresh Linux desktop: terminal on the left, the page building live on the right. Three premium pages, a 2.5× spread in time.
Watch the race →
A blank Linux desktop → a playable game
GLM-5.2 vs Kimi-K2.6 vs MiniMax-M3, each on its own fresh Linux desktop, build a cyberpunk Asteroids game from scratch. One did it in 51 seconds.
Watch the race →
One prompt → an interactive solar system
Three coding agents, one prompt, one single-file interactive solar system — raced side by side and sped up.
Watch the race →