Coding LLM Benchmark

โ† Back to leaderboard

Pricing & capacity

Avg cost
$1.43 per run
Input price
$1.75 / 1M tokens
Output price
$14.00 / 1M tokens
Context window
400K

Capabilities

Vision
Yes
Reasoning
Yes
Tool calls
Yes
Cursor
Yes
OpenRouter
Yes

Agent scores

Overall
59.5%
Issue Resolution
73.8%
Frontend
35.9%
Greenfield
50.0%
Testing
67.0%
Information Gathering
70.9%

Notes

Strong greenfield and information gathering; solid pick for building new projects from scratch. Versatile code agent for end-to-end development.