Coding LLM Benchmark

โ† Back to leaderboard

Pricing & capacity

Avg cost
$0.64 per run
Input price
$0.50 / 1M tokens
Output price
$3.00 / 1M tokens
Context window
1M

Capabilities

Vision
Yes
Reasoning
Yes
Tool calls
Yes
Cursor
Yes
OpenRouter
No

Agent scores

Overall
49.0%
Issue Resolution
74.6%
Frontend
22.1%
Greenfield
18.8%
Testing
70.7%
Information Gathering
58.8%

Notes

Strong issue resolution and testing at low cost. Weak frontend; excellent value for automated CI/CD pipelines and backend triage.