Coding LLM Benchmark


Pricing & capacity

Avg cost: $1.42 per run
Input price: $2.00 / 1M tokens
Output price: $12.00 / 1M tokens
Context window: 10M tokens
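
As a rough illustration of how the per-token prices translate into a per-run cost, here is a minimal Python sketch. The prices are the ones listed above; the function name `run_cost` and the token counts in the example are hypothetical, since the page does not report how many tokens a run consumes. The chosen split is only one of many that would land on the listed average.

```python
# Prices copied from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 12.00


def run_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one run at the listed per-token prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical token counts, chosen for illustration only;
# the benchmark page does not publish per-run token usage.
example = run_cost(input_tokens=500_000, output_tokens=35_000)
print(f"${example:.2f}")  # prints "$1.42"
```

Because output tokens cost six times as much as input tokens, a run that generates a lot of code can cost more than one that only reads a large context.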

Capabilities

Vision: Yes
Reasoning: Yes
Tool calls: Yes
Cursor: Yes
OpenRouter: No

Agent scores

Overall: 49.0%
Issue Resolution: 70.6%
Frontend: 36.8%
Greenfield: 25.0%
Testing: 68.6%
Information Gathering: 44.2%
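
The page does not say how the Overall score is derived. One plausible reading, shown here purely as an assumption, is an unweighted mean of the five category scores, which happens to reproduce the listed figure. The names `CATEGORY_SCORES` and `unweighted_overall` are illustrative.

```python
from statistics import mean

# Category scores copied from the listing above (percent).
CATEGORY_SCORES = {
    "Issue Resolution": 70.6,
    "Frontend": 36.8,
    "Greenfield": 25.0,
    "Testing": 68.6,
    "Information Gathering": 44.2,
}


def unweighted_overall(scores: dict[str, float]) -> float:
    """Assumed aggregation: plain average of category scores, to one decimal."""
    return round(mean(list(scores.values())), 1)


print(unweighted_overall(CATEGORY_SCORES))  # prints 49.0
```

If the benchmark actually weights categories by task count, the match here would be a coincidence, so treat this as a sanity check and not as the scoring method.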

Notes

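Strong at issue resolution (70.6%) and testing (68.6%), backed by a 10M-token context window. Weaker at information gathering (44.2%), which is surprising given that context size, and weakest at frontend (36.8%) and greenfield (25.0%) work. Best suited to navigating large codebases and fixing complex bugs.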