Coding LLM Benchmark

โ† Back to leaderboard

Pricing & capacity

Avg cost: $1.29 per run
Input price: $3.00 / 1M tokens
Output price: $15.00 / 1M tokens
Context window: 1M tokens
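
As a rough illustration of how the per-token prices translate into a per-run cost, here is a minimal Python sketch. The function name `run_cost` and the token counts are hypothetical: this page does not report token usage, and the counts below were chosen only so that the arithmetic lands on the listed $1.29 average. The sketch also assumes flat pricing, ignoring any caching discounts or long-context tiers a provider may apply.

```python
# Prices as listed on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 3.00
OUTPUT_PRICE_PER_M = 15.00


def run_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one run in USD, assuming flat per-token pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical token mix (NOT from the benchmark): 300k input, 26k output.
example_cost = run_cost(300_000, 26_000)
print(f"${example_cost:.2f} per run")
```

Because output tokens cost five times as much as input tokens, a run's cost can be dominated by either side depending on how much the agent reads versus writes.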

Capabilities

Vision: Yes
Reasoning: Yes
Tool calls: Yes
Cursor: Yes
OpenRouter: Yes

Agent scores

Overall: 43.3%
Issue Resolution: 74.4%
Frontend: 30.9%
Greenfield: 43.8%
Testing: 54.0%
Information Gathering: 13.3%
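
The page does not say how the overall score is aggregated, but the listed figure is consistent with an unweighted mean of the five category scores. The short Python check below is a sketch under that assumption; the dictionary name `category_scores` is illustrative.

```python
# Category scores as listed on this page (percent).
category_scores = {
    "Issue Resolution": 74.4,
    "Frontend": 30.9,
    "Greenfield": 43.8,
    "Testing": 54.0,
    "Information Gathering": 13.3,
}

# Assumption: the overall score is a plain (unweighted) average of the categories.
overall = sum(category_scores.values()) / len(category_scores)
print(f"Overall: {overall:.1f}%")
```

The mean works out to 43.28, which rounds to the listed 43.3%. If the leaderboard actually weights categories differently, the match here would be coincidental.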

Notes

Latest Sonnet. Strong issue resolution (74.4%) and greenfield (43.8%) on OpenHands.