Coding LLM Benchmark

Pricing & capacity

Avg cost: $1.57 per run
Input price: $3.00 / 1M tokens
Output price: $15.00 / 1M tokens
Context window: 200K tokens
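
As a rough illustration of how the per-token prices translate into a per-run cost, the sketch below applies the listed input and output rates to a pair of token counts. The function name `run_cost` and the token counts are hypothetical, chosen only for the example; the benchmark does not publish per-run token usage, and a run may span several requests.

```python
# Listed prices, in USD per 1M tokens.
INPUT_PRICE_PER_M = 3.00
OUTPUT_PRICE_PER_M = 15.00


def run_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a run given its total token usage."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical usage (an assumption, not benchmark data):
# 400K input tokens and 25K output tokens summed across all requests in a run.
cost = run_cost(400_000, 25_000)
print(f"${cost:.3f}")
```

Because output tokens cost five times as much as input tokens, runs that generate long outputs will be priced well above runs that mostly read existing code.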

Capabilities

Vision: Yes
Reasoning: Yes
Tool calls: Yes
Cursor: Yes
OpenRouter: Yes

Agent scores

Overall: 53.0%
Issue Resolution: 74.2%
Frontend: 36.8%
Greenfield: 12.5%
Testing: 68.8%
Information Gathering: 72.7%
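
The Overall figure is consistent with an unweighted mean of the five category scores. The short check below shows the arithmetic; that the benchmark actually defines Overall this way is an assumption, since the page does not state its aggregation method.

```python
from statistics import mean

# Category scores as listed above, in percent.
scores = {
    "Issue Resolution": 74.2,
    "Frontend": 36.8,
    "Greenfield": 12.5,
    "Testing": 68.8,
    "Information Gathering": 72.7,
}

# Assumption: Overall is the plain (unweighted) average of the categories.
overall = round(mean(scores.values()), 1)
print(f"Overall: {overall}%")
```

Under that assumption, the low Greenfield score pulls the average down by a wide margin: the other four categories alone average above 60%.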

Notes

Strong issue resolution (74.2%) at a lower price than Opus. Very weak on greenfield work (12.5%), so it is best suited to maintaining and debugging existing codebases.