Coding LLM Benchmark

โ† Back to leaderboard

Pricing & capacity

Avg cost
$1.14 per run
Input price
$5.00 / 1M tokens
Output price
$25.00 / 1M tokens
Context window
200K

Capabilities

Vision
Yes
Reasoning
Yes
Tool calls
Yes
Cursor
Yes
OpenRouter
Yes

Agent scores

Overall
66.7%
Issue Resolution
76.8%
Frontend
41.8%
Greenfield
56.2%
Testing
78.8%
Information Gathering
80.0%

Notes

Top overall agentic performer. Excels at testing and information gathering with the strongest greenfield capability in its class. Premium pricing justified by top-tier performance across all five agent categories.