Coding LLM Benchmark

Pricing & capacity

Input price: $2.00 / 1M tokens
Output price: $8.00 / 1M tokens
Context window: 200K tokens
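
As a rough illustration of how the per-token prices translate into a per-request cost, here is a minimal Python sketch. The two prices come from this card; the function name `estimate_cost` and the example token counts are hypothetical, and the sketch assumes a flat linear rate with no caching discounts, batch pricing, or other surcharges.

```python
# Prices taken from the card above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 8.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat linear pricing: no caching discounts or surcharges.
    """
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Example: a 50,000-token prompt with a 2,000-token completion.
print(f"${estimate_cost(50_000, 2_000):.3f}")  # prints $0.116
```

Under these assumptions, output tokens cost four times as much as input tokens, so long completions can dominate the bill even when the prompt is much larger.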

Capabilities

Vision: Yes
Reasoning: Yes
Tool calls: Yes
Cursor: Yes
OpenRouter: Yes

Coding benchmarks

LiveCodeBench: 80.8%
Aider Polyglot: 81.3%
SWE-bench Verified: 69.1%
BFCL (Tool use): 68.09%
Code Arena Elo: (not listed)

Notes

Reasoning model. The Aider Polyglot score of 81.3% was obtained at the high reasoning-effort setting.