eXalt Value presents
The Leaderboard of LLM Coding
Compare the models of coding by scores and prices
Find the ideal coding model
Top models across programming benchmarks
Overall score
Leaderboard average
Score (%)
30%35%40%45%50%55%60%65%70%75%80%85%
Issue Resolution
Fixing GitHub bugs
Score (%)
30%35%40%45%50%55%60%65%70%75%80%85%
Frontend
UI with visual context
Score (%)
30%35%40%45%50%55%60%65%70%75%80%85%
Greenfield
Apps from scratch
Score (%)
30%35%40%45%50%55%60%65%70%75%80%85%
Testing
Test generation and quality
Score (%)
30%35%40%45%50%55%60%65%70%75%80%85%
Information Gathering
Research and retrieval
Score (%)
30%35%40%45%50%55%60%65%70%75%80%85%
Cost / Performance
Overall Cost/Performance
Average score vs. average cost per problem (USD). Lower-right is better value.
Explore
Agentic Leaderboard
Full rankings of coding AI agents: issue resolution, frontend, greenfield, testing and information gathering. Powered by OpenHands Index data.
View full leaderboardBusiness Benchmarks
Compare LLMs on real-world business tasks: translation quality, long document analysis, cognitive reasoning, and spreadsheet intelligence.
View business benchmarksLLM Leaderboard
| Model | Value | In $/1M | Out $/1M | LiveCode | Aider | SWE | BFCL | Votes ↓ | Context |
|---|---|---|---|---|---|---|---|---|---|
|
|
— | $5.00 | $25.00 | n/a | n/a | n/a | 77.47% | 1561 | 200K |
|
|
Low | $5.00 | $25.00 | 73.8% | 89.4% | 80.9% | 73.24% | 1469 | 200K |
|
|
— | $0.30 | $1.20 | n/a | n/a | n/a | 57.51% | 1453 | 1M |
|
|
Mid | $2.00 | $12.00 | 79.7% | n/a | 76.2% | 66.46% | 1444 | 10M |
|
|
Good value | $0.40 | $1.75 | 83.1% | 59.1% | 71.3% | 59.42% | 1442 | 256K |
|
|
Good value | $0.50 | $3.00 | 79.7% | n/a | n/a | 60.61% | 1441 | 1M |
|
|
Low | $1.75 | $14.00 | 66.9% | n/a | 80.0% | 63.01% | 1395 | 400K |
|
|
Good value | $1.25 | $10.00 | 84.6% | 88.0% | 74.9% | 66.21% | 1393 | 400K |
|
|
Low | $3.00 | $15.00 | 59.0% | n/a | 82.0% | 60.67% | 1386 | 200K |
|
|
Best value | $0.27 | $1.10 | 89.6% | 74.2% | n/a | 62.11% | 1371 | 128K |
|
|
Mid | $1.25 | $10.00 | 84.9% | n/a | 76.3% | 65.18% | 1328 | 200K |
|
|
Best value | $0.27 | $0.41 | 59.3% | 70.2% | n/a | 52.56% | 1315 | 128K |
|
|
— | $1.00 | $5.00 | n/a | n/a | 73.3% | 54.84% | 1305 | 200K |
|
|
— | $2.00 | $6.00 | n/a | n/a | n/a | 39.17% | 1223 | 131K |
|
|
Mid | $1.25 | $10.00 | 69.0% | 82.2% | 59.6% | 54.41% | 1205 | 1M |
|
|
— | $3.00 | $15.00 | n/a | n/a | n/a | n/a | n/a | 1M |
|
|
— | $1.75 | $14.00 | n/a | n/a | n/a | n/a | n/a | 400K |
|
|
— | $2.00 | $12.00 | n/a | n/a | n/a | n/a | n/a | 10M |
|
|
— | $0.40 | $1.75 | n/a | n/a | n/a | n/a | n/a | 256K |
|
|
— | $0.23 | $0.90 | n/a | n/a | n/a | n/a | n/a | 197K |
|
|
— | $0.90 | $0.90 | n/a | n/a | n/a | n/a | n/a | 262K |
|
|
— | $0.38 | $1.75 | n/a | n/a | n/a | n/a | n/a | 203K |
|
|
Best value | $0.25 | $2.00 | 83.8% | n/a | n/a | 58.29% | n/a | 200K |
|
|
Good value | $2.00 | $8.00 | 80.8% | 81.3% | 69.1% | 68.09% | n/a | 200K |
|
|
Low | $3.00 | $15.00 | 79.0% | 79.6% | 75.0% | 62.9% | n/a | 256K |
|
|
Best value | $0.15 | $0.60 | 63.5% | 55.1% | n/a | 45.18% | n/a | 1M |
|
|
Mid | $2.00 | $8.00 | 52.0% | 52.4% | 55.0% | 50.18% | n/a | 1M |
■ Best
■ Good
■ Mid
■ Low