Gemini 3 Pro — Coding LLM Benchmark

← Back to leaderboard

Official links

Official model page
Google website

Pricing & capacity

Avg cost: $1.42 per run
Input price: $2.00 / 1M tokens
Output price: $12.00 / 1M tokens
Context window: 10M

Capabilities

Vision: Yes
Reasoning: Yes
Tool calls: Yes
Cursor: Yes
OpenRouter: No

Agent scores

Overall: 49.0%
Issue Resolution: 70.6%
Frontend: 36.8%
Greenfield: 25.0%
Testing: 68.6%
Information Gathering: 44.2%

Notes

Good issue resolution and testing with a 10M-token context window. Surprisingly weak at information gathering. Best for large codebase navigation and complex bug fixing.