Coding LLM Benchmark

← Back to dashboard

Grok 4

xAI · USA

Pricing & capacity

Input price: $3.00 / 1M tokens
Output price: $15.00 / 1M tokens
Context window: 256K

Capabilities

Vision: Yes
Reasoning: Yes
Tool calls: Yes
Cursor: No
OpenRouter: Yes

Coding benchmarks

LiveCodeBench: 79.0%
Aider Polyglot: 79.6%
SWE-bench Verified: 75.0%
BFCL (Tool use): 62.9%
Code Arena Elo: —

Notes

Strong across coding benchmarks. LiveCode 79%, SWE 75%, Aider 79.6% (Vellum/Aider).