Data points: 72
The readout for GPT-5 and Grok-3, before the detailed comparison sheet.
Decision read
GPT-5
GPT-5 currently has the stronger profile, with verified wins split 4 to 1.
Evidence depth
72 data points
Includes 8 benchmark rows, 0 audit samples, and 9 provider examples.
Selection signal
Start with GPT-5
The charts below split 17 high-signal samples across speed, scores, and audit health.
Model compare GPT-5 vs Grok-3gpt-5-vs-grok-3 | Model A GPT-5 | Model B Grok-3 |
|---|---|---|
| Overall leader | Leading | Contender |
| Verified metric wins | 4 wins | 1 wins |
| Where it leads | Cheapest input price, Average speed, Provider coverage, Recent tests | First-token latency |
Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.
| Metric | GPT-5 | Grok-3 |
|---|---|---|
| AIME | 95.7%#1 |
Latest completed audits from shared providers, with four safety and integrity score groups plus report links.
| Provider | GPT-5 | Grok-3 |
|---|---|---|
| No completed audits are available from shared providers yet. | ||
Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.
| Provider | GPT-5 | Grok-3 |
|---|---|---|
20 tests | GPT-5 speed / latency 95 tok/s / 1234ms input / output No data | Grok-3 |
This report only uses LMSpeed data for GPT-5 and Grok-3: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.
| Guidance | GPT-5 | Grok-3 |
|---|---|---|
| When to choose each model | GPT-5 GPT-5 is stronger when you prioritize Cheapest input price, Average speed, Provider coverage, Recent tests. | Grok-3 Grok-3 is stronger when you prioritize First-token latency. |
TL;DR: GPT-5 leads across 72 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.
Continue from GPT-5 vs Grok-3 into nearby model comparisons with enough verified LMSpeed data.
| Model metadata | GPT-5 exposes 128K tokens; notable signals: Text input, Image input, File input, Text output. | No OpenRouter metadata is available yet for this model. |
|---|---|---|
| Developer | OpenAI | No data |
| Context window | 128K tokens | No data |
| Max output | 16.4K tokens | No data |
| Released | Aug 2025 | No data |
| Modalities | Input FileImageText Output Text | No data |
| Features | Text inputImage inputFile inputText outputStructured outputsJSON mode | None listed |
| Parameters | No data | No data |
| Tokenizer | GPT | No data |
| Knowledge cutoff | 2024-09-30 | No data |
| OpenRouter ID | openai/gpt-5-chat | No data |
| References | No data | No data |
| MATH-500 | 99.4%#1 | 87.0%#36 |
|---|
| MMLU-Pro | 87.1%#7 | 79.9%#41 |
|---|
| LiveCodeBench | 84.6%#11 | 42.5%#63 |
|---|
| HLE | 26.5%#24 | 5.1%#86 |
|---|
| Input price | $1.25/M#30 | $4.00/M#40 |
|---|
| GPQA | 85.4%#31 | 69.3%#80 |
|---|
| SciCode | 42.9%#31 | 36.8%#67 |
|---|
speed / latency
N/A / N/A
input / output
No data
Undy API10 tests | GPT-5 speed / latency 131 tok/s / 743ms input / output No data | Grok-3 speed / latency 19 tok/s / 4518ms input / output No data |
|---|
N1N5 tests | GPT-5 speed / latency 102 tok/s / 948ms input / output No data | Grok-3 speed / latency N/A / N/A input / output No data |
|---|
星见雅 API5 tests | GPT-5 speed / latency 38 tok/s / 15844ms input / output No data | Grok-3 speed / latency N/A / N/A input / output No data |
|---|
0CHAT0 tests | GPT-5 speed / latency N/A / N/A input / output No data | Grok-3 speed / latency N/A / N/A input / output No data |
|---|
GPT-5 gpt-5-openai-compact speed / latency No data input / output $75.00/M / $600.00/M | Grok-3 grok-3 speed / latency No data input / output $0/M / $0/M |
GPT-5 gpt-5 speed / latency No data input / output $0.0006/M / $0.0050/M | Grok-3 grok-3 speed / latency No data input / output $0.037/M / $0.037/M |
GPT-5 gpt-5 speed / latency No data input / output $0.0006/M / $0.0050/M | Grok-3 grok-3 speed / latency No data input / output $0.037/M / $0.037/M |
GPT-5 gpt-5-all-c speed / latency No data input / output $0.0082/M / $0.0082/M | Grok-3 grok-3 speed / latency No data input / output $0.0014/M / $0.0014/M |
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.