Data points: 78
The readout for GPT-4 and GPT-4o, before the detailed comparison sheet.
Decision read
GPT-4o
GPT-4o currently has the stronger profile, with verified wins split 1 to 4.
Evidence depth
78 data points
Includes 6 benchmark rows, 0 audit samples, and 7 provider examples.
Selection signal
Start with GPT-4o
The charts below split 13 high-signal samples across speed, scores, and audit health.
Model compare GPT-4 vs GPT-4ogpt-4-vs-gpt-4o | Model A GPT-4 | Model B GPT-4o |
|---|---|---|
| Overall leader | Contender | Leading |
| Verified metric wins | 1 wins | 4 wins |
| Where it leads | First-token latency | Cheapest input price, Average speed, Provider coverage, Recent tests |
Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.
| Metric | GPT-4 | GPT-4o |
|---|---|---|
| Time to first answer token | 1.07 s#25 |
Latest completed audits from shared providers, with four safety and integrity score groups plus report links.
| Provider | GPT-4 | GPT-4o |
|---|---|---|
| No completed audits are available from shared providers yet. | ||
Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.
| Provider | GPT-4 | GPT-4o |
|---|---|---|
25 tests | GPT-4 speed / latency 42 tok/s / 819ms input / output No data | GPT-4o |
This report only uses LMSpeed data for GPT-4 and GPT-4o: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.
| Guidance | GPT-4 | GPT-4o |
|---|---|---|
| When to choose each model | GPT-4 GPT-4 is stronger when you prioritize First-token latency. | GPT-4o GPT-4o is stronger when you prioritize Cheapest input price, Average speed, Provider coverage, Recent tests. |
TL;DR: GPT-4o leads across 78 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.
Continue from GPT-4 vs GPT-4o into nearby model comparisons with enough verified LMSpeed data.
| Model metadata | GPT-4 exposes 8.2K tokens; notable signals: Text input, Text output, Tool calling, Structured outputs. | GPT-4o exposes 128K tokens; notable signals: Text input, Image input, File input, Text output. |
|---|---|---|
| Developer | OpenAI | OpenAI |
| Context window | 8.2K tokens | 128K tokens |
| Max output | 4.1K tokens | 16.4K tokens |
| Released | May 2023 | Nov 2024 |
| Modalities | Input Text Output Text | Input TextImageFile Output Text |
| Features | Text inputText outputTool callingStructured outputsJSON mode | Text inputImage inputFile inputText outputTool callingStructured outputsJSON modeWeb search |
| Parameters | No data | No data |
| Tokenizer | GPT | GPT |
| Knowledge cutoff | 2021-09-30 | 2023-10-31 |
| OpenRouter ID | openai/gpt-4 | openai/gpt-4o-2024-11-20 |
| References | No data | No data |
| Time to first token | 1.07 s#40 | 0.51 s#11 |
|---|
| Input price | $30.00/M#46 | $2.50/M#37 |
|---|
| Output speed | 39.3 tok/s#101 | 125.9 tok/s#39 |
|---|
| Output price | $60.00/M#50 | $10.00/M#41 |
|---|
| Blended price | $37.50/M#75 | $4.38/M#60 |
|---|
speed / latency
105 tok/s / 1970ms
input / output
No data
速创API20 tests | GPT-4 speed / latency N/A / N/A input / output No data | GPT-4o speed / latency 88 tok/s / 1593ms input / output No data |
|---|
AZ Rix5 tests | GPT-4 speed / latency 28 tok/s / 668ms input / output No data | GPT-4o speed / latency N/A / N/A input / output No data |
|---|
KFCV505 tests | GPT-4 gpt-4-all speed / latency N/A / N/A input / output $0.014/M / $0.014/M | GPT-4o gpt-4o-all speed / latency 48 tok/s / 13138ms input / output $0.685/M / $2.05/M |
|---|
YUNWU API5 tests | GPT-4 speed / latency N/A / N/A input / output No data | GPT-4o speed / latency 58 tok/s / 2583ms input / output No data |
|---|
GPT-4 gpt-4-all speed / latency No data input / output $0.014/M / $0.014/M | GPT-4o gpt-4o-2024-05-13 speed / latency No data input / output $0.685/M / $2.05/M |
GPT-4 gpt-4-all speed / latency No data input / output $0.014/M / $0.014/M | GPT-4o gpt-4o-2024-05-13 speed / latency No data input / output $0.685/M / $2.05/M |
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.