LMSpeed

The best API speed test tool

© 2026 LMSpeed. All Rights Reserved. Made by Nexmoe with ❤️

Data points: 80

Key Takeaways

The readout for GPT-4.1 and GPT-5.4, before the detailed comparison sheet.

Decision read

GPT-5.4

GPT-5.4 currently has the stronger profile: verified metric wins are split 2 (GPT-4.1) to 4 (GPT-5.4).

Evidence depth

80 data points

Includes 8 benchmark rows, 0 audit samples, and 9 provider examples.

Selection signal

Start with GPT-5.4

The charts below split 17 high-signal samples across speed, scores, and audit health.

Model compare

GPT-4.1 vs GPT-5.4

gpt-4-1-vs-gpt-5-4

| Field | Model A: GPT-4.1 | Model B: GPT-5.4 |
| --- | --- | --- |
| Overall leader | Contender | Leading |
| Verified metric wins | 2 wins | 4 wins |
| Where it leads | Average speed, First-token latency | Cheapest input price, Free providers, Provider coverage, Recent tests |
| Model metadata | GPT-4.1 exposes 1.0M tokens; notable signals: Text input, Image input, File input, Text output. | GPT-5.4 exposes 1.1M tokens; notable signals: Text input, Image input, File input, Text output. |
| Developer | OpenAI | OpenAI |
| Context window | 1.0M tokens | 1.1M tokens |
| Max output | No data | 128K tokens |
| Released | Apr 2025 | Mar 2026 |
| Input modalities | Image, Text, File | Text, Image, File |
| Output modalities | Text | Text |
| Features | Text input, Image input, File input, Text output, Tool calling, Structured outputs, JSON mode | Text input, Image input, File input, Text output, Tool calling, Structured outputs, JSON mode, Reasoning |
| Parameters | No data | No data |
| Tokenizer | GPT | GPT |
| Knowledge cutoff | 2024-06-30 | No data |
| OpenRouter ID | openai/gpt-4.1 | openai/gpt-5.4 |
| References | No data | No data |

Benchmark score comparison

Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.

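| Metric | OpenAI GPT-4.1 (value, rank) | OpenAI GPT-5.4 (value, rank) |
| --- | --- | --- |
| GPQA | 66.6% (#90) | 92.0% (#3) |
| SciCode | 38.1% (#58) | 56.6% (#3) |
| HLE | 4.6% (#90) | 41.6% (#4) |
| Time to first answer token | 0.64 s (#17) | 134.24 s (#111) |
| Time to first token | 0.64 s (#21) | 134.24 s (#110) |
| Input price | $2.00/M (#36) | $2.50/M (#37) |
| Output speed | 129.4 tok/s (#37) | 92.4 tok/s (#56) |
| Output price | $8.00/M (#40) | $15.00/M (#44) |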
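
The first-token and output-speed rows can be combined into a rough end-to-end estimate: total response time is approximately the time to first token plus the number of output tokens divided by output speed. The Python sketch below applies that simple model to the benchmark figures above. The function name `estimated_response_seconds` and the 1,000-token response length are illustrative assumptions, not LMSpeed measurements, and real timings will vary by provider and load.

```python
def estimated_response_seconds(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Rough total time: wait for the first token, then stream at a steady rate."""
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


# Time to first token (s) and output speed (tok/s), copied from the benchmark rows.
BENCHMARKS = {
    "GPT-4.1": {"ttft_s": 0.64, "tokens_per_s": 129.4},
    "GPT-5.4": {"ttft_s": 134.24, "tokens_per_s": 92.4},
}

if __name__ == "__main__":
    output_tokens = 1000  # hypothetical response length
    for name, row in BENCHMARKS.items():
        total = estimated_response_seconds(row["ttft_s"], row["tokens_per_s"], output_tokens)
        print(f"{name}: ~{total:.1f} s for {output_tokens} output tokens")
```

Under this model the first-token wait dominates for GPT-5.4, so the gap between the two models narrows only slowly as responses get longer.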

API audit comparison

Latest completed audits from shared providers, with four safety and integrity score groups plus report links.

No completed audits are available from shared providers yet.

Provider examples

Each provider row pairs speed aggregates with input/output pricing, to support real API selection and migration cost checks.

| Provider | Tests | GPT-4.1 model ID | GPT-4.1 speed / latency | GPT-4.1 input / output | GPT-5.4 model ID | GPT-5.4 speed / latency | GPT-5.4 input / output |
| --- | --- | --- | --- | --- | --- | --- | --- |
| SWT-API | 120 | | 81 tok/s / 1822 ms | No data | | N/A / N/A | No data |
| Sub2API | 20 | | N/A / N/A | No data | | 51 tok/s / 4032 ms | No data |
| Fengsili API | 10 | | N/A / N/A | No data | | 88 tok/s / 6075 ms | No data |
| QQ Code | 10 | | N/A / N/A | No data | | 55 tok/s / 2903 ms | No data |
| Sub2API | 10 | | N/A / N/A | No data | | 54 tok/s / 1369 ms | No data |
| CHB API | | gpt-4.1 | No data | $0.027/M / $0.110/M | gpt-5.4-xhigh | No data | $0.034/M / $0.205/M |
| ZEN-AI VIP | | gpt-4.1-2025-04-14 | No data | $0.082/M / $0.329/M | gpt-5.4-xhigh | No data | $0.103/M / $0.616/M |
| Zero API | | gpt-4.1 | No data | $0.137/M / $0.548/M | gpt-5.4 | No data | $0.171/M / $1.03/M |
| ModelPool | | gpt-4.1 | No data | $0.171/M / $0.686/M | gpt-5.4 | No data | $0.429/M / $3.43/M |
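
A migration cost check with these per-million-token prices is a matter of simple arithmetic: multiply input and output token counts by their respective prices and divide by one million. The Python sketch below does this for the ModelPool prices listed above. The helper name `workload_cost_usd` and the monthly workload of 10M input and 2M output tokens are hypothetical, chosen only for illustration.

```python
# Per-million-token prices in USD, copied from the ModelPool provider example.
MODELPOOL_PRICES = {
    "gpt-4.1": {"input": 0.171, "output": 0.686},
    "gpt-5.4": {"input": 0.429, "output": 3.43},
}


def workload_cost_usd(input_tokens: int, output_tokens: int, prices: dict) -> float:
    """Cost of a workload given per-million-token input and output prices."""
    return (input_tokens * prices["input"] + output_tokens * prices["output"]) / 1_000_000


if __name__ == "__main__":
    # Hypothetical monthly workload, not an LMSpeed figure.
    input_tokens, output_tokens = 10_000_000, 2_000_000
    for model_id, prices in MODELPOOL_PRICES.items():
        cost = workload_cost_usd(input_tokens, output_tokens, prices)
        print(f"{model_id}: ${cost:.2f}")
```

Because output tokens are priced several times higher than input tokens in every row, the result is sensitive to the assumed output share; substitute token counts from your own traffic before drawing conclusions.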

When to choose each model

This report only uses LMSpeed data for GPT-4.1 and GPT-5.4: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.

GPT-4.1

GPT-4.1 is stronger when you prioritize average speed and first-token latency.

GPT-5.4

GPT-5.4 is stronger when you prioritize cheapest input price, free providers, provider coverage, and recent tests.

FAQ

TL;DR: GPT-5.4 leads on 4 of the 6 verified comparison metrics. The report draws on 80 data points covering pricing, speed, latency, benchmarks, and provider examples.

Why is this comparison indexable?
It has 6 verifiable comparison points, and both models have pricing or benchmark data.
Are missing metrics invented?
No. Metrics without LMSpeed data are omitted from this report.

Related compare reports

Continue from GPT-4.1 vs GPT-5.4 into nearby model comparisons with enough verified LMSpeed data.

  • Claude: Claude Opus 4.6 vs GPT-4.1 (6 verified data points)
  • OpenAI: GPT-4.1 vs GPT-5 (6 verified data points)
  • OpenAI: GPT-4.1 vs GPT-5.2 (6 verified data points)
  • Gemini: Gemini 2.5 Pro vs GPT-4.1 (6 verified data points)

Data as of Jun 13, 2026, 04:33 PM. Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.