LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Model Pricing
  • Model Speed
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
Back to models

Data points: 81

On this page

Key TakeawaysComparison sheetBenchmark score comparisonAPI audit comparisonProvider examplesWhen to choose each modelFAQRelated compare reports

Key Takeaways

The readout for GPT-5.4 and Qwen3 Max, before the detailed comparison sheet.

Decision read

GPT-5.4

GPT-5.4 currently has the stronger profile, with verified wins split 6 to 0.

Evidence depth

81 data points

Includes 8 benchmark rows, 0 audit samples, and 8 provider examples.

Selection signal

Start with GPT-5.4

The charts below split 16 high-signal samples across speed, scores, and audit health.

Model compare

GPT-5.4 vs Qwen3 Max

gpt-5-4-vs-qwen3-max

Model A

OpenAI

GPT-5.4

Model B

Qwen

Qwen3 Max

Overall leaderLeadingContender
Verified metric wins6 wins0 wins
Where it leadsCheapest input price, Average speed, First-token latency, Free providers, Provider coverage, Recent tests

Benchmark score comparison

Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.

Metric
OpenAIGPT-5.4
QwenQwen3 Max
GPQA
92.0%#3

API audit comparison

Latest completed audits from shared providers, with four safety and integrity score groups plus report links.

Provider
OpenAIGPT-5.4
QwenQwen3 Max
No completed audits are available from shared providers yet.

Provider examples

Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.

Provider
OpenAIGPT-5.4
QwenQwen3 Max
10 tests
OpenAI

GPT-5.4

speed / latency

88 tok/s / 6075ms

input / output

No data

Qwen

When to choose each model

This report only uses LMSpeed data for GPT-5.4 and Qwen3 Max: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.

Guidance
OpenAIGPT-5.4
QwenQwen3 Max
When to choose each model

GPT-5.4

GPT-5.4 is stronger when you prioritize Cheapest input price, Average speed, First-token latency, Free providers, Provider coverage, Recent tests.

Qwen3 Max

Qwen3 Max does not clearly lead on the verified metrics here, so check provider-specific pricing before choosing.

FAQ

TL;DR: GPT-5.4 leads across 81 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.

Why is this comparison indexable?
It has 6 verifiable comparison points, and both models have pricing or benchmark data.
Are missing metrics invented?
No. Metrics without LMSpeed data are omitted from this report.

Related compare reports

Continue from GPT-5.4 vs Qwen3 Max into nearby model comparisons with enough verified LMSpeed data.

No data
Model metadataGPT-5.4 exposes 1.1M tokens; notable signals: Text input, Image input, File input, Text output.Qwen3 Max exposes 262.1K tokens; notable signals: Text input, Text output, Tool calling, JSON mode.
DeveloperOpenAIAlibaba
Context window1.1M tokens262.1K tokens
Max output128K tokens32.8K tokens
ReleasedMar 2026Sep 2025
Modalities

Input

TextImageFile

Output

Text

Input

Text

Output

Text
Features
Text inputImage inputFile inputText outputTool callingStructured outputsJSON modeReasoning
Text inputText outputTool callingJSON mode
ParametersNo dataNo data
TokenizerGPTQwen3
Knowledge cutoffNo data2025-06-30
OpenRouter IDopenai/gpt-5.4qwen/qwen3-max
ReferencesNo dataNo data
76.4%
#66
SciCode
56.6%#3
38.3%#57
HLE
41.6%#4
11.1%#63
Input price
$2.50/M#37
$1.66/M#33
Output price
$15.00/M#44
$7.22/M#39
Time to first answer token
134.24 s#111
1.90 s#42
Output speed
92.4 tok/s#56
53.9 tok/s#83
Blended price
$5.63/M#63
$3.05/M#57

Qwen3 Max

speed / latency

N/A / N/A

input / output

No data

TradingBase API10 tests
OpenAI

GPT-5.4

speed / latency

71 tok/s / 1629ms

input / output

No data

Qwen

Qwen3 Max

speed / latency

33 tok/s / 50281ms

input / output

No data

小水管 API10 tests
OpenAI

GPT-5.4

speed / latency

44 tok/s / 2518ms

input / output

No data

Qwen

Qwen3 Max

speed / latency

74 tok/s / 603ms

input / output

No data

6i25 tests
OpenAI

GPT-5.4

speed / latency

26 tok/s / 54772ms

input / output

No data

Qwen

Qwen3 Max

speed / latency

N/A / N/A

input / output

No data

6i2 API5 tests
OpenAI

GPT-5.4

speed / latency

428 tok/s / 4171ms

input / output

No data

Qwen

Qwen3 Max

speed / latency

N/A / N/A

input / output

No data

DNSHE
OpenAI

GPT-5.4

gpt-5.4-openai-compact

speed / latency

No data

input / output

$75.00/M / $450.00/M

Qwen

Qwen3 Max

qwen3-max-2026-01-23

speed / latency

No data

input / output

$0/M / $0/M

ZetaTechs API
OpenAI

GPT-5.4

gpt-5.4-free

speed / latency

No data

input / output

$0/M / $0/M

Qwen

Qwen3 Max

qwen3-max-2026-01-23

speed / latency

No data

input / output

$10.00/M / $40.00/M

紫脑喵
OpenAI

GPT-5.4

gpt-5.4-high

speed / latency

No data

input / output

$0.050/M / $0.050/M

Qwen

Qwen3 Max

qwen3-max

speed / latency

No data

input / output

$0.010/M / $0.010/M

Fengsili API
ClaudeClaude Opus 4.6 vs GPT-5.46 verified data points
OpenAIGPT-5 vs GPT-5.46 verified data points
OpenAIGPT-5.2 vs GPT-5.46 verified data points
GeminiGemini 2.5 Pro vs GPT-5.46 verified data points

Data as of Jun 13, 2026, 03:03 PM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.