LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Model Pricing
  • Model Speed
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
Back to models

Data points: 86

On this page

Key TakeawaysComparison sheetBenchmark score comparisonAPI audit comparisonProvider examplesWhen to choose each modelFAQRelated compare reports

Key Takeaways

The readout for Gemini 2.5 Flash Lite and GPT-5.4, before the detailed comparison sheet.

Decision read

GPT-5.4

GPT-5.4 currently has the stronger profile, with verified wins split 2 to 4.

Evidence depth

86 data points

Includes 8 benchmark rows, 1 audit samples, and 8 provider examples.

Selection signal

Start with GPT-5.4

The charts below split 17 high-signal samples across speed, scores, and audit health.

Model compare

Gemini 2.5 Flash Lite vs GPT-5.4

gemini-2-5-flash-lite-vs-gpt-5-4

Model A

Gemini

Gemini 2.5 Flash Lite

Model B

OpenAI

GPT-5.4

Overall leaderContenderLeading
Verified metric wins2 wins4 wins
Where it leadsAverage speed, First-token latencyCheapest input price, Free providers, Provider coverage, Recent tests
Model metadataGemini 2.5 Flash Lite exposes 1.0M tokens; notable signals: Text input, Image input, File input, Audio input.GPT-5.4 exposes 1.1M tokens; notable signals: Text input, Image input, File input, Text output.
DeveloperGoogleOpenAI
Context window1.0M tokens1.1M tokens
Max output65.5K tokens128K tokens
ReleasedSep 2025Mar 2026
Modalities

Input

TextImageFileAudiovideo

Output

Text

Input

TextImageFile

Output

Text
Features
Text inputImage inputFile inputAudio inputText outputTool callingStructured outputsJSON modeReasoning
Text inputImage inputFile inputText outputTool callingStructured outputsJSON modeReasoning
ParametersNo dataNo data
TokenizerGeminiGPT
Knowledge cutoff2025-01-31No data
OpenRouter IDgoogle/gemini-2.5-flash-lite-preview-09-2025openai/gpt-5.4
ReferencesNo dataNo data

Benchmark score comparison

Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.

Metric
GeminiGemini 2.5 Flash Lite
OpenAIGPT-5.4
Time to first answer token
0.37 s#2
134.24 s#111
GPQA
47.4%#122
92.0%#3
SciCode
17.7%#112
56.6%#3
Time to first token
0.37 s#3
134.24 s#110
HLE
3.7%#98
41.6%#4
Input price
$0.100/M#6
$2.50/M#37
Output price
$0.400/M#6
$15.00/M#44
Output speed
229.0 tok/s#7
92.4 tok/s#56

API audit comparison

Latest completed audits from shared providers, with four safety and integrity score groups plus report links.

Provider
GeminiGemini 2.5 Flash Lite
OpenAIGPT-5.4
钠 APIWinner: GPT-5.4
Gemini

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite

No audit yet

OpenAI

GPT-5.4

gpt-5.4

Audit score

90

View report
1008476100

Provider examples

Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.

Provider
GeminiGemini 2.5 Flash Lite
OpenAIGPT-5.4
SkyAI60 tests
Gemini

Gemini 2.5 Flash Lite

speed / latency

276 tok/s / 1323ms

input / output

No data

OpenAI

GPT-5.4

speed / latency

N/A / N/A

input / output

No data

天宫造物50 tests
Gemini

Gemini 2.5 Flash Lite

speed / latency

N/A / N/A

input / output

No data

OpenAI

GPT-5.4

speed / latency

50 tok/s / 7305ms

input / output

No data

Rnglg2 API30 tests
Gemini

Gemini 2.5 Flash Lite

speed / latency

235 tok/s / 2518ms

input / output

No data

OpenAI

GPT-5.4

speed / latency

N/A / N/A

input / output

No data

GankInterview LLM15 tests
Gemini

Gemini 2.5 Flash Lite

speed / latency

262 tok/s / 904ms

input / output

No data

OpenAI

GPT-5.4

speed / latency

N/A / N/A

input / output

No data

WSocket AI15 tests
Gemini

Gemini 2.5 Flash Lite

speed / latency

209 tok/s / 1627ms

input / output

No data

OpenAI

GPT-5.4

speed / latency

36 tok/s / 12157ms

input / output

No data

RinkoAI
Gemini

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite-c

speed / latency

No data

input / output

$0.0035/M / $0.0035/M

OpenAI

GPT-5.4

gpt-5.4

speed / latency

No data

input / output

$0.450/M / $2.70/M

ZEN-AI VIP
Gemini

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite-preview-09-2025

speed / latency

No data

input / output

$0.0041/M / $0.016/M

OpenAI

GPT-5.4

gpt-5.4-xhigh

speed / latency

No data

input / output

$0.103/M / $0.616/M

ModelPool
Gemini

Gemini 2.5 Flash Lite

gemini-2.5-flash-lite-nothinking

speed / latency

No data

input / output

$0.0086/M / $0.034/M

OpenAI

GPT-5.4

gpt-5.4

speed / latency

No data

input / output

$0.429/M / $3.43/M

When to choose each model

This report only uses LMSpeed data for Gemini 2.5 Flash Lite and GPT-5.4: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.

Guidance
GeminiGemini 2.5 Flash Lite
OpenAIGPT-5.4
When to choose each model

Gemini 2.5 Flash Lite

Gemini 2.5 Flash Lite is stronger when you prioritize Average speed, First-token latency.

GPT-5.4

GPT-5.4 is stronger when you prioritize Cheapest input price, Free providers, Provider coverage, Recent tests.

FAQ

TL;DR: GPT-5.4 leads across 86 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.

Why is this comparison indexable?
It has 6 verifiable comparison points, and both models have pricing or benchmark data.
Are missing metrics invented?
No. Metrics without LMSpeed data are omitted from this report.

Related compare reports

Continue from Gemini 2.5 Flash Lite vs GPT-5.4 into nearby model comparisons with enough verified LMSpeed data.

ClaudeClaude Opus 4.6 vs Gemini 2.5 Flash Lite6 verified data pointsGeminiGemini 2.5 Flash Lite vs GPT-56 verified data pointsGeminiGemini 2.5 Flash Lite vs GPT-5.26 verified data pointsGeminiGemini 2.5 Flash Lite vs Gemini 2.5 Pro6 verified data points

Data as of Jun 13, 2026, 04:22 PM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.