LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Model Pricing
  • Model Speed
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
Back to models

Data points: 85

On this page

Key TakeawaysComparison sheetBenchmark score comparisonAPI audit comparisonProvider examplesWhen to choose each modelFAQRelated compare reports

Key Takeaways

The readout for Gemini 3.1 Flash Lite and GPT-5.4, before the detailed comparison sheet.

Decision read

GPT-5.4

GPT-5.4 currently has the stronger profile, with verified wins split 2 to 4.

Evidence depth

85 data points

Includes 8 benchmark rows, 1 audit samples, and 9 provider examples.

Selection signal

Start with GPT-5.4

The charts below split 18 high-signal samples across speed, scores, and audit health.

Model compare

Gemini 3.1 Flash Lite vs GPT-5.4

gemini-3-1-flash-lite-vs-gpt-5-4

Model A

Gemini

Gemini 3.1 Flash Lite

Model B

OpenAI

GPT-5.4

Overall leaderContenderLeading
Verified metric wins2 wins4 wins
Where it leadsCheapest input price, Average speed

Benchmark score comparison

Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.

Metric
GeminiGemini 3.1 Flash Lite
OpenAIGPT-5.4
GPQA
82.2%#48

API audit comparison

Latest completed audits from shared providers, with four safety and integrity score groups plus report links.

Provider
GeminiGemini 3.1 Flash Lite
OpenAIGPT-5.4
Winner: GPT-5.4
Gemini

Gemini 3.1 Flash Lite

gemini-3.1-flash-lite

No audit yet

OpenAI

GPT-5.4

gpt-5.4

Provider examples

Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.

Provider
GeminiGemini 3.1 Flash Lite
OpenAIGPT-5.4
20 tests
Gemini

Gemini 3.1 Flash Lite

speed / latency

218 tok/s / 4094ms

input / output

No data

OpenAI

When to choose each model

This report only uses LMSpeed data for Gemini 3.1 Flash Lite and GPT-5.4: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.

Guidance
GeminiGemini 3.1 Flash Lite
OpenAIGPT-5.4
When to choose each model

Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is stronger when you prioritize Cheapest input price, Average speed.

GPT-5.4

GPT-5.4 is stronger when you prioritize First-token latency, Free providers, Provider coverage, Recent tests.

FAQ

TL;DR: GPT-5.4 leads across 85 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.

Why is this comparison indexable?
It has 6 verifiable comparison points, and both models have pricing or benchmark data.
Are missing metrics invented?
No. Metrics without LMSpeed data are omitted from this report.

Related compare reports

Continue from Gemini 3.1 Flash Lite vs GPT-5.4 into nearby model comparisons with enough verified LMSpeed data.

First-token latency, Free providers, Provider coverage, Recent tests
Model metadataGemini 3.1 Flash Lite exposes 1.0M tokens; notable signals: Text input, Image input, File input, Audio input.GPT-5.4 exposes 1.1M tokens; notable signals: Text input, Image input, File input, Text output.
DeveloperGoogleOpenAI
Context window1.0M tokens1.1M tokens
Max output65.5K tokens128K tokens
ReleasedMay 2026Mar 2026
Modalities

Input

TextImagevideoFileAudio

Output

Text

Input

TextImageFile

Output

Text
Features
Text inputImage inputFile inputAudio inputText outputTool callingStructured outputsJSON modeReasoning
Text inputImage inputFile inputText outputTool callingStructured outputs
ParametersNo dataNo data
TokenizerGeminiGPT
Knowledge cutoffNo dataNo data
OpenRouter IDgoogle/gemini-3.1-flash-liteopenai/gpt-5.4
ReferencesNo dataNo data
92.0%
#3
Output speed
319.9 tok/s#3
92.4 tok/s#56
SciCode
41.9%#37
56.6%#3
HLE
16.2%#49
41.6%#4
Input price
$0.250/M#12
$2.50/M#37
Output price
$1.50/M#20
$15.00/M#44
Blended price
$0.563/M#27
$5.63/M#63
Time to first answer token
5.31 s#50
134.24 s#111

Audit score

90

1008476100

GPT-5.4

speed / latency

44 tok/s / 2518ms

input / output

No data

0CHAT10 tests
Gemini

Gemini 3.1 Flash Lite

speed / latency

N/A / N/A

input / output

No data

OpenAI

GPT-5.4

speed / latency

51 tok/s / 2526ms

input / output

No data

QQ Code10 tests
Gemini

Gemini 3.1 Flash Lite

speed / latency

N/A / N/A

input / output

No data

OpenAI

GPT-5.4

speed / latency

55 tok/s / 2903ms

input / output

No data

Sliam10 tests
Gemini

Gemini 3.1 Flash Lite

speed / latency

179 tok/s / 1599ms

input / output

No data

OpenAI

GPT-5.4

speed / latency

N/A / N/A

input / output

No data

VSLLM10 tests
Gemini

Gemini 3.1 Flash Lite

speed / latency

256 tok/s / 4450ms

input / output

No data

OpenAI

GPT-5.4

speed / latency

60 tok/s / 2482ms

input / output

No data

91VIP API
Gemini

Gemini 3.1 Flash Lite

gemini-3.1-flash-lite

speed / latency

No data

input / output

$0.010/M / $0.010/M

OpenAI

GPT-5.4

gpt-5-4

speed / latency

No data

input / output

$0.080/M / $0.080/M

RinkoAI
Gemini

Gemini 3.1 Flash Lite

gemini-3.1-flash-lite-preview-c

speed / latency

No data

input / output

$0.0035/M / $0.0035/M

OpenAI

GPT-5.4

gpt-5.4

speed / latency

No data

input / output

$0.450/M / $2.70/M

CookingAI
Gemini

Gemini 3.1 Flash Lite

gemini-3.1-flash-lite

speed / latency

No data

input / output

$0.010/M / $0.010/M

OpenAI

GPT-5.4

gpt-5.4-openai-compact

speed / latency

No data

input / output

$75.00/M / $450.00/M

全球AI
Gemini

Gemini 3.1 Flash Lite

gemini-3.1-flash-lite-preview-c

speed / latency

No data

input / output

$0.010/M / $0.010/M

OpenAI

GPT-5.4

gpt-5.4-2026-03-05

speed / latency

No data

input / output

$75.00/M / $450.00/M

钠 API
小水管 API
ClaudeClaude Opus 4.6 vs Gemini 3.1 Flash Lite6 verified data points
GeminiGemini 3.1 Flash Lite vs GPT-56 verified data points
GeminiGemini 3.1 Flash Lite vs GPT-5.26 verified data points
GeminiGemini 2.5 Pro vs Gemini 3.1 Flash Lite6 verified data points

Data as of Jun 13, 2026, 01:45 PM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.

View report
JSON mode
Reasoning