Data points: 85
The readout for Gemini 3.1 Flash Lite and GPT-5.4, before the detailed comparison sheet.
Decision read
GPT-5.4
GPT-5.4 currently has the stronger profile, with verified metric wins split 4 to 2 in its favor.
Evidence depth
85 data points
Includes 8 benchmark rows, 1 audit sample, and 9 provider examples.
Selection signal
Start with GPT-5.4
The charts below split 18 high-signal samples across speed, scores, and audit health.
| Model compare | Model A: Gemini 3.1 Flash Lite | Model B: GPT-5.4 |
|---|---|---|
| Overall leader | Contender | Leading |
| Verified metric wins | 2 wins | 4 wins |
| Where it leads | Cheapest input price, Average speed | First-token latency, Free providers, Provider coverage, Recent tests |
Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.
| Metric | Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|---|
| GPQA | 82.2% (#48) |
Latest completed audits from shared providers, with four safety and integrity score groups plus report links.
| Provider | Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|---|
| Winner: GPT-5.4 | Gemini 3.1 Flash Lite (gemini-3.1-flash-lite): No audit yet | GPT-5.4 (gpt-5.4) |
Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.
| Provider | Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|---|
| 20 tests | Gemini 3.1 Flash Lite speed / latency: 218 tok/s / 4094 ms; input / output: No data |
This report only uses LMSpeed data for Gemini 3.1 Flash Lite and GPT-5.4: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.
| Guidance | Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|---|
| When to choose each model | Gemini 3.1 Flash Lite is stronger when you prioritize the cheapest input price and average speed. | GPT-5.4 is stronger when you prioritize first-token latency, free providers, provider coverage, and recent tests. |
TL;DR: GPT-5.4 leads across 85 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.
| Model metadata | Gemini 3.1 Flash Lite exposes 1.0M tokens; notable signals: Text input, Image input, File input, Audio input. | GPT-5.4 exposes 1.1M tokens; notable signals: Text input, Image input, File input, Text output. |
|---|---|---|
| Developer | Google | OpenAI |
| Context window | 1.0M tokens | 1.1M tokens |
| Max output | 65.5K tokens | 128K tokens |
| Released | May 2026 | Mar 2026 |
| Modalities | Input: Text, Image, Video, File, Audio. Output: Text | Input: Text, Image, File. Output: Text |
| Features | Text input, Image input, File input, Audio input, Text output, Tool calling, Structured outputs, JSON mode, Reasoning | Text input, Image input, File input, Text output, Tool calling, Structured outputs |
| Parameters | No data | No data |
| Tokenizer | Gemini | GPT |
| Knowledge cutoff | No data | No data |
| OpenRouter ID | google/gemini-3.1-flash-lite | openai/gpt-5.4 |
| References | No data | No data |
| Metric | Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|---|
| Output speed | 319.9 tok/s (#3) | 92.4 tok/s (#56) |
| SciCode | 41.9% (#37) | 56.6% (#3) |
| HLE | 16.2% (#49) | 41.6% (#4) |
| Input price | $0.250/M (#12) | $2.50/M (#37) |
| Output price | $1.50/M (#20) | $15.00/M (#44) |
| Blended price | $0.563/M (#27) | $5.63/M (#63) |
| Time to first answer token | 5.31 s (#50) | 134.24 s (#111) |
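The blended price figures are consistent with a 3:1 input-to-output weighting of the listed input and output prices. That weighting is an assumption inferred from the numbers, not something stated on this page, and the function name `blended_price` is illustrative. A minimal sketch under that assumption:

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per million tokens.

    The default 3:1 input:output mix is an assumption that happens to
    reproduce the blended prices listed in the table.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


# Listed input / output prices in USD per million tokens.
gemini_blended = blended_price(0.25, 1.50)   # 0.5625
gpt_blended = blended_price(2.50, 15.00)     # 5.625

print(f"Gemini 3.1 Flash Lite: ${gemini_blended}/M")
print(f"GPT-5.4: ${gpt_blended}/M")
```

The results, 0.5625 and 5.625, match the listed $0.563/M and $5.63/M after rounding. If your traffic has a different input-to-output ratio, pass your own weights instead of relying on the default.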
Audit score: 90
GPT-5.4 speed / latency: 44 tok/s / 2518 ms; input / output: No data
| Provider | Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|---|
| 0CHAT (10 tests) | Speed / latency: N/A / N/A; input / output: No data | Speed / latency: 51 tok/s / 2526 ms; input / output: No data |
| QQ Code (10 tests) | Speed / latency: N/A / N/A; input / output: No data | Speed / latency: 55 tok/s / 2903 ms; input / output: No data |
| Sliam (10 tests) | Speed / latency: 179 tok/s / 1599 ms; input / output: No data | Speed / latency: N/A / N/A; input / output: No data |
| VSLLM (10 tests) | Speed / latency: 256 tok/s / 4450 ms; input / output: No data | Speed / latency: 60 tok/s / 2482 ms; input / output: No data |
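The provider rows above pair a throughput figure with a latency figure, so a rough end-to-end response time can be estimated from them. The sketch below assumes that latency means time to first token and that speed means output tokens per second after that point; neither definition is confirmed on this page, and `estimated_response_seconds` is a hypothetical helper. The 500-token response length is also an arbitrary illustration.

```python
from typing import Optional


def estimated_response_seconds(latency_ms: Optional[float],
                               tokens_per_second: Optional[float],
                               output_tokens: int) -> Optional[float]:
    """Estimate total response time as latency plus generation time.

    Returns None when a provider row has no usable data (N/A).
    """
    if latency_ms is None or tokens_per_second is None or tokens_per_second <= 0:
        return None
    return latency_ms / 1000.0 + output_tokens / tokens_per_second


# VSLLM row: (latency in ms, speed in tok/s) for each model.
vsllm = {
    "Gemini 3.1 Flash Lite": (4450, 256),
    "GPT-5.4": (2482, 60),
}

estimates = {
    model: estimated_response_seconds(latency, speed, output_tokens=500)
    for model, (latency, speed) in vsllm.items()
}

for model, seconds in estimates.items():
    print(f"{model}: about {seconds:.1f} s for 500 output tokens")
```

Under these assumptions GPT-5.4 starts answering sooner on this provider, but Gemini 3.1 Flash Lite finishes a 500-token response earlier because its throughput is higher. Short responses tilt the comparison toward the lower-latency model.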
| Gemini 3.1 Flash Lite | GPT-5.4 |
|---|---|
| gemini-3.1-flash-lite: speed / latency No data; input / output $0.010/M / $0.010/M | gpt-5-4: speed / latency No data; input / output $0.080/M / $0.080/M |
| gemini-3.1-flash-lite-preview-c: speed / latency No data; input / output $0.0035/M / $0.0035/M | gpt-5.4: speed / latency No data; input / output $0.450/M / $2.70/M |
| gemini-3.1-flash-lite: speed / latency No data; input / output $0.010/M / $0.010/M | gpt-5.4-openai-compact: speed / latency No data; input / output $75.00/M / $450.00/M |
| gemini-3.1-flash-lite-preview-c: speed / latency No data; input / output $0.010/M / $0.010/M | gpt-5.4-2026-03-05: speed / latency No data; input / output $75.00/M / $450.00/M |
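For a migration cost check, the per-provider input and output prices above can be applied to an expected token volume. The sketch below is illustrative only: the monthly volume (200M input tokens, 50M output tokens) is a made-up workload, `workload_cost_usd` is a hypothetical helper, and the prices are taken from one of the provider rows listed above.

```python
def workload_cost_usd(input_tokens: int, output_tokens: int,
                      input_per_m: float, output_per_m: float) -> float:
    """Cost in USD for a token volume, given prices per million tokens."""
    return (input_tokens / 1_000_000) * input_per_m \
        + (output_tokens / 1_000_000) * output_per_m


# Hypothetical monthly workload; replace with your own traffic numbers.
monthly_input_tokens = 200_000_000
monthly_output_tokens = 50_000_000

# Prices from the row pairing gemini-3.1-flash-lite-preview-c with gpt-5.4.
gemini_cost = workload_cost_usd(monthly_input_tokens, monthly_output_tokens,
                                input_per_m=0.0035, output_per_m=0.0035)
gpt_cost = workload_cost_usd(monthly_input_tokens, monthly_output_tokens,
                             input_per_m=0.450, output_per_m=2.70)

print(f"Gemini 3.1 Flash Lite: ${gemini_cost:.2f} per month")
print(f"GPT-5.4: ${gpt_cost:.2f} per month")
```

Because provider prices in these rows vary by orders of magnitude for the same model, it is worth rerunning the estimate per provider rather than relying on a single list price.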
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.