LMSpeed
© 2026 LMSpeed. All Rights Reserved. Made by Nexmoe with ❤️

Data points: 108

Key Takeaways

A summary readout for DeepSeek V4 Flash and GLM-5.1, ahead of the detailed comparison sheet.

Decision read: DeepSeek V4 Flash
DeepSeek V4 Flash currently has the stronger profile, with verified wins split 5 to 1.

Evidence depth: 108 data points
Includes 8 benchmark rows, 6 audit samples, and 9 provider examples.

Selection signal: Start with DeepSeek V4 Flash
The charts below split 23 high-signal samples across speed, scores, and audit health.

Model compare

DeepSeek V4 Flash vs GLM-5.1 (deepseek-v4-flash-vs-glm-5-1)

Field | Model A: DeepSeek V4 Flash (DeepSeek) | Model B: GLM-5.1 (ChatGLM)
Overall leader | Leading | Contender
Verified metric wins | 5 wins | 1 win
Where it leads | Average speed, First-token latency, Free providers, Provider coverage, Recent tests | Cheapest input price
Model metadata | DeepSeek V4 Flash exposes 1.0M tokens; notable signals: Text input, Text output, Tool calling, Structured outputs. | No OpenRouter metadata is available yet for this model.
Developer | DeepSeek | Zhipu AI
Context window | 1.0M tokens | No data
Max output | 131.1K tokens | No data
Released | Apr 2026 | No data
Modalities | Input: Text; Output: Text | No data
Features | Text input, Text output, Tool calling, Structured outputs, JSON mode, Reasoning | None listed
Parameters | No data | No data
Tokenizer | DeepSeek | No data
Knowledge cutoff | No data | No data
OpenRouter ID | deepseek/deepseek-v4-flash | No data
References | No data | No data

Benchmark score comparison

Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.

Metric | DeepSeek V4 Flash | GLM-5.1
Output price | $0.280/M (#4) | $4.40/M (#33)
Input price | $0.140/M (#8) | $1.40/M (#32)
Blended price | $0.175/M (#10) | $2.15/M (#53)
GPQA | 89.4% (#11) | 86.8% (#22)
HLE | 32.1% (#13) | 28.0% (#18)
SciCode | 44.9% (#23) | 43.8% (#26)
Time to first token | 0.94 s (#34) | 0.90 s (#31)
Output speed | 96.7 tok/s (#52) | 71.0 tok/s (#72)
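
The report does not state how the blended price is computed. The listed figures are consistent with a 3:1 input-to-output weighting, so the sketch below uses that ratio as an assumption; the function name and the weights are illustrative, and only the input and output prices come from the report.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per million tokens.

    The 3:1 input:output weighting is an assumption, not a documented
    LMSpeed formula; it happens to reproduce the listed blended prices.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


# Input / output prices per million tokens, copied from the benchmark rows.
prices = {
    "DeepSeek V4 Flash": (0.140, 0.280),
    "GLM-5.1": (1.40, 4.40),
}

for model, (input_price, output_price) in prices.items():
    print(f"{model}: ${blended_price(input_price, output_price):.3f}/M")
```

With these inputs the sketch prints $0.175/M and $2.150/M, which agrees with the listed blended prices of $0.175/M and $2.15/M.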

API audit comparison

Latest completed audits from shared providers, with four safety and integrity score groups plus report links.

Each model cell shows the model ID, the audit score, and the four group scores in parentheses.

Provider | Winner | DeepSeek V4 Flash | GLM-5.1
PICO API | DeepSeek V4 Flash | deepseek-v4-flash: 92 (100, 68, 100, 100) | glm-5.1: No audit yet
MyDamoxing | DeepSeek V4 Flash | deepseek-v4-flash: 84 (78, 75, 84, 100) | GLM-5.1: No audit yet
CatClaw API | DeepSeek V4 Flash | deepseek-v4-flash: 83 (76, 68, 86, 100) | glm-5.1: 78 (76, 68, 68, 100)
小水管 API | DeepSeek V4 Flash | deepseek-v4-flash: 81 (72, 72, 80, 100) | glm-5.1: No audit yet
Tokeness | DeepSeek V4 Flash | deepseek-v4-flash: 79 (64, 84, 68, 100) | glm-5.1: No audit yet
VSLLM | GLM-5.1 | deepseek-v4-flash-free: No audit yet | glm-5.1-free: 79 (67, 84, 65, 100)
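
The report does not explain how the overall audit score relates to the four group scores. In the completed audits above, each overall score equals the mean of its four group scores rounded half up (for example 100, 68, 100, 100 for PICO API gives 92). The sketch below checks that observation; treat the formula as an inference from these seven samples, not as LMSpeed's documented method. The function name is hypothetical.

```python
import math


def mean_round_half_up(scores):
    """Mean of the group scores, rounded half up to an integer.

    Python's built-in round() rounds halves to the nearest even number,
    so floor(x + 0.5) is used instead (82.5 becomes 83, not 82).
    """
    return math.floor(sum(scores) / len(scores) + 0.5)


# (provider, model ID) -> (listed audit score, four group scores),
# copied from the completed audits in the report.
AUDITS = {
    ("PICO API", "deepseek-v4-flash"): (92, [100, 68, 100, 100]),
    ("MyDamoxing", "deepseek-v4-flash"): (84, [78, 75, 84, 100]),
    ("CatClaw API", "deepseek-v4-flash"): (83, [76, 68, 86, 100]),
    ("CatClaw API", "glm-5.1"): (78, [76, 68, 68, 100]),
    ("小水管 API", "deepseek-v4-flash"): (81, [72, 72, 80, 100]),
    ("Tokeness", "deepseek-v4-flash"): (79, [64, 84, 68, 100]),
    ("VSLLM", "glm-5.1-free"): (79, [67, 84, 65, 100]),
}

for (provider, model_id), (listed, groups) in AUDITS.items():
    derived = mean_round_half_up(groups)
    status = "matches" if derived == listed else "differs"
    print(f"{provider} / {model_id}: listed {listed}, derived {derived} ({status})")
```

The rounding choice matters for CatClaw API, where the mean of the DeepSeek V4 Flash group scores is exactly 82.5 and the listed score is 83.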

Provider examples

Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.

Providers with speed tests (input / output pricing: No data for either model at these providers):

Provider | Tests | DeepSeek V4 Flash speed / latency | GLM-5.1 speed / latency
NVIDIA NIM | 45 | 24 tok/s / 9354 ms | 123 tok/s / 671 ms
SiliconFlow | 35 | 45 tok/s / 9731 ms | 50 tok/s / 9040 ms
火山引擎 Ark | 35 | 73 tok/s / 3161 ms | 35 tok/s / 22863 ms
小水管 API | 25 | 83 tok/s / 5114 ms | 37 tok/s / 20151 ms
阿里云百炼 DashScope | 25 | 108 tok/s / 3324 ms | 45 tok/s / 14244 ms

Providers with pricing (speed / latency: No data for either model at these providers):

Provider | DeepSeek V4 Flash model ID | DeepSeek V4 Flash input / output | GLM-5.1 model ID | GLM-5.1 input / output
91VIP API | deepseek-ai/deepseek-v4-flash | $0/M / $0/M | glm-5.1 | $0.050/M / $0.050/M
AI Claw API | deepseek-v4-flash | $0/M / $0/M | z-ai/glm-5.1 | $375.00/M / $375.00/M
Dibin84 API Hub | deepseek-ai/deepseek-v4-flash | $0/M / $0/M | glm-5.1 | $3.65/M / $11.68/M
MapleLeaf API | deepseek-v4-flash-nothinking | $75.00/M / $75.00/M | glm-5.1 | $0/M / $0/M
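
One way to turn the provider rows above into a migration check is to multiply expected token volumes by the per-million prices, and to approximate response time as latency plus output tokens divided by throughput. The sketch below is a rough estimate under stated assumptions: the listed latency is treated as time to first token, throughput is treated as steady, and the workload figures and function names are hypothetical. Only the prices and speed figures are taken from the report.

```python
def monthly_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Cost in dollars for a token volume at per-million-token prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def response_seconds(latency_ms: float, tok_per_s: float, output_tokens: int) -> float:
    """Approximate wall-clock time: latency plus steady-rate generation time.

    Assumes latency_ms is time to first token; the report does not define it.
    """
    return latency_ms / 1000 + output_tokens / tok_per_s


# Hypothetical workload: 200M input and 50M output tokens per month.
INPUT_TOKENS, OUTPUT_TOKENS = 200_000_000, 50_000_000

# Dibin84 API Hub prices from the report: GLM-5.1 at $3.65/M in, $11.68/M out;
# DeepSeek V4 Flash at $0/M in and out.
glm_cost = monthly_cost(INPUT_TOKENS, OUTPUT_TOKENS, 3.65, 11.68)
deepseek_cost = monthly_cost(INPUT_TOKENS, OUTPUT_TOKENS, 0.0, 0.0)
print(f"Dibin84 API Hub monthly cost: GLM-5.1 ${glm_cost:,.2f}, "
      f"DeepSeek V4 Flash ${deepseek_cost:,.2f}")

# NVIDIA NIM speed / latency from the report, for a hypothetical 500-token reply.
deepseek_time = response_seconds(9354, 24, 500)
glm_time = response_seconds(671, 123, 500)
print(f"NVIDIA NIM 500-token reply: DeepSeek V4 Flash {deepseek_time:.1f} s, "
      f"GLM-5.1 {glm_time:.1f} s")
```

Because per-provider results can run opposite to the overall ranking, as the NVIDIA NIM row does, it is worth running the estimate against the specific provider you plan to use.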

When to choose each model

This report uses only LMSpeed data for DeepSeek V4 Flash and GLM-5.1: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.

DeepSeek V4 Flash
DeepSeek V4 Flash is stronger when you prioritize average speed, first-token latency, free providers, provider coverage, and recent tests.

GLM-5.1
GLM-5.1 is stronger when you prioritize the cheapest input price.

FAQ

TL;DR: DeepSeek V4 Flash leads across 108 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.

Why is this comparison indexable?
It has 6 verifiable comparison points, and both models have pricing or benchmark data.

Are missing metrics invented?
No. Metrics without LMSpeed data are omitted from this report.

Related compare reports

Continue from DeepSeek V4 Flash vs GLM-5.1 into nearby model comparisons with enough verified LMSpeed data.

  • DeepSeek V4 Flash vs GPT-5.4 (6 verified data points)
  • Claude Opus 4.6 vs DeepSeek V4 Flash (6 verified data points)
  • DeepSeek V4 Flash vs GPT-5 (6 verified data points)
  • DeepSeek V4 Flash vs GPT-5.2 (6 verified data points)

Data as of Jun 13, 2026, 01:26 PM. Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.