Sponsored byFusecodeEnterprise coding API for Claude Code, Codex, and model workflows.
LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
  1. Home
  2. Leaderboard
  3. Best Throughput Models Monthly
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Model Pricing
  • Model Speed
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️

LMSpeed Throughput Leaderboard

Multi-dimensional rankings based on model speed tests, provider health checks, and standard model benchmarks. Compare providers, endpoints, models, and reliability at a glance.

Ranked by median tokens per second (resistant to outliers). Higher is better for fast responses.

Data as of Jun 25, 2026, 05:46 PM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.

ThroughputFirst Token LatencyHealthModel PerformanceModel PricingModel Speed
Fairness Notice
To ensure fairness, the system uses AI-powered detection. Suspicious data or cheating will be flagged and removed from the leaderboard.
Last 7 DaysLast 30 Days
Archive Month
Jun 2026May 2026Apr 2026Mar 2026Feb 2026Jan 2026Dec 2025Nov 2025Oct 2025Sep 2025Aug 2025Jul 2025Jun 2025May 2025Apr 2025Mar 2025Feb 2025Jan 2025
RankProviderModelThroughputAvg first token latencyUpdatedTotal Tests
1NEW
qwen30b-sglang
590.19 t/s
Best: 704.04Worst: 391.40
7.20s
Jun 12, 06:36 PM
10
2NEW
钠 APInaapi.cc
llama3.1-8B
522.22 t/s
Best: 1761.98Worst: 327.76
0.70s
Jun 14, 05:04 PM
10
3NEW
6345ywz APIapi.6345ywz.cn
PRO/minimax-m2.7
480.83 t/s
Best: 556.70Worst: 453.54
0.96s
May 29, 05:26 PM
15
4NEW
6345ywz APIapi.6345ywz.cn
FAST/minimax-m2.7
469.27 t/s
Best: 506.28Worst: 97.36
1.25s
Jun 19, 03:03 PM
85
5NEW
X
Xiaomimimo APIapi.xiaomimimo.com
mimo-v2.5-pro-ultraspeed
418.09 t/s
Best: 544.37Worst: 282.90
0.98s
Jun 22, 03:17 AM
15
6NEW
6345ywz APIapi.6345ywz.cn
FAST/deepseek-v3.1
271.26 t/s
Best: 306.38Worst: 225.88
0.55s
Jun 14, 12:01 AM
15
7NEW
6345ywz APIapi.6345ywz.cn
FAST/deepseek-v3.2
269.21 t/s
Best: 317.88Worst: 88.01
1.34s
Jun 19, 03:07 PM
25
8NEW
6345ywz APIapi.6345ywz.cn
PRO/deepseek-v3.1
267.70 t/s
Best: 295.83Worst: 223.08
0.56s
May 29, 04:59 PM
15
9NEW
6345ywz APIapi.6345ywz.cn
PRO/deepseek-v3.2
262.79 t/s
Best: 331.18Worst: 222.72
0.98s
May 29, 05:01 PM
10
10NEW
a
api.generalcompute.comapi.generalcompute.com
deepseek-v3.2
251.77 t/s
Best: 299.48Worst: 211.69
1.18s
Jun 2, 01:49 PM
10
11NEW
a
api.generalcompute.comapi.generalcompute.com
minimax-m2.7
245.75 t/s
Best: 470.87Worst: 10.42
15.45s
Jun 2, 01:50 PM
20
12NEW
pro.fan142.toppro.fan142.top
gpt-5.3-codex-spark
222.42 t/s
Best: 1019.26Worst: 17.09
1.61s
Jun 9, 10:33 AM
20
13NEW
n
new.itus.ccnew.itus.cc
gemini-3.5-flash
173.93 t/s
Best: 463.86Worst: 111.23
6.93s
Jun 24, 08:38 AM
10
14NEW
0
02F APIapi.02f.cc:8317
gpt-5.3-codex-spark
166.35 t/s
Best: 1042.72Worst: 38.25
1.33s
Jun 4, 08:05 AM
25
15NEW
NVIDIA NIMintegrate.api.nvidia.com
openai/gpt-oss-20b
166.14 t/s
Best: 210.74Worst: 85.49
0.79s
May 30, 05:19 AM
10
16NEW
a
api.tsc-mc.cnapi.tsc-mc.cn
gemini-3-flash
163.33 t/s
Best: 459.95Worst: 45.51
5.01s
Jun 12, 08:38 AM
20
17NEW
a
api.bbbc.eu.orgapi.bbbc.eu.org
kimi-k2.7-code
152.73 t/s
Best: 220.18Worst: 43.27
3.47s
Jun 13, 02:00 PM
10
18NEW
apihub.agnes-ai.comapihub.agnes-ai.com
agnes-1.5-flash
140.56 t/s
Best: 160.47Worst: 20.44
1.47s
Jun 21, 12:10 PM
10
19NEW
6345ywz APIapi.6345ywz.cn
meta/llama-3.1-8b-instruct
111.01 t/s
Best: 170.28Worst: 95.49
0.25s
Jun 4, 04:30 AM
10
20NEW
apihub.agnes-ai.comapihub.agnes-ai.com
agnes-2.0-flash
107.45 t/s
Best: 200.69Worst: 7.21
0.78s
Jun 23, 10:08 AM
55
21NEW
b
bayunzi.shop:8081bayunzi.shop:8081
gemini-3.5-flash-thinking
106.79 t/s
Best: 121.13Worst: 82.85
2.41s
Jun 4, 03:13 AM
10
22NEW
NVIDIA NIMintegrate.api.nvidia.com
nvidia/nemotron-3-ultra-550b-a55b
97.76 t/s
Best: 124.85Worst: 73.94
1.02s
Jun 4, 02:35 PM
10
23NEW
t
token.juda.devtoken.juda.dev
MiniMax-M2.7-highspeed
94.41 t/s
Best: 110.77Worst: 86.55
4.99s
Jun 25, 01:19 PM
15
24NEW
o
oneapi.milolab.cnoneapi.milolab.cn
MiniMax-M2.7-highspeed
93.17 t/s
Best: 99.45Worst: 50.81
6.69s
Jun 13, 09:54 AM
10
2516
DeepSeekapi.deepseek.com
deepseek-v4-flash
89.59 t/s
Best: 121.92Worst: 64.90
1.42s
Jun 25, 05:46 PM
15
26NEW
a
api.bluesminds.comapi.bluesminds.com
gpt-5-mini
89.23 t/s
Best: 140.09Worst: 31.77
7.27s
Jun 20, 02:18 PM
10
27NEW
a
ai.beehears.comai.beehears.com
gpt-5.4-mini
88.97 t/s
Best: 104.65Worst: 8.10
2.95s
Jun 20, 03:57 PM
15
28NEW
NVIDIA NIMintegrate.api.nvidia.com
z-ai/glm-5.1
88.13 t/s
Best: 146.74Worst: 18.23
7.99s
Jun 18, 12:27 PM
10
29NEW
NVIDIA NIMintegrate.api.nvidia.com
stepfun-ai/step-3.7-flash
86.30 t/s
Best: 261.76Worst: 26.00
15.22s
Jun 12, 11:11 PM
10
30NEW
o
oneapi.milolab.cnoneapi.milolab.cn
MiniMax-M2.7
84.22 t/s
Best: 91.91Worst: 10.80
6.37s
Jun 10, 01:17 PM
15
31NEW
OpenCodeopencode.ai
deepseek-v4-flash-free
80.52 t/s
Best: 123.35Worst: 10.26
26.51s
Jun 14, 03:54 PM
15
32NEW
X
Xiaomimimo Token Plan CNtoken-plan-cn.xiaomimimo.com
mimo-v2.5
78.82 t/s
Best: 103.25Worst: 56.29
3.12s
Jun 12, 04:20 AM
25
33NEW
a
api.0326.topapi.0326.top
xy1.0-fast
77.80 t/s
Best: 185.65Worst: 15.85
3.01s
Jun 9, 03:33 PM
15
34NEW
n
new.itus.ccnew.itus.cc
mimo-v2.5
77.72 t/s
Best: 91.05Worst: 61.36
1.50s
Jun 24, 09:49 AM
10
35NEW
1
123NHH APIapi.123nhh.com
deepseek-v4-flash
77.65 t/s
Best: 114.84Worst: 61.54
2.61s
Jun 4, 02:47 PM
15
364
DeepSeekapi.deepseek.com
deepseek-v4-flash
75.69 t/s
Best: 116.62Worst: 50.69
1.74s
Jun 15, 01:24 AM
55
37NEW
1
123NHH APIapi.123nhh.com
agnes-2.0-flash
73.91 t/s
Best: 137.12Worst: 16.28
1.38s
Jun 4, 03:48 PM
10
38NEW
火山引擎 Arkark.cn-beijing.volces.com
DeepSeek-V4-Flash
71.97 t/s
Best: 88.69Worst: 56.56
2.73s
Jun 8, 08:00 AM
10
39NEW
E
EdgeFN APIapi.edgefn.net
GLM-5
71.33 t/s
Best: 86.59Worst: 37.38
11.58s
Jun 5, 06:56 AM
10
408
X
Xiaomimimo APIapi.xiaomimimo.com
mimo-v2.5
71.27 t/s
Best: 95.28Worst: 48.27
1.72s
Jun 8, 08:18 AM
10
41NEW
a
aihub.071129.xyzaihub.071129.xyz
Kimi-k2.6
70.55 t/s
Best: 112.35Worst: 33.77
11.98s
Jun 23, 12:38 PM
10
42NEW
OpenCodeopencode.ai
glm-5.2
64.02 t/s
Best: 74.90Worst: 47.17
7.85s
Jun 24, 10:08 AM
10
43NEW
MiniMaxapi.minimaxi.com
MiniMax-M2.7
63.14 t/s
Best: 134.88Worst: 30.89
1.20s
Jun 5, 06:38 AM
15
44NEW
DeepSeekapi.deepseek.com
deepseek-v4-pro
62.39 t/s
Best: 83.00Worst: 41.81
2.94s
Jun 23, 12:03 PM
10
45NEW
b
buddybackend.cloudbuddybackend.cloud
deepseek-v4-flash
61.70 t/s
Best: 110.81Worst: 11.03
7.30s
Jun 25, 03:57 PM
10
46NEW
y
yibuapi.comyibuapi.com
gpt-5.5
61.51 t/s
Best: 82.87Worst: 13.43
3.45s
Jun 4, 06:21 AM
10
47NEW
智谱 AIopen.bigmodel.cn
glm-5-turbo
59.88 t/s
Best: 82.41Worst: 44.50
13.29s
May 31, 04:31 AM
15
48NEW
火山引擎 Arkark.cn-beijing.volces.com
glm-5.2
59.34 t/s
Best: 68.85Worst: 40.69
3.81s
Jun 24, 10:12 AM
10
49NEW
a
api.sbbbbbbbbb.xyzapi.sbbbbbbbbb.xyz
gpt-5.5
57.62 t/s
Best: 79.11Worst: 22.39
3.18s
Jun 18, 10:36 AM
20
50NEW
h
hubway.cchubway.cc
gpt-5.5
57.57 t/s
Best: 76.74Worst: 14.12
5.98s
Jun 19, 08:44 AM
10
u
u544625-hyxk-c75d1b7a.bjb1.seetacloud.com:8443u544625-hyxk-c75d1b7a.bjb1.seetacloud.com:8443
First Token Latencys
HealthAvg probe latency
Model PerformanceAA benchmark matrix
Model PricingAA pricing matrix
Model SpeedAA speed matrix
Model
Provider