Multi-dimensional rankings based on model speed tests, provider health checks, and standard model benchmarks. Compare providers, endpoints, models, and reliability at a glance.
Ranked by median tokens per second (resistant to outliers). Higher is better for fast responses.
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.
| Rank | Provider | Model | Throughput | Avg first token latency | Updated | Total Tests |
|---|---|---|---|---|---|---|
1NEW | qwen30b-sglang | 590.19 t/s Best: 704.04Worst: 391.40 | 7.20s | 10 | ||
2NEW |
钠 APInaapi.cc |
| llama3.1-8B |
522.22 t/s Best: 1761.98Worst: 327.76 |
0.70s |
10 |
3NEW | 6345ywz APIapi.6345ywz.cn | PRO/minimax-m2.7 | 480.83 t/s Best: 556.70Worst: 453.54 | 0.96s | 15 |
4NEW | 6345ywz APIapi.6345ywz.cn | FAST/minimax-m2.7 | 469.27 t/s Best: 506.28Worst: 97.36 | 1.25s | 85 |
5NEW | X Xiaomimimo APIapi.xiaomimimo.com | mimo-v2.5-pro-ultraspeed | 418.09 t/s Best: 544.37Worst: 282.90 | 0.98s | 15 |
6NEW | 6345ywz APIapi.6345ywz.cn | FAST/deepseek-v3.1 | 271.26 t/s Best: 306.38Worst: 225.88 | 0.55s | 15 |
7NEW | 6345ywz APIapi.6345ywz.cn | FAST/deepseek-v3.2 | 269.21 t/s Best: 317.88Worst: 88.01 | 1.34s | 25 |
8NEW | 6345ywz APIapi.6345ywz.cn | PRO/deepseek-v3.1 | 267.70 t/s Best: 295.83Worst: 223.08 | 0.56s | 15 |
9NEW | 6345ywz APIapi.6345ywz.cn | PRO/deepseek-v3.2 | 262.79 t/s Best: 331.18Worst: 222.72 | 0.98s | 10 |
10NEW | a api.generalcompute.comapi.generalcompute.com | deepseek-v3.2 | 251.77 t/s Best: 299.48Worst: 211.69 | 1.18s | 10 |
11NEW | a api.generalcompute.comapi.generalcompute.com | minimax-m2.7 | 245.75 t/s Best: 470.87Worst: 10.42 | 15.45s | 20 |
12NEW | pro.fan142.toppro.fan142.top | gpt-5.3-codex-spark | 222.42 t/s Best: 1019.26Worst: 17.09 | 1.61s | 20 |
13NEW | n new.itus.ccnew.itus.cc | gemini-3.5-flash | 173.93 t/s Best: 463.86Worst: 111.23 | 6.93s | 10 |
14NEW | 0 02F APIapi.02f.cc:8317 | gpt-5.3-codex-spark | 166.35 t/s Best: 1042.72Worst: 38.25 | 1.33s | 25 |
15NEW | NVIDIA NIMintegrate.api.nvidia.com | openai/gpt-oss-20b | 166.14 t/s Best: 210.74Worst: 85.49 | 0.79s | 10 |
16NEW | a api.tsc-mc.cnapi.tsc-mc.cn | gemini-3-flash | 163.33 t/s Best: 459.95Worst: 45.51 | 5.01s | 20 |
17NEW | a api.bbbc.eu.orgapi.bbbc.eu.org | kimi-k2.7-code | 152.73 t/s Best: 220.18Worst: 43.27 | 3.47s | 10 |
18NEW | apihub.agnes-ai.comapihub.agnes-ai.com | agnes-1.5-flash | 140.56 t/s Best: 160.47Worst: 20.44 | 1.47s | 10 |
19NEW | 6345ywz APIapi.6345ywz.cn | meta/llama-3.1-8b-instruct | 111.01 t/s Best: 170.28Worst: 95.49 | 0.25s | 10 |
20NEW | apihub.agnes-ai.comapihub.agnes-ai.com | agnes-2.0-flash | 107.45 t/s Best: 200.69Worst: 7.21 | 0.78s | 55 |
21NEW | b bayunzi.shop:8081bayunzi.shop:8081 | gemini-3.5-flash-thinking | 106.79 t/s Best: 121.13Worst: 82.85 | 2.41s | 10 |
22NEW | NVIDIA NIMintegrate.api.nvidia.com | nvidia/nemotron-3-ultra-550b-a55b | 97.76 t/s Best: 124.85Worst: 73.94 | 1.02s | 10 |
23NEW | t token.juda.devtoken.juda.dev | MiniMax-M2.7-highspeed | 94.41 t/s Best: 110.77Worst: 86.55 | 4.99s | 15 |
24NEW | o oneapi.milolab.cnoneapi.milolab.cn | MiniMax-M2.7-highspeed | 93.17 t/s Best: 99.45Worst: 50.81 | 6.69s | 10 |
2516 | DeepSeekapi.deepseek.com | deepseek-v4-flash | 89.59 t/s Best: 121.92Worst: 64.90 | 1.42s | 15 |
26NEW | a api.bluesminds.comapi.bluesminds.com | gpt-5-mini | 89.23 t/s Best: 140.09Worst: 31.77 | 7.27s | 10 |
27NEW | a ai.beehears.comai.beehears.com | gpt-5.4-mini | 88.97 t/s Best: 104.65Worst: 8.10 | 2.95s | 15 |
28NEW | NVIDIA NIMintegrate.api.nvidia.com | z-ai/glm-5.1 | 88.13 t/s Best: 146.74Worst: 18.23 | 7.99s | 10 |
29NEW | NVIDIA NIMintegrate.api.nvidia.com | stepfun-ai/step-3.7-flash | 86.30 t/s Best: 261.76Worst: 26.00 | 15.22s | 10 |
30NEW | o oneapi.milolab.cnoneapi.milolab.cn | MiniMax-M2.7 | 84.22 t/s Best: 91.91Worst: 10.80 | 6.37s | 15 |
31NEW | OpenCodeopencode.ai | deepseek-v4-flash-free | 80.52 t/s Best: 123.35Worst: 10.26 | 26.51s | 15 |
32NEW | X Xiaomimimo Token Plan CNtoken-plan-cn.xiaomimimo.com | mimo-v2.5 | 78.82 t/s Best: 103.25Worst: 56.29 | 3.12s | 25 |
33NEW | a api.0326.topapi.0326.top | xy1.0-fast | 77.80 t/s Best: 185.65Worst: 15.85 | 3.01s | 15 |
34NEW | n new.itus.ccnew.itus.cc | mimo-v2.5 | 77.72 t/s Best: 91.05Worst: 61.36 | 1.50s | 10 |
35NEW | 1 123NHH APIapi.123nhh.com | deepseek-v4-flash | 77.65 t/s Best: 114.84Worst: 61.54 | 2.61s | 15 |
364 | DeepSeekapi.deepseek.com | deepseek-v4-flash | 75.69 t/s Best: 116.62Worst: 50.69 | 1.74s | 55 |
37NEW | 1 123NHH APIapi.123nhh.com | agnes-2.0-flash | 73.91 t/s Best: 137.12Worst: 16.28 | 1.38s | 10 |
38NEW | 火山引擎 Arkark.cn-beijing.volces.com | DeepSeek-V4-Flash | 71.97 t/s Best: 88.69Worst: 56.56 | 2.73s | 10 |
39NEW | E EdgeFN APIapi.edgefn.net | GLM-5 | 71.33 t/s Best: 86.59Worst: 37.38 | 11.58s | 10 |
408 | X Xiaomimimo APIapi.xiaomimimo.com | mimo-v2.5 | 71.27 t/s Best: 95.28Worst: 48.27 | 1.72s | 10 |
41NEW | a aihub.071129.xyzaihub.071129.xyz | Kimi-k2.6 | 70.55 t/s Best: 112.35Worst: 33.77 | 11.98s | 10 |
42NEW | OpenCodeopencode.ai | glm-5.2 | 64.02 t/s Best: 74.90Worst: 47.17 | 7.85s | 10 |
43NEW | MiniMaxapi.minimaxi.com | MiniMax-M2.7 | 63.14 t/s Best: 134.88Worst: 30.89 | 1.20s | 15 |
44NEW | DeepSeekapi.deepseek.com | deepseek-v4-pro | 62.39 t/s Best: 83.00Worst: 41.81 | 2.94s | 10 |
45NEW | b buddybackend.cloudbuddybackend.cloud | deepseek-v4-flash | 61.70 t/s Best: 110.81Worst: 11.03 | 7.30s | 10 |
46NEW | y yibuapi.comyibuapi.com | gpt-5.5 | 61.51 t/s Best: 82.87Worst: 13.43 | 3.45s | 10 |
47NEW | 智谱 AIopen.bigmodel.cn | glm-5-turbo | 59.88 t/s Best: 82.41Worst: 44.50 | 13.29s | 15 |
48NEW | 火山引擎 Arkark.cn-beijing.volces.com | glm-5.2 | 59.34 t/s Best: 68.85Worst: 40.69 | 3.81s | 10 |
49NEW | a api.sbbbbbbbbb.xyzapi.sbbbbbbbbb.xyz | gpt-5.5 | 57.62 t/s Best: 79.11Worst: 22.39 | 3.18s | 20 |
50NEW | h hubway.cchubway.cc | gpt-5.5 | 57.57 t/s Best: 76.74Worst: 14.12 | 5.98s | 10 |