Multi-dimensional rankings based on model speed tests, provider health checks, and standard model benchmarks. Compare providers, endpoints, models, and reliability at a glance.
Ranked by median tokens per second (resistant to outliers). Higher is better for fast responses.
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.
| Rank | Provider | Model | Throughput | Avg first token latency | Updated | Total Tests |
|---|---|---|---|---|---|---|
1NEW | MBZUAI-IFM/K2-Think-v2 | 2022.13 t/s Best: 2868.88Worst: 619.71 | 2.75s | 5 | ||
2NEW |
6345ywz APIapi.6345ywz.cn |
| PRO/minimax-m2.7 |
480.83 t/s Best: 556.70Worst: 453.54 |
0.96s |
15 |
3NEW | GPT Load (Shiho)gpt-load.shiho.top | openai/gpt-oss-120b | 480.23 t/s Best: 488.51Worst: 475.39 | 0.38s | 5 |
4NEW | 6345ywz APIapi.6345ywz.cn | FAST/minimax-m2.7 | 469.21 t/s Best: 507.72Worst: 98.56 | 1.22s | 80 |
5NEW | h hub.oaifree.comhub.oaifree.com | minimax-m2.7 | 459.40 t/s Best: 469.90Worst: 422.10 | 2.08s | 5 |
6NEW | 6345ywz APIapi.6345ywz.cn | FAST/deepseek-v3.1 | 272.83 t/s Best: 311.60Worst: 224.84 | 0.54s | 10 |
7NEW | 6345ywz APIapi.6345ywz.cn | FAST/deepseek-v3.2 | 269.24 t/s Best: 324.56Worst: 54.87 | 1.34s | 20 |
8NEW | 6345ywz APIapi.6345ywz.cn | PRO/deepseek-v3.1 | 267.70 t/s Best: 295.83Worst: 223.08 | 0.56s | 15 |
9NEW | 6345ywz APIapi.6345ywz.cn | PRO/deepseek-v3.2 | 262.79 t/s Best: 331.18Worst: 222.72 | 0.98s | 10 |
10NEW | a api.generalcompute.comapi.generalcompute.com | deepseek-v3.2 | 251.77 t/s Best: 299.48Worst: 211.69 | 1.18s | 10 |
11NEW | a api.generalcompute.comapi.generalcompute.com | minimax-m2.7 | 245.75 t/s Best: 470.87Worst: 10.42 | 15.45s | 20 |
12NEW | Fengsili APIapi.fengsili.online | skywork-ai/skyclaw-v1-lite | 203.90 t/s Best: 205.11Worst: 52.06 | 7.70s | 5 |
13NEW | 中国科技云大模型 API 开放平台uni-api.cstcloud.cn | gpt-oss-120b | 201.30 t/s Best: 204.53Worst: 180.54 | 0.82s | 5 |
14NEW | a api.02f.cc:8317api.02f.cc:8317 | gpt-5.3-codex-spark | 187.39 t/s Best: 1181.91Worst: 35.51 | 1.33s | 15 |
15NEW | NVIDIA NIMintegrate.api.nvidia.com | openai/gpt-oss-20b | 166.14 t/s Best: 210.74Worst: 85.49 | 0.79s | 10 |
16NEW | a api.02f.cc:8317api.02f.cc:8317 | test-1 | 157.27 t/s Best: 808.31Worst: 33.08 | 2.62s | 5 |
17NEW | a apihub.agnes-ai.comapihub.agnes-ai.com | agnes-1.5-flash | 147.18 t/s Best: 163.80Worst: 128.22 | 0.38s | 5 |
18NEW | NVIDIA NIMintegrate.api.nvidia.com | meta/llama-3.1-8b-instruct | 138.96 t/s Best: 169.77Worst: 122.43 | 0.16s | 5 |
19NEW | a apihub.agnes-ai.comapihub.agnes-ai.com | agnes-2.0-flash | 132.80 t/s Best: 195.01Worst: 23.66 | 0.52s | 20 |
2013 | NVIDIA NIMintegrate.api.nvidia.com | openai/gpt-oss-120b | 129.34 t/s Best: 147.63Worst: 80.62 | 1.01s | 5 |
21NEW | 6345ywz APIapi.6345ywz.cn | gpt-oss-120b | 125.57 t/s Best: 150.20Worst: 105.36 | 0.76s | 5 |
22NEW | a api.cctoken.funapi.cctoken.fun | deepseek-v4-flash | 123.74 t/s Best: 207.68Worst: 24.08 | 2.21s | 5 |
23NEW | n new.xinjianya.topnew.xinjianya.top | grok-4.20-multi-agent-medium | 122.63 t/s Best: 152.14Worst: 32.52 | 8.71s | 5 |
24NEW | b bayunzi.shop:8081bayunzi.shop:8081 | gemini-3.1-pro | 117.71 t/s Best: 126.73Worst: 87.12 | 2.44s | 5 |
25NEW | f freeapi.514179.xyzfreeapi.514179.xyz | ChatGPT‑4‑Turbo | 116.35 t/s Best: 145.24Worst: 41.20 | 1.23s | 5 |
26NEW | 6345ywz APIapi.6345ywz.cn | meta/llama-3.1-8b-instruct | 111.01 t/s Best: 170.28Worst: 95.49 | 0.25s | 10 |
27NEW | t token-center.netopstec.comtoken-center.netopstec.com | Qwen3.6-35B-A3B-APEX-MTP-I-Balanced | 108.29 t/s Best: 112.75Worst: 14.82 | 10.20s | 5 |
28NEW | b bayunzi.shop:8081bayunzi.shop:8081 | gemini-3.5-flash-thinking | 106.79 t/s Best: 121.13Worst: 82.85 | 2.41s | 10 |
29NEW | y yibuapi.comyibuapi.com | deepseek-v4-flash | 106.55 t/s Best: 120.71Worst: 87.91 | 2.30s | 5 |
30NEW | NVIDIA NIMintegrate.api.nvidia.com | nvidia/nemotron-mini-4b-instruct | 100.10 t/s Best: 102.46Worst: 45.64 | 0.39s | 5 |
31NEW | b bayunzi.shop:8081bayunzi.shop:8081 | gemini-3.5-flash | 98.21 t/s Best: 121.37Worst: 87.02 | 2.17s | 5 |
322 | DeepSeekapi.deepseek.com | deepseek-v4-flash | 97.84 t/s Best: 136.19Worst: 74.76 | 1.29s | 20 |
33NEW | NVIDIA NIMintegrate.api.nvidia.com | nvidia/nemotron-3-ultra-550b-a55b | 97.76 t/s Best: 124.85Worst: 73.94 | 1.02s | 10 |
343 | X Xiaomimimo Token Plan CNtoken-plan-cn.xiaomimimo.com | mimo-v2.5 | 92.24 t/s Best: 102.52Worst: 69.16 | 2.04s | 10 |
35NEW | 钠 APInaapi.cc | deepseek-v4-flash | 84.27 t/s Best: 95.50Worst: 71.70 | 5.36s | 5 |
36NEW | a api.cctoken.funapi.cctoken.fun | claude-haiku-4-5-20251001 | 82.50 t/s Best: 100.12Worst: 73.96 | 1.13s | 5 |
37NEW | 讯飞星火maas-coding-api.cn-huabei-1.xf-yun.com | astron-code-latest | 81.60 t/s Best: 126.43Worst: 49.70 | 2.81s | 5 |
38NEW | w www.cctoken.funwww.cctoken.fun | claude-haiku-4-5-20251001 | 81.21 t/s Best: 94.01Worst: 76.18 | 1.10s | 5 |
397 | aigw-gzgy2.cucloud.cn:8443aigw-gzgy2.cucloud.cn:8443 | DeepSeek-V4-Flash | 80.96 t/s Best: 83.59Worst: 36.87 | 1.54s | 5 |
40 | X Xiaomimimo APIapi.xiaomimimo.com | mimo-v2.5 | 79.51 t/s Best: 94.42Worst: 65.44 | 1.62s | 5 |
41NEW | a api.123nhh.comapi.123nhh.com | deepseek-v4-flash | 77.65 t/s Best: 114.84Worst: 61.54 | 2.61s | 15 |
42NEW | 6345ywz APIapi.6345ywz.cn | mimo-v2.5 | 76.15 t/s Best: 92.68Worst: 27.05 | 3.69s | 5 |
43NEW | a api.123nhh.comapi.123nhh.com | agnes-2.0-flash | 73.91 t/s Best: 137.12Worst: 16.28 | 1.38s | 10 |
44NEW | 6345ywz APIapi.6345ywz.cn | tencent/Hunyuan-MT-7B | 71.38 t/s Best: 71.77Worst: 61.14 | 0.30s | 5 |
45NEW | E EdgeFN APIapi.edgefn.net | GLM-5 | 71.33 t/s Best: 86.59Worst: 37.38 | 11.58s | 10 |
46NEW | q qwenplusplan.airoe.cnqwenplusplan.airoe.cn | qwen3.7-max | 64.96 t/s Best: 74.16Worst: 59.17 | 3.95s | 5 |
47NEW | k keungliang.dpdns.orgkeungliang.dpdns.org | deepseek-v4-pro | 64.28 t/s Best: 130.69Worst: 50.00 | 2.79s | 5 |
48NEW | MiniMaxapi.minimaxi.com | MiniMax-M2.7 | 63.14 t/s Best: 134.88Worst: 30.89 | 1.20s | 15 |
49NEW | y yibuapi.comyibuapi.com | gpt-5.5 | 61.51 t/s Best: 82.87Worst: 13.43 | 3.45s | 10 |
50NEW | g gemini.beijixingxing.comgemini.beijixingxing.com | gemini-3-flash-preview[真流] | 60.18 t/s Best: 142.58Worst: 24.29 | 6.11s | 5 |