Leaderboard
Model performance rankings based on speed test results. Compare models across different providers and endpoints.
Average tokens generated per second. Higher is better for fast responses.
| Rank | Provider | Model | Throughput | Avg first token latency | Total Tests |
|---|---|---|---|---|---|
| 1 | gemini-2.5-flash | 104600.84 t/s Best: 115213.03Worst: 81275.31 | 19.21s | 5 | |
| 2 | jimmy | 101506.95 t/s Best: 145658.50Worst: 13204.57 | 0.59s | 5 | |
| 3 |
a api.amethyst.ltdapi.amethyst.ltd |
| jimmy |
86213.91 t/s Best: 138352.88Worst: 42053.25 |
0.58s |
| 10 |
| 4 | 2 20230621.pp.ua20230621.pp.ua | translate-model | 31767.17 t/s Best: 48227.39Worst: 13109.67 | 1.06s | 5 |
| 5 | S SWT-APIapi.lhyb.dpdns.org | gemini-3-pro | 14970.09 t/s Best: 17228.53Worst: 9361.83 | 3.36s | 5 |
| 6 | 素 素墨APIapifree.rensumo.top | echo | 6152.15 t/s Best: 22934.12Worst: 506.54 | 0.75s | 15 |
| 7 | XJY APIapi.xinjianya.top | grok-imagine-1.0-fast | 4998.02 t/s Best: 7933.91Worst: 1462.69 | 4.80s | 15 |
| 8 | 钠 APIus.naapi.cc | mercury-2 | 1653.71 t/s Best: 6228.40Worst: 371.89 | 2.50s | 5 |
| 9 | 钠 APIus.naapi.cc | mercury-2 | 1653.71 t/s Best: 6228.40Worst: 371.89 | 2.50s | 5 |
| 10 | 素 素墨APIapifree.rensumo.top | llama3.1-8B | 1421.44 t/s Best: 2829.11Worst: 100.62 | 0.94s | 10 |
| 11 | 素 素墨APIapifree.rensumo.top | 快速/llama3.1-8B | 1258.69 t/s Best: 2172.93Worst: 595.62 | 1.24s | 15 |
| 12 | 素 素墨APIapifree.rensumo.top | llama3.1-8b | 731.95 t/s Best: 1144.52Worst: 67.76 | 1.17s | 15 |
| 13 | a api.rnglg2.top:30000api.rnglg2.top:30000 | inception/mercury | 386.10 t/s Best: 525.30Worst: 123.95 | 1.26s | 15 |
| 14 | XJY APIapi.xinjianya.top | nvidia/nemotron-3-nano-30b-a3b | 246.87 t/s Best: 299.20Worst: 195.73 | 1.15s | 5 |
| 15 | s skyag.xiamu.asiaskyag.xiamu.asia | gcli-gemini-2.5-flash | 202.50 t/s Best: 249.46Worst: 145.06 | 9.33s | 5 |
| 16 | XJY APIapi.xinjianya.top | meta/llama-3.1-8b-instruct | 200.27 t/s Best: 209.02Worst: 192.35 | 0.42s | 15 |
| 17 | w www.uniaix.comwww.uniaix.com | gemini-2.5-flash | 186.47 t/s Best: 322.42Worst: 123.16 | 9.77s | 10 |
| 18 | n newapi.kzwbelieve.topnewapi.kzwbelieve.top | gemini-2.5-flash | 185.65 t/s Best: 241.24Worst: 145.00 | 10.41s | 5 |
| 19 | a api.modelverse.cnapi.modelverse.cn | gemini-2.5-flash | 177.77 t/s Best: 222.97Worst: 134.50 | 17.34s | 5 |
| 20 | 2 20230621.pp.ua20230621.pp.ua | translate-model | 169.06 t/s Best: 282.13Worst: 69.51 | 1.17s | 5 |
| 21 | DashScopecoding.dashscope.aliyuncs.com | qwen3-coder-next | 132.13 t/s Best: 186.52Worst: 101.86 | 4.36s | 5 |
| 22 | c coding.dashscope.aliyuncs.comcoding.dashscope.aliyuncs.com | qwen3-coder-next | 132.13 t/s Best: 186.52Worst: 101.86 | 4.36s | 5 |
| 23 | OpenRouteropenrouter.ai | stepfun/step-3.5-flash:free | 126.56 t/s Best: 184.37Worst: 98.47 | 9.46s | 5 |
| 24 | 包 包子铺api.5202030.xyz | claude-sonnet-4-5-20250929 | 125.92 t/s Best: 181.38Worst: 93.14 | 1.27s | 5 |
| 25 | 包 包子铺api.5202030.xyz | grok-4 | 125.14 t/s Best: 185.94Worst: 80.98 | 3.54s | 5 |
| 26 | S SWT-APIapi.lhyb.dpdns.org | gemini-3-pro-poe | 103.53 t/s Best: 115.29Worst: 85.06 | 13.75s | 5 |
| 27 | Seamee APInapi.seaya.link | auto-translator | 103.35 t/s Best: 214.60Worst: 63.64 | 0.70s | 10 |
| 28 | XJY APIapi.xinjianya.top | grok-4.1-fast | 99.38 t/s Best: 128.20Worst: 82.86 | 1.37s | 5 |
| 29 | A AI Toolsplatform.aitools.cfd | zhipu/glm-4.6v-flash | 95.24 t/s Best: 172.17Worst: 0.00 | 6.38s | 5 |
| 30 | XJY APIapi.xinjianya.top | ibm/granite-guardian-3.0-8b | 93.66 t/s Best: 133.93Worst: 55.74 | 0.61s | 10 |
| 31 | A AI Toolsplatform.aitools.cfd | qwen/qwen2.5-7b | 90.28 t/s Best: 110.81Worst: 36.90 | 0.92s | 5 |
| 32 | SiliconFlowapi.siliconflow.cn | Pro/MiniMaxAI/MiniMax-M2.5 | 85.18 t/s Best: 89.25Worst: 72.91 | 6.00s | 5 |
| 33 | DashScopecoding.dashscope.aliyuncs.com | qwen3.5-plus | 73.24 t/s Best: 95.82Worst: 56.44 | 12.66s | 10 |
| 34 | c coding.dashscope.aliyuncs.comcoding.dashscope.aliyuncs.com | qwen3.5-plus | 73.24 t/s Best: 95.82Worst: 56.44 | 12.66s | 10 |
| 35 | XJY APIapi.xinjianya.top | grok-4.1-mini | 73.19 t/s Best: 102.55Worst: 53.30 | 7.00s | 5 |
| 36 | DashScopecoding.dashscope.aliyuncs.com | MiniMax-M2.5 | 72.72 t/s Best: 112.99Worst: 40.63 | 18.43s | 10 |
| 37 | c coding.dashscope.aliyuncs.comcoding.dashscope.aliyuncs.com | MiniMax-M2.5 | 72.72 t/s Best: 112.99Worst: 40.63 | 18.43s | 10 |
| 38 | 算 算了么 APIapi.suanli.cn | Qwen/Qwen3-VL-32B-Thinking | 60.96 t/s Best: 81.95Worst: 42.79 | 22.31s | 5 |
| 39 | a ai.san.babyai.san.baby | gpt-5.2 | 57.02 t/s Best: 93.80Worst: 34.64 | 4.60s | 5 |
| 40 | a api.amethyst.ltdapi.amethyst.ltd | qwen-3.5-plus | 55.05 t/s Best: 65.34Worst: 42.13 | 3.10s | 5 |
| 41 | A AI Toolsplatform.aitools.cfd | zhipu/glm-4v-flash | 54.32 t/s Best: 62.73Worst: 44.56 | 0.53s | 5 |
| 42 | A AI Toolsplatform.aitools.cfd | zhipu/glm-4.7-flash | 53.60 t/s Best: 93.91Worst: 0.00 | 25.49s | 25 |
| 43 | A AI Toolsplatform.aitools.cfd | google/gemma-3-27b | 53.59 t/s Best: 54.87Worst: 52.53 | 1.23s | 5 |
| 44 | A AI Toolsplatform.aitools.cfd | zhipu/glm-4-9b | 52.73 t/s Best: 57.98Worst: 41.24 | 0.64s | 5 |
| 45 | A AI Toolsplatform.aitools.cfd | google/gemma-3-27b | 52.12 t/s Best: 56.48Worst: 47.60 | 4.29s | 10 |
| 46 | n newapi.kzwbelieve.topnewapi.kzwbelieve.top | claude-sonnet-4-6 | 48.66 t/s Best: 68.32Worst: 37.12 | 1.94s | 5 |
| 47 | SiliconFlowapi.siliconflow.cn | Pro/deepseek-ai/DeepSeek-V3.2 | 47.80 t/s Best: 57.78Worst: 36.04 | 32.32s | 5 |
| 48 | a apifs.shubiaobiao.cnapifs.shubiaobiao.cn | claude-sonnet-4-6 | 46.71 t/s Best: 65.13Worst: 32.60 | 1.07s | 5 |
| 49 | 云 云智APIyunzhiapi.cn | Mimo-v2-Flash | 46.11 t/s Best: 170.56Worst: 0.00 | 1.21s | 75 |
| 50 | a api.123nhh.meapi.123nhh.me | GPT-5.3 Codex Spark | 45.83 t/s Best: 54.82Worst: 37.09 | 1.67s | 5 |