Leaderboard
Multi-dimensional rankings based on model speed tests and provider health checks. Compare providers, endpoints, and reliability at a glance.
Average tokens generated per second. Higher is better for fast responses.
| Rank | Provider | Model | Throughput | Avg first token latency | Total Tests |
|---|---|---|---|---|---|
| 1 | llama3.1-8b | 1096.01 t/s Best: 1506.55Worst: 556.32 | 0.45s | 10 | |
| 2 | llama-3.3-70b | 970.00 t/s Best: 1534.72Worst: 658.59 | 0.52s | 10 | |
| 3 |
Veloeralinjinpeng-veloera.hf.space |
| llama-4-109b |
969.99 t/s Best: 1383.86Worst: 660.49 |
0.48s |
| 15 |
| 4 | Veloeralinjinpeng-veloera.hf.space | qwen-3-32b | 733.22 t/s Best: 875.02Worst: 628.83 | 0.47s | 5 |
| 5 | OpenRouteropenrouter.ai | inception/mercury-coder-small-beta | 393.33 t/s Best: 481.97Worst: 280.40 | 1.59s | 5 |
| 6 | A AI Toolsplatform.aitools.cfd | deepseek/deepseek-v3 | 344.09 t/s Best: 2672.02Worst: 26.19 | 4.36s | 10 |
| 7 | 登 登录 - Fo-APIv2.voct.top | gpt-4.1-nano | 316.06 t/s Best: 975.08Worst: 154.51 | 2.06s | 10 |
| 8 | AAAIapi.aaai.vip | gemini-2.5-flash-lite-preview-06-17 | 310.35 t/s Best: 393.85Worst: 216.49 | 1.38s | 5 |
| 9 | A AI Toolsplatform.aitools.cfd | deepseek/deepseek-v3-0324 | 306.89 t/s Best: 2305.82Worst: 29.14 | 4.17s | 15 |
| 10 | 登 登录 - Fo-APIv2.voct.top | gpt-4.1-mini | 291.46 t/s Best: 3559.38Worst: 56.68 | 3.31s | 20 |
| 11 | DashScopedashscope.aliyuncs.com | qwen3-0.6b | 261.65 t/s Best: 382.13Worst: 213.60 | 2.60s | 5 |
| 12 | A ASXS APIai.asxs.top | gemini-2.5-flash-preview-05-20-max | 224.58 t/s Best: 424.65Worst: 69.37 | 7.44s | 10 |
| 13 | 登 登录 - Fo-APIv2.voct.top | gpt-4.1-mini-2025-04-14 | 222.69 t/s Best: 2480.85Worst: 44.11 | 3.91s | 20 |
| 14 | A ASXS APIai.asxs.top | gemini-2.5-flash-fastmax | 196.98 t/s Best: 224.95Worst: 170.62 | 10.93s | 5 |
| 15 | d d06131241-vllmbasev03-201-v5gshhys-8000.550c.cloudd06131241-vllmbasev03-201-v5gshhys-8000.550c.cloud | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 194.01 t/s Best: 194.94Worst: 193.20 | 1.00s | 5 |
| 16 | 共绩算力d06131241-vllmbasev03-201-v5gshhys-8000.550c.cloud | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 194.01 t/s Best: 194.94Worst: 193.20 | 1.00s | 5 |
| 17 | d d06131241-vllmbasev03-201-v5gshhys-8000.550c.cloudd06131241-vllmbasev03-201-v5gshhys-8000.550c.cloud | deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 194.01 t/s Best: 194.94Worst: 193.20 | 1.00s | 5 |
| 18 | O One APIgpt.fitue.cc | gemini-2.5-flash | 192.77 t/s Best: 332.83Worst: 138.87 | 9.34s | 10 |
| 19 | V-APIus.vveai.com | gpt-4.1-nano | 164.24 t/s Best: 197.04Worst: 121.43 | 1.44s | 5 |
| 20 | b bt6.topk.bt6.top | gemini-2.5-pro-preview-05-06 | 158.75 t/s Best: 176.72Worst: 126.56 | 9.72s | 5 |
| 21 | 1 154.17.230.220:3000154.17.230.220:3000 | gpt-4.1-mini | 156.81 t/s Best: 260.70Worst: 49.57 | 0.73s | 50 |
| 22 | 1 154.17.230.220:3000154.17.230.220:3000 | gpt-4.1 | 154.62 t/s Best: 225.79Worst: 101.14 | 0.80s | 25 |
| 23 | V-APIcf.v36.cm | gpt-4.1-nano | 152.73 t/s Best: 203.66Worst: 104.01 | 1.05s | 5 |
| 24 | AAAIapi.aaai.vip | gpt-4.1-nano | 151.24 t/s Best: 192.36Worst: 126.62 | 1.51s | 5 |
| 25 | O One APIgpt.fitue.cc | gemini-2.0-flash | 149.12 t/s Best: 179.11Worst: 118.03 | 0.74s | 5 |
| 26 | V-APIapi.vveai.com | gpt-4.1-nano | 137.68 t/s Best: 184.12Worst: 62.74 | 1.05s | 10 |
| 27 | DashScopedashscope.aliyuncs.com | deepseek-r1-distill-qwen-1.5b | 135.25 t/s Best: 155.44Worst: 88.14 | 6.13s | 5 |
| 28 | t translate.doi9.toptranslate.doi9.top | gpt-4.1-nano | 118.02 t/s Best: 312.31Worst: 62.01 | 8.08s | 5 |
| 29 | A ASXS APIai.asxs.top | gemini-2.5-pro-preview-06-05-max | 106.92 t/s Best: 142.44Worst: 69.53 | 13.33s | 20 |
| 30 | GPTs APIapi.gptsapi.net | gpt-4o | 105.12 t/s Best: 147.31Worst: 36.17 | 2.04s | 10 |
| 31 | DashScopedashscope.aliyuncs.com | qwen3-8b | 102.46 t/s Best: 127.79Worst: 90.56 | 8.15s | 5 |
| 32 | N Neo APIapi.chiban.de | DeepSeek-V3-Fast | 102.13 t/s Best: 128.63Worst: 69.76 | 0.60s | 5 |
| 33 | A Academic Sanctumassx.top:13101 | Gemini-2.5-Pro | 98.70 t/s Best: 119.05Worst: 79.21 | 19.14s | 5 |
| 34 | A Academic Sanctumassx.top:13101 | Gemini-2.5-Pro | 98.70 t/s Best: 119.05Worst: 79.21 | 19.14s | 5 |
| 35 | A Academic Sanctumassx.top:13101 | GPT-4o-mini | 95.24 t/s Best: 107.86Worst: 78.51 | 4.75s | 5 |
| 36 | A Academic Sanctumassx.top:13101 | GPT-4o-mini | 95.24 t/s Best: 107.86Worst: 78.51 | 4.75s | 5 |
| 37 | a api.almzbh.icuapi.almzbh.icu | gpt-4o-mini | 94.37 t/s Best: 126.09Worst: 48.22 | 1.47s | 5 |
| 38 | A ASXS APIai.asxs.top | gemini-2.5-pro-fastmax | 89.41 t/s Best: 99.04Worst: 73.85 | 23.79s | 5 |
| 39 | t translate.doi9.toptranslate.doi9.top | gpt-4.1-mini | 86.62 t/s Best: 141.56Worst: 18.97 | 8.62s | 5 |
| 40 | 1 154.17.230.220:3000154.17.230.220:3000 | o4-mini | 86.05 t/s Best: 146.48Worst: 66.71 | 3.61s | 10 |
| 41 | SiliconFlowapi.siliconflow.cn | Qwen/Qwen2-7B-Instruct | 85.82 t/s Best: 96.99Worst: 74.48 | 0.61s | 5 |
| 42 | O One APIgpt.fitue.cc | doubao-seed-1-6-flash | 82.66 t/s Best: 99.25Worst: 63.66 | 3.57s | 5 |
| 43 | V-APIapi.vveai.com | gpt-4o-mini | 77.53 t/s Best: 126.48Worst: 33.52 | 1.29s | 5 |
| 44 | 登 登录 - Fo-APIv2.voct.top | gpt-4o-mini | 76.86 t/s Best: 156.36Worst: 48.90 | 3.64s | 15 |
| 45 | SiliconFlowapi.siliconflow.cn | Pro/THUDM/glm-4-9b-chat | 75.87 t/s Best: 80.21Worst: 65.89 | 0.56s | 5 |
| 46 | New APIapi.lianwusuoai.top | 沉浸式翻译 | 72.24 t/s Best: 79.55Worst: 66.21 | 1.01s | 5 |
| 47 | 智谱AI开放平台open.bigmodel.cn | glm-4-flashx | 71.64 t/s Best: 73.12Worst: 70.67 | 0.29s | 5 |
| 48 | SiliconFlowapi.siliconflow.cn | Pro/Qwen/Qwen2-7B-Instruct | 71.29 t/s Best: 75.66Worst: 68.18 | 0.56s | 5 |
| 49 | a arkark.cn-beijing.volces.com | doubao-seed-1-6-flash-250615 | 68.67 t/s Best: 77.37Worst: 60.04 | 3.88s | 5 |
| 50 | a arkark.cn-beijing.volces.com | doubao-seed-1-6-250615 | 67.76 t/s Best: 76.79Worst: 58.34 | 12.10s | 5 |