Leaderboard
Multi-dimensional rankings based on model speed tests and provider health checks. Compare providers, endpoints, and reliability at a glance.
Average time to first token. Lower is better for responsiveness.
| Rank | Provider | Model | First Token Latency | Avg tokens per second | Total Tests |
|---|---|---|---|---|---|
| 1 | glm-z1-air | 0.30 s Best: 0.28Worst: 0.33 | 52.65t/s | 5 | |
| 2 | GLM-4-FlashX | 0.30 s Best: 0.28Worst: 0.36 | 61.60t/s | 5 | |
| 3 |
智谱AI开放平台open.bigmodel.cn |
| glm-4-flash-250414 |
0.31 s Best: 0.20Worst: 0.44 |
32.24t/s |
| 5 |
| 4 | 百 百度千帆qianfan.baidubce.com | qwen3-0.6b | 0.33 s Best: 0.27Worst: 0.39 | 151.91t/s | 5 |
| 5 | 智谱AI开放平台open.bigmodel.cn | GLM-4-Flash | 0.36 s Best: 0.31Worst: 0.42 | 47.16t/s | 5 |
| 6 | C ChatST APIapi.chatst.org | qwen-3-235b-2507 | 0.37 s Best: 0.28Worst: 0.66 | 625.49t/s | 5 |
| 7 | SophNetwww.sophnet.com | DeepSeek-V3-Fast | 0.37 s Best: 0.27Worst: 0.57 | 109.61t/s | 10 |
| 8 | SophNetwww.sophnet.com | DeepSeek-V3-Fast | 0.50 s Best: 0.31Worst: 0.81 | 84.97t/s | 5 |
| 9 | SophNetwww.sophnet.com | DeepSeek-v3 | 0.56 s Best: 0.46Worst: 0.64 | 29.03t/s | 5 |
| 10 | SiliconFlowapi.siliconflow.cn | internlm/internlm2_5-7b-chat | 0.56 s Best: 0.50Worst: 0.60 | 68.15t/s | 10 |
| 11 | SiliconFlowapi.siliconflow.cn | Qwen/Qwen2-7B-Instruct | 0.56 s Best: 0.54Worst: 0.59 | 68.21t/s | 5 |
| 12 | C ChatST APIapi.chatst.org | meta-llama/llama-4-scout-17b-16e-instruct | 0.63 s Best: 0.51Worst: 0.96 | 444.26t/s | 5 |
| 13 | SiliconFlowapi.siliconflow.cn | Qwen/Qwen2.5-32B-Instruct | 0.63 s Best: 0.52Worst: 0.74 | 59.09t/s | 10 |
| 14 | SiliconFlowapi.siliconflow.cn | THUDM/glm-4-9b-chat | 0.63 s Best: 0.56Worst: 0.71 | 77.31t/s | 5 |
| 15 | 柏拉图AIapi.bltcy.cn | gemini-2.5-flash-lite | 0.65 s Best: 0.52Worst: 1.37 | 390.20t/s | 20 |
| 16 | V Veloerazone.veloera.org | moonshotai/Kimi-K2-Instruct | 0.66 s Best: 0.58Worst: 0.97 | 77.91t/s | 10 |
| 17 | 柏拉图AIapi.bltcy.ai | gemini-2.5-flash-lite | 0.66 s Best: 0.54Worst: 1.01 | 371.07t/s | 5 |
| 18 | SiliconFlowapi.siliconflow.cn | Pro/THUDM/glm-4-9b-chat | 0.69 s Best: 0.55Worst: 1.06 | 76.64t/s | 5 |
| 19 | DashScopedashscope.aliyuncs.com | qwen-plus-latest | 0.72 s Best: 0.69Worst: 0.73 | 27.57t/s | 5 |
| 20 | Infini AIcloud.infini-ai.com | qwen2.5-7b-instruct | 0.74 s Best: 0.58Worst: 1.29 | 54.46t/s | 5 |
| 21 | 4 47.79.39.127:300047.79.39.127:3000 | gemini-2.5-flash-lite-preview-06-17 | 0.74 s Best: 0.63Worst: 1.07 | 407.69t/s | 5 |
| 22 | 智谱AI开放平台open.bigmodel.cn | GLM-4-Flash-250414 | 0.78 s Best: 0.35Worst: 1.88 | 38.23t/s | 5 |
| 23 | ChatAnywhereapi.chatanywhere.org | gemini-2.5-flash-lite-preview-06-17 | 0.78 s Best: 0.71Worst: 0.87 | 386.12t/s | 5 |
| 24 | SiliconFlowapi.siliconflow.cn | Qwen/Qwen2.5-7B-Instruct | 0.78 s Best: 0.66Worst: 0.96 | 22.05t/s | 10 |
| 25 | 柏拉图AIapi.bltcy.ai | gpt-4.1 | 0.82 s Best: 0.67Worst: 1.04 | 78.97t/s | 5 |
| 26 | 柏拉图AIapi.bltcy.cn | gpt-4.1 | 0.82 s Best: 0.62Worst: 1.16 | 77.76t/s | 15 |
| 27 | Deno Deploydash.deno.com | deepseek-ai/DeepSeek-V3-0324-Turbo | 0.82 s Best: 0.49Worst: 1.34 | 179.62t/s | 5 |
| 28 | YUNWU APIyunwu.ai | gemini-2.5-flash-lite-preview-06-17 | 0.84 s Best: 0.67Worst: 1.22 | 381.52t/s | 5 |
| 29 | SiliconFlowapi.siliconflow.cn | Qwen/Qwen2.5-72B-Instruct-128K | 0.86 s Best: 0.64Worst: 1.37 | 18.27t/s | 5 |
| 30 | ChatAnywhereapi.chatanywhere.tech | gpt-4o | 0.87 s Best: 0.81Worst: 0.95 | 136.68t/s | 5 |
| 31 | 腾讯云api.hunyuan.cloud.tencent.com | hunyuan-lite | 0.90 s Best: 0.81Worst: 0.98 | 140.56t/s | 5 |
| 32 | ChatAnywhereapi.chatanywhere.tech | gemini-2.5-flash-lite-preview-06-17 | 0.92 s Best: 0.77Worst: 1.06 | 388.26t/s | 5 |
| 33 | DashScopedashscope.aliyuncs.com | qwen-plus | 0.94 s Best: 0.60Worst: 1.77 | 21.43t/s | 5 |
| 34 | DashScopedashscope.aliyuncs.com | qwen2.5-14b-instruct | 0.96 s Best: 0.69Worst: 1.89 | 49.95t/s | 5 |
| 35 | a api.almzbh.icuapi.almzbh.icu | DeepSeek-V3-0324-80 | 0.97 s Best: 0.74Worst: 1.31 | 129.14t/s | 5 |
| 36 | ChatAnywhereapi.chatanywhere.org | gpt-4.1-ca | 1.00 s Best: 0.51Worst: 1.44 | 87.57t/s | 5 |
| 37 | C ClawCloud Runakmfmietibxz.ap-southeast-1.clawcloudrun.com | google/gemma-3-27b-it | 1.01 s Best: 0.81Worst: 1.46 | 37.62t/s | 5 |
| 38 | C ClawCloud Runakmfmietibxz.ap-southeast-1.clawcloudrun.com | google/gemma-3-27b-it | 1.01 s Best: 0.81Worst: 1.46 | 37.62t/s | 5 |
| 39 | C ClawCloud Runakmfmietibxz.ap-southeast-1.clawcloudrun.com | google/gemma-3-27b-it | 1.01 s Best: 0.81Worst: 1.46 | 37.62t/s | 5 |
| 40 | 算 算了么 APIapi.suanli.cn | QwQ-32B | 1.03 s Best: 0.63Worst: 1.66 | 23.17t/s | 10 |
| 41 | a arkark.cn-beijing.volces.com | deepseek-v3-250324 | 1.03 s Best: 0.81Worst: 1.49 | 24.60t/s | 5 |
| 42 | 共绩算力d07262148-ollama-webui-qwen3v2-2957-yi0fnkga-11434.550c.cloud | qwen3:30b-a3b | 1.04 s Best: 0.42Worst: 4.58 | 107.03t/s | 15 |
| 43 | 共绩算力d07262148-ollama-webui-qwen3v2-2957-yi0fnkga-11434.550c.cloud | qwen3:30b-a3b | 1.04 s Best: 0.42Worst: 4.58 | 107.03t/s | 15 |
| 44 | 共绩算力d07262148-ollama-webui-qwen3v2-2957-yi0fnkga-11434.550c.cloud | qwen3:30b-a3b | 1.04 s Best: 0.42Worst: 4.58 | 107.03t/s | 15 |
| 45 | ChatAnywhereapi.chatanywhere.tech | gpt-4o-ca | 1.06 s Best: 0.81Worst: 1.49 | 124.83t/s | 5 |
| 46 | C ChatST APIapi.chatst.org | GLM-4.5-Air | 1.08 s Best: 0.57Worst: 1.93 | 120.66t/s | 5 |
| 47 | SiliconFlowapi.siliconflow.cn | Qwen/Qwen2.5-72B-Instruct | 1.08 s Best: 0.61Worst: 2.60 | 29.59t/s | 5 |
| 48 | A AI Toolsplatform.aitools.cfd | zhipu/glm-4-flash | 1.09 s Best: 0.42Worst: 9.29 | 31.71t/s | 460 |
| 49 | a api.almzbh.icuapi.almzbh.icu | DeepSeek-V3-0324 | 1.10 s Best: 0.71Worst: 1.77 | 95.89t/s | 5 |
| 50 | 百 百度千帆qianfan.baidubce.com | qwen2.5-7b-instruct | 1.11 s Best: 1.05Worst: 1.16 | 46.17t/s | 5 |