Leaderboard
Model performance rankings based on speed test results. Compare models across different providers and endpoints.
Throughput is the average number of tokens generated per second (t/s); higher means faster responses. Rows are ranked by throughput, highest first.
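
The page does not say how the two metrics are measured. As a rough sketch of one common approach, assuming a streamed response whose send time, first-token time, finish time, and token count are recorded per test (the `SpeedTest` class and its field names are hypothetical, not taken from the leaderboard):

```python
from dataclasses import dataclass


@dataclass
class SpeedTest:
    """Timings for one test run (hypothetical record layout)."""
    request_sent: float      # seconds, when the request was sent
    first_token_at: float    # seconds, when the first token arrived
    finished_at: float       # seconds, when the last token arrived
    tokens_generated: int    # completion tokens counted for this run


def first_token_latency(t: SpeedTest) -> float:
    """Seconds between sending the request and receiving the first token."""
    return t.first_token_at - t.request_sent


def throughput(t: SpeedTest) -> float:
    """Tokens per second over the generation window.

    Assumption: the wait for the first token is excluded from the window.
    """
    window = t.finished_at - t.first_token_at
    if window <= 0:
        return 0.0  # no measurable generation window
    return t.tokens_generated / window
```

Whether the leaderboard excludes the first-token wait is not stated. Dividing by total elapsed time instead would give lower figures for endpoints with a high first-token latency.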
| Rank | Provider | Model | Throughput | Avg first token latency | Total Tests |
|---|---|---|---|---|---|
| 1 | | gemini-2.5-flash | 4575.77 t/s (Best: 22057.30, Worst: 143.99) | 4.39s | 5 |
| 2 | | llama3.1-8b | 2142.09 t/s (Best: 2534.63, Worst: 861.08) | 0.19s | 5 |
| 3 | Cerebras (api.cerebras.ai) | gpt-oss-120b | 1920.13 t/s (Best: 2415.66, Worst: 1587.06) | 0.54s | 5 |
| 4 | Cerebras (api.cerebras.ai) | llama-3.3-70b | 1532.55 t/s (Best: 1982.40, Worst: 964.64) | 0.25s | 5 |
| 5 | Cerebras (api.cerebras.ai) | qwen-3-235b-a22b-instruct-2507 | 851.89 t/s (Best: 1123.67, Worst: 610.53) | 12.09s | 10 |
| 6 | WONG公益站 (wzw.pp.ua) | grok-4.1 | 729.41 t/s (Best: 1814.47, Worst: 74.51) | 59.04s | 5 |
| 7 | codex-api-slb.packycode.com | gpt-5.2 | 637.15 t/s (Best: 904.25, Worst: 230.58) | 12.16s | 15 |
| 8 | codex-api.packycode.com | gpt-5.2 | 606.84 t/s (Best: 900.16, Worst: 294.43) | 11.49s | 10 |
| 9 | ChatGTP (www.chatgtp.cn) | gpt-4.1-nano-2025-04-14 | 592.40 t/s (Best: 1840.57, Worst: 213.50) | 1.03s | 10 |
| 10 | 156.225.23.250 | deepseek-ai/DeepSeek-V3.2-Exp | 557.51 t/s (Best: 2787.57, Worst: 0.00) | 3.46s | 5 |
| 11 | Cerebras (api.cerebras.ai) | zai-glm-4.7 | 454.25 t/s (Best: 609.14, Worst: 295.71) | 3.57s | 5 |
| 12 | Hugging Face (router.huggingface.co) | meta-llama/Llama-3.3-70B-Instruct | 416.19 t/s (Best: 538.29, Worst: 336.89) | 0.26s | 5 |
| 13 | Hugging Face (router.huggingface.co) | meta-llama/Llama-3.3-70B-Instruct | 416.19 t/s (Best: 538.29, Worst: 336.89) | 0.26s | 5 |
| 14 | New API (new.123nhh.xyz) | gemini-flash-lite-latest | 369.22 t/s (Best: 551.78, Worst: 307.12) | 0.67s | 5 |
| 15 | Realpics (realpics.cn:2234) | Qwen3-0.6B-Q8_0.gguf | 340.00 t/s (Best: 431.04, Worst: 251.73) | 3.08s | 15 |
| 16 | realpics.cn:2234 | Qwen3-0.6B-Q8_0.gguf | 340.00 t/s (Best: 431.04, Worst: 251.73) | 3.08s | 15 |
| 17 | realpics.cn:2234 | Qwen3-0.6B-Q8_0.gguf | 340.00 t/s (Best: 431.04, Worst: 251.73) | 3.08s | 15 |
| 18 | sd.rnglg2.top:30000 | gpt-oss-120b-medium | 337.69 t/s (Best: 393.18, Worst: 266.67) | 2.32s | 20 |
| 19 | Mistral AI (api.mistral.ai) | ministral-3b-2410 | 332.18 t/s (Best: 539.88, Worst: 217.45) | 0.50s | 5 |
| 20 | Mistral AI (api.mistral.ai) | ministral-3b-2410 | 332.18 t/s (Best: 539.88, Worst: 217.45) | 0.50s | 5 |
| 21 | sd.rnglg2.top:30000 | gpt-5-codex-mini | 327.07 t/s (Best: 407.63, Worst: 157.66) | 3.35s | 5 |
| 22 | Realpics (realpics.cn:5002) | Qwen3-0.6B-Q8_0.gguf | 300.87 t/s (Best: 305.13, Worst: 296.03) | 2.59s | 10 |
| 23 | realpics.cn:5002 | Qwen3-0.6B-Q8_0.gguf | 300.87 t/s (Best: 305.13, Worst: 296.03) | 2.59s | 10 |
| 24 | realpics.cn:2234 (realpics.cn:5002) | Qwen3-0.6B-Q8_0.gguf | 300.87 t/s (Best: 305.13, Worst: 296.03) | 2.59s | 10 |
| 25 | sd.rnglg2.top:7777 | gemini-2.5-flash-lite | 298.48 t/s (Best: 370.05, Worst: 234.32) | 1.75s | 5 |
| 26 | elysiver.h-e.top | gemini-3-flash-preview | 263.35 t/s (Best: 427.47, Worst: 188.54) | 6.89s | 5 |
| 27 | sd.rnglg2.top:30000 | gpt-5.1-codex-mini | 248.33 t/s (Best: 265.97, Worst: 214.97) | 1.91s | 5 |
| 28 | sd.rnglg2.top:30000 | gemini-2.5-flash-lite | 240.59 t/s (Best: 359.28, Worst: 114.50) | 2.56s | 25 |
| 29 | lansonsamtest.nuiziyyds.com | gemini-2.5-flash | 206.62 t/s (Best: 260.40, Worst: 142.45) | 7.26s | 5 |
| 30 | Mistral AI (api.mistral.ai) | open-mistral-nemo | 200.44 t/s (Best: 224.63, Worst: 168.65) | 0.40s | 5 |
| 31 | Mistral AI (api.mistral.ai) | open-mistral-nemo | 200.44 t/s (Best: 224.63, Worst: 168.65) | 0.40s | 5 |
| 32 | GG公益站-云GCLI (gcli.ggchan.dev) | gemini-3-pro-preview-search | 199.11 t/s (Best: 422.28, Worst: 110.87) | 15.69s | 5 |
| 33 | 酒馆无限制免费API (api2.aoyou.shop) | 酒馆-Flash-Long | 194.62 t/s (Best: 212.42, Worst: 179.41) | 1.75s | 5 |
| 34 | sd.rnglg2.top:7777 | gemini-2.5-flash | 194.27 t/s (Best: 260.28, Worst: 147.23) | 9.13s | 5 |
| 35 | api.gemai.cc | [官逆C]gemini-3-flash-preview | 193.84 t/s (Best: 258.15, Worst: 138.52) | 5.57s | 5 |
| 36 | 全球AI (globalai.vip) | gemini-2.5-flash | 192.65 t/s (Best: 235.51, Worst: 144.32) | 9.27s | 5 |
| 37 | Mistral AI (api.mistral.ai) | magistral-small-latest | 189.91 t/s (Best: 230.24, Worst: 160.56) | 0.39s | 5 |
| 38 | Mistral AI (api.mistral.ai) | magistral-small-latest | 189.91 t/s (Best: 230.24, Worst: 160.56) | 0.39s | 5 |
| 39 | sd.rnglg2.top:30000 | gemini-2.5-flash | 189.12 t/s (Best: 231.86, Worst: 143.35) | 11.43s | 5 |
| 40 | 黑与白公益站 (ai.hybgzs.com) | gemini-2.5-flash-lite | 186.31 t/s (Best: 355.26, Worst: 129.74) | 1.55s | 5 |
| 41 | Fireworks AI (api.fireworks.ai) | accounts/fireworks/models/minimax-m2p1 | 185.81 t/s (Best: 216.27, Worst: 154.65) | 1.91s | 10 |
| 42 | GG公益站-云GCLI (gcli.ggchan.dev) | gemini-3-flash-preview-search | 184.83 t/s (Best: 276.49, Worst: 117.50) | 9.96s | 5 |
| 43 | XJY API (api.xinjianya.top) | gemini-2.5-flash | 183.01 t/s (Best: 213.04, Worst: 155.85) | 8.28s | 5 |
| 44 | 123asfdgsaedf.netease.mom | gpt-4o | 182.75 t/s (Best: 245.77, Worst: 143.71) | 3.58s | 5 |
| 45 | New API (api.seosycy.com) | gemini-2.0-flash | 181.83 t/s (Best: 202.86, Worst: 160.24) | 1.79s | 5 |
| 46 | ChatGTP (www.chatgtp.cn) | gemini-2.0-flash | 181.28 t/s (Best: 199.82, Worst: 145.48) | 1.02s | 5 |
| 47 | api.vectorengine.ai | gemini-2.0-flash | 175.01 t/s (Best: 192.51, Worst: 154.00) | 0.56s | 5 |
| 48 | zenmux.ai | z-ai/glm-4.6v-flash | 172.60 t/s (Best: 238.26, Worst: 118.48) | 9.42s | 5 |
| 49 | 简小智API中转站 (newapi.jianxiaozhi.chat:88) | gemini-2.0-flash | 171.32 t/s (Best: 194.98, Worst: 137.78) | 1.85s | 5 |
| 50 | 简小智API中转站 (newapi.jianxiaozhi.chat:88) | gemini-2.0-flash | 171.32 t/s (Best: 194.98, Worst: 137.78) | 1.85s | 5 |
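
Each row reports an average, a best, and a worst throughput over its tests. A minimal sketch of how such a summary and ranking could be computed, assuming per-test throughputs are available as plain lists. The endpoint names and numbers below are made up for illustration, and the median is an extra column that the leaderboard itself does not show:

```python
from statistics import mean, median


def summarize(runs: list[float]) -> dict[str, float]:
    """Collapse per-test throughputs (t/s) into leaderboard-style figures."""
    return {
        "avg": round(mean(runs), 2),
        "best": max(runs),
        "worst": min(runs),
        "median": round(median(runs), 2),  # not on the leaderboard; shown for comparison
        "tests": len(runs),
    }


def rank(results: dict[str, list[float]]) -> list[tuple[int, str, dict[str, float]]]:
    """Rank entries by average throughput, highest first; entries with no tests are skipped."""
    summaries = {name: summarize(runs) for name, runs in results.items() if runs}
    ordered = sorted(summaries.items(), key=lambda kv: kv[1]["avg"], reverse=True)
    return [(i, name, s) for i, (name, s) in enumerate(ordered, start=1)]


# Hypothetical per-test throughputs in t/s (not data from the table above).
sample = {
    "endpoint-a/model-x": [200.0, 210.0, 190.0, 205.0, 5000.0],
    "endpoint-b/model-y": [900.0, 950.0, 1000.0],
}
leaderboard = rank(sample)
for position, name, stats in leaderboard:
    print(position, name, stats)
```

In this made-up sample, one very fast run lifts `endpoint-a/model-x` above an endpoint that is faster in every other test. The same effect may explain rows with a wide gap between best and worst, such as rank 1 (best 22057.30, worst 143.99), so comparing best and worst alongside the average gives a fuller picture.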