| NVIDIA NIMintegrate.api.nvidia.com | meta/llama-4-maverick-17b-128e-instruct | | 100.29t/s | 15 |
| SiliconFlowapi.siliconflow.cn | tencent/Hunyuan-MT-7B | | 67.44t/s | 10 |
| NVIDIA NIMintegrate.api.nvidia.com | minimaxai/minimax-m2.1 | | 95.95t/s | 10 |
| | llama3.1-8B | | 1070.27t/s | 15 |
| SiliconFlowapi.siliconflow.cn | THUDM/GLM-4-9B-0414 | | 63.75t/s | 10 |
| I IPv4 Beta LM Studioipv4-beta.kxcym.top:11434 | qwen3.5-0.8b | | 356.07t/s | 10 |
| NVIDIA NIMintegrate.api.nvidia.com | institute-of-science-tokyo/llama-3.1-swallow-70b-instruct-v0.1 | | 19.05t/s | 10 |
| NVIDIA NIMintegrate.api.nvidia.com | abacusai/dracarys-llama-3.1-70b-instruct | | 19.09t/s | 10 |
| I IPv4 Beta LM Studioipv4-beta.kxcym.top:11434 | huihui-ai/Huihui-Qwen3.5-0.8B-abliterated | | 195.97t/s | 10 |
| A AI Toolsplatform.aitools.cfd | zhipu/glm-4v-flash | | 51.56t/s | 45 |
| SiliconFlowapi.siliconflow.cn | deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | | 83.11t/s | 10 |
| NVIDIA NIMintegrate.api.nvidia.com | meta/llama-3.1-405b-instruct | | 22.72t/s | 20 |
| K | llama3.1-8B | | 784.80t/s | 15 |
| NVIDIA NIMintegrate.api.nvidia.com | openai/gpt-oss-120b | | 201.82t/s | 25 |
| SiliconFlowapi.siliconflow.cn | Qwen/Qwen3-8B | | 23.35t/s | 10 |
| NVIDIA NIMintegrate.api.nvidia.com | minimaxai/minimax-m2.5 | | 64.27t/s | 30 |
| A AI Toolsplatform.aitools.cfd | qwen/qwen2.5-7b | | 93.01t/s | 140 |
| NVIDIA NIMintegrate.api.nvidia.com | moonshotai/kimi-k2-instruct | | 41.18t/s | 10 |
| Supabase AI Proxyttknrllwjndwdtycqqfv.supabase.co | llama3.1-8b | | 2365.78t/s | 10 |
| A AI Toolsplatform.aitools.cfd | qwen/qwen2.5-7b | | 96.76t/s | 10 |
| A AI Toolsplatform.aitools.cfd | zhipu/glm-4-flash | | 29.19t/s | 2525 |
| NVIDIA NIMintegrate.api.nvidia.com | qwen/qwen3-coder-480b-a35b-instruct | | 51.33t/s | 25 |
| 温 温云sxtuyxrxcgim.ap-northeast-1.clawcloudrun.com | moonshotai/kimi-k2-instruct-0905 | | 65.70t/s | 10 |
| Q | GLM-4-Flash-250414 | | 38.40t/s | 10 |
| G GankInterview LLMllm.gankinterview.com | gemini-2.5-flash-lite | | 251.28t/s | 10 |
| NVIDIA NIMintegrate.api.nvidia.com | qwen/qwen3.5-122b-a10b | | 23.24t/s | 55 |
| X XShuLab Sub2APIapi.xshulab.com | gpt-5.4-mini | | 162.55t/s | 10 |
| | grok-4.20-beta | | 46.28t/s | 15 |
| Vercel AI Gatewayai-gateway.vercel.sh | google/gemini-2.5-flash-lite | | 209.12t/s | 10 |
| ModelScopems-ens-1f4a9445-d0e7.api-inference.modelscope.cn | kgiser/gpu_gpt_5 | | 30.27t/s | 10 |
| 素 | openai/gpt-oss-20b | | 272.22t/s | 10 |
| SiliconFlowapi.siliconflow.cn | deepseek-ai/DeepSeek-V3.2 | | 22.93t/s | 15 |
| 阿里云百炼 DashScopecoding.dashscope.aliyuncs.com | kimi-k2.5 | | 42.23t/s | 65 |
| | kimi-k2.5 | | 1073.96t/s | 15 |
| M Mars HKmars-hk.duckdns.org:38317 | GLM-4.7 | | 129.35t/s | 15 |
| | deepseek-chat | | 32.12t/s | 15 |
| I IPv4 Beta LM Studioipv4-beta.kxcym.top:11434 | qwen3.5-instruct | | 56.39t/s | 10 |
| | gpt-5.4 | | 49.20t/s | 25 |
| | qwen3-coder-plus | | 38.40t/s | 10 |
| A AI Toolsplatform.aitools.cfd | qwen/qwen3-8b | | 25.26t/s | 85 |
| | gpt-5.4(high) | | 51.89t/s | 15 |
| | gpt-5.1-codex-mini | | 186.77t/s | 10 |
| | arcee-ai/trinity-large-preview:free | | 6.42t/s | 15 |
| SiliconFlowapi.siliconflow.cn | deepseek-ai/DeepSeek-V3 | | 18.82t/s | 15 |
| 火山引擎 Arkark.cn-beijing.volces.com | deepseek-v3-2-251201 | | 29.79t/s | 10 |
| | 李/gpt-5.3-codex | | 53.69t/s | 10 |
| | gpt-5.2-codex | | 56.92t/s | 11 |
| | gpt-5.2 | | 53.95t/s | 10 |
| S | gemini-3.1-flash-lite-preview | | 182.91t/s | 10 |
| | MiniMax-M2.7-highspeed | | 47.25t/s | 20 |