AI模型聚合平台,提供统一API接口访问多种大语言模型,支持负载均衡和成本优化功能。
| 模型 | 速度 | 延迟 | 测试数 |
|---|---|---|---|
| qwen3-next-80b-a3b-instruct | 164.04 t/s | 0.45s | 5 |
| qwen3-next-80b-a3b-instruct | 164.04 t/s | 0.45s | 5 |
| qwen3-8b | 150.65 t/s | 5.56s | 5 |
| hunyuan-a13b-instruct | 143.03 t/s | 3.85s | 5 |
| deepseek-v3.1 | 57.51 t/s | 0.30s | 5 |
| deepseek-v3.1 | 57.51 t/s | 0.30s | 5 |
| deepseek-v3.2-exp | 39.89 t/s | 0.58s | 5 |
| deepseek-v3.2-exp | 39.89 t/s | 0.58s | 5 |
| kimi-k2-instruct-0905 | 13.20 t/s | 1.09s | 5 |
| 时间 | 模型 | 速度 | 延迟 |
|---|---|---|---|
| Nov 14, 10:21 AM | deepseek-v3.2-exp | 39.89 t/s | 0.58s |
| Nov 3, 12:17 PM | hunyuan-a13b-instruct | 143.03 t/s | 3.85s |
| Sep 23, 04:24 AM | kimi-k2-instruct-0905 | 13.20 t/s | 1.09s |
| Sep 23, 04:20 AM | deepseek-v3.1 | 57.51 t/s | 0.30s |
| Sep 23, 03:46 AM | qwen3-next-80b-a3b-instruct | 164.04 t/s | 0.45s |
| Sep 23, 03:44 AM | qwen3-8b | 150.65 t/s | 5.56s |