An AI model aggregation platform providing unified API access to multiple large language models with cost optimization features.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| qwen3-next-80b-a3b-instruct | 164.04 t/s | 0.45s | 5 |
| qwen3-next-80b-a3b-instruct | 164.04 t/s | 0.45s | 5 |
| qwen3-8b | 150.65 t/s | 5.56s | 5 |
| hunyuan-a13b-instruct | 143.03 t/s | 3.85s | 5 |
| deepseek-v3.1 | 57.51 t/s | 0.30s | 5 |
| deepseek-v3.1 | 57.51 t/s | 0.30s | 5 |
| deepseek-v3.2-exp | 39.89 t/s | 0.58s | 5 |
| deepseek-v3.2-exp | 39.89 t/s | 0.58s | 5 |
| kimi-k2-instruct-0905 | 13.20 t/s | 1.09s | 5 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Nov 14, 10:21 AM | deepseek-v3.2-exp | 39.89 t/s | 0.58s |
| Nov 3, 12:17 PM | hunyuan-a13b-instruct | 143.03 t/s | 3.85s |
| Sep 23, 04:24 AM | kimi-k2-instruct-0905 | 13.20 t/s | 1.09s |
| Sep 23, 04:20 AM | deepseek-v3.1 | 57.51 t/s | 0.30s |
| Sep 23, 03:46 AM | qwen3-next-80b-a3b-instruct | 164.04 t/s | 0.45s |
| Sep 23, 03:44 AM | qwen3-8b | 150.65 t/s | 5.56s |