A unified API gateway providing access to multiple large language models and AI services through a single interface.
| Model | Speed | Latency | Tests |
|---|---|---|---|
| gemini-3.1-pro-preview | 664.77 t/s | 25.06s | 5 |
| gemini-2.0-flash | 163.32 t/s | 0.59s | 5 |
| gemini-3-flash-preview | 155.47 t/s | 7.06s | 5 |
| grok-4-1-fast-non-reasoning | 108.12 t/s | 1.01s | 5 |
| sophnet-mimo-v2-flash | 104.20 t/s | 1.22s | 5 |
| DeepSeek-V3-Fast | 79.64 t/s | 0.89s | 5 |
| zai-glm-4.7 | 59.55 t/s | 41.22s | 5 |
| doubao-seed-1-8-251228 | 46.32 t/s | 23.71s | 5 |
| alicloud-qwen3-max-2026-01-23 | 40.59 t/s | 1.26s | 5 |
| Qwen/Qwen2.5-72B-Instruct | 32.01 t/s | 1.03s | 10 |
| Qwen/Qwen2.5-72B-Instruct | 32.01 t/s | 1.03s | 10 |
| DeepSeek-V3 | 29.76 t/s | 1.44s | 5 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Feb 21, 02:31 PM | gemini-3.1-pro-preview | 664.77 t/s | 25.06s |
| Feb 1, 09:47 AM | alicloud-qwen3-max-2026-01-23 | 40.59 t/s | 1.26s |
| Feb 1, 09:41 AM | zai-glm-4.7 | 59.55 t/s | 41.22s |
| Feb 1, 09:40 AM | sophnet-mimo-v2-flash | 104.20 t/s | 1.22s |
| Feb 1, 09:36 AM | doubao-seed-1-8-251228 | 46.32 t/s | 23.71s |
| Jan 30, 04:31 AM | grok-4-1-fast-non-reasoning | 108.12 t/s | 1.01s |
| Jan 30, 04:28 AM | gemini-3-flash-preview | 155.47 t/s | 7.06s |
| Sep 13, 12:54 PM | Qwen/Qwen2.5-72B-Instruct | 32.44 t/s | 1.01s |
| Sep 13, 12:52 PM | Qwen/Qwen2.5-72B-Instruct | 31.58 t/s | 1.05s |
| Sep 13, 12:49 PM | gemini-2.0-flash | 163.32 t/s | 0.59s |