统一的LLM API网关,通过标准化端点提供多种AI模型的访问。
| Model | Speed | Latency | Tests |
|---|---|---|---|
| gemini-2.0-flash | 7751.02 t/s | 4.46s | 5 |
| gemini-2.5-flash | 7707.00 t/s | 12.19s | 10 |
| gemini-flash-lite-latest | 369.22 t/s | 0.67s | 5 |
| inclusionAI/Ling-flash-2.0 | 142.37 t/s | 1.05s | 5 |
| inclusionAI/Ring-flash-2.0 | 124.18 t/s | 5.75s | 5 |
| glm-4.6-nothinking | 100.63 t/s | 1.99s | 5 |
| glm-4.6-nothinking | 100.63 t/s | 1.99s | 5 |
| GLM-4.6 | 68.55 t/s | 2.58s | 5 |
| GLM-4.6 | 68.55 t/s | 2.58s | 5 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Dec 28, 09:02 AM | Unknown | - | -s |
| Dec 28, 07:36 AM | Unknown | - | -s |
| Dec 21, 01:06 PM | Unknown | - | -s |
| Dec 20, 02:24 PM | Unknown | - | -s |
| Dec 20, 01:45 PM | gemini-flash-lite-latest | 369.22 t/s | 0.67s |
| Dec 20, 01:44 PM | Unknown | - | -s |
| Nov 29, 12:18 PM | gemini-2.5-flash | 210.68 t/s | 7.27s |
| Oct 18, 10:09 AM | gemini-2.5-flash | 15203.32 t/s | 17.11s |
| Oct 12, 01:16 AM | glm-4.6-nothinking | 100.63 t/s | 1.99s |
| Oct 12, 01:06 AM | gemini-2.0-flash | 7751.02 t/s | 4.46s |