Provides cost-effective generative AI cloud services based on open-source models for text, image, video, and audio generation.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| THUDM/GLM-Z1-9B-0414 | 171.50 t/s | 13.03s | 25 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 142.95 t/s | 4.47s | 5 |
| Qwen/Qwen3-VL-8B-Instruct | 142.73 t/s | 0.64s | 5 |
| Qwen/Qwen3-Next-80B-A3B-Instruct | 86.82 t/s | 0.62s | 10 |
| Pro/MiniMaxAI/MiniMax-M2.5 | 85.18 t/s | 6.00s | 5 |
| Qwen/Qwen2-7B-Instruct | 81.93 t/s | 0.57s | 25 |
| Qwen/Qwen3-14B | 78.64 t/s | 9.81s | 5 |
| Pro/THUDM/glm-4-9b-chat | 76.25 t/s | 0.63s | 10 |
| THUDM/glm-4-9b-chat | 75.38 t/s | 0.59s | 15 |
| Pro/zai-org/GLM-4.7 | 74.70 t/s | 17.00s | 15 |
| zai-org/GLM-4.5V | 73.00 t/s | 6.15s | 10 |
| Pro/Qwen/Qwen2-7B-Instruct | 71.29 t/s | 0.56s | 5 |
| zai-org/GLM-4.6 | 70.24 t/s | 1.60s | 15 |
| Qwen/QwQ-32B-Preview | 69.75 t/s | 0.62s | 5 |
| internlm/internlm2_5-7b-chat | 68.17 t/s | 0.56s | 20 |

| Time | Model | Speed | Latency |
|---|---|---|---|
| Feb 28, 04:36 AM | deepseek-ai/DeepSeek-V3.2 | 21.69 t/s | 0.77s |
| Feb 25, 06:45 PM | Pro/MiniMaxAI/MiniMax-M2.5 | 85.18 t/s | 6.00s |
| Feb 25, 06:37 PM | Pro/deepseek-ai/DeepSeek-V3.2 | 47.80 t/s | 32.32s |
| Feb 25, 06:32 PM | deepseek-ai/DeepSeek-V3.2 | 21.83 t/s | 0.82s |
| Jan 24, 04:58 PM | Qwen/Qwen3-235B-A22B-Instruct-2507 | 15.27 t/s | 0.71s |
| Jan 24, 04:56 PM | zai-org/GLM-4.6 | 67.71 t/s | 2.88s |
| Jan 23, 11:46 PM | zai-org/GLM-4.6 | 75.43 t/s | 0.61s |
| Jan 23, 02:16 PM | zai-org/GLM-4.6 | 67.59 t/s | 1.32s |
| Jan 23, 02:13 PM | Pro/zai-org/GLM-4.7 | 78.69 t/s | 15.95s |
| Jan 21, 08:52 AM | deepseek-ai/DeepSeek-V3.2 | 20.92 t/s | 1.12s |
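
The per-model rows in the first table look like averages over individual runs such as those listed above. For example, the three zai-org/GLM-4.6 runs average to the 70.24 t/s and 1.60s shown for that model. The page does not state how the aggregation is done, so the following is only a minimal sketch under that assumption. The `summarize` helper and the `runs` list are hypothetical names introduced for illustration, and the Tests column may count something other than rows.

```python
from collections import defaultdict
from statistics import mean

# (model, speed in tokens/s, latency in seconds), copied from the per-run table.
runs = [
    ("zai-org/GLM-4.6", 67.71, 2.88),
    ("zai-org/GLM-4.6", 75.43, 0.61),
    ("zai-org/GLM-4.6", 67.59, 1.32),
    ("Pro/MiniMaxAI/MiniMax-M2.5", 85.18, 6.00),
]


def summarize(rows):
    """Group runs by model and return (mean speed, mean latency, run count).

    Assumption: a plain arithmetic mean rounded to two decimals. The page
    itself does not document its aggregation method.
    """
    grouped = defaultdict(list)
    for model, speed, latency in rows:
        grouped[model].append((speed, latency))
    return {
        model: (
            round(mean(s for s, _ in values), 2),
            round(mean(l for _, l in values), 2),
            len(values),
        )
        for model, values in grouped.items()
    }


if __name__ == "__main__":
    for model, (speed, latency, count) in summarize(runs).items():
        print(f"{model}: {speed:.2f} t/s, {latency:.2f}s over {count} run(s)")
```

A plain mean is used here because it reproduces the GLM-4.6 figures; a median or a token-weighted mean would be equally plausible choices for other models.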