Provides elastic computing power and intelligent resource scheduling through a distributed network in China.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| qwen3:0.6b | 239.34 t/s | 0.40s | 5 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 194.01 t/s | 1.00s | 5 |
| fradser/deeptranslate-r2-4b:latest | 123.70 t/s | 0.78s | 10 |
| deepseek-r1:7b | 114.86 t/s | 8.73s | 125 |
| minicpm4-8b:latest | 101.12 t/s | 1.09s | 15 |
| /root/models/Qwen/Qwen3-4B | 72.15 t/s | 0.56s | 5 |
| qwen3:30b-a3b | 57.50 t/s | 1.81s | 60 |
| qwen3:32b | 36.77 t/s | 3.71s | 40 |

| Time | Model | Speed | Latency |
|---|---|---|---|
| Oct 7, 12:05 PM | qwen3:32b | 37.26 t/s | 0.88s |
| Oct 7, 10:15 AM | qwen3:32b | 37.38 t/s | 2.04s |
| Oct 7, 09:59 AM | qwen3:32b | 37.23 t/s | 2.26s |
| Aug 23, 03:22 PM | qwen3:32b | 36.90 t/s | 2.63s |
| Aug 23, 03:14 PM | qwen3:32b | 37.52 t/s | 9.29s |
| Aug 10, 09:43 AM | qwen3:32b | 35.44 t/s | 8.61s |
| Aug 1, 09:34 AM | minicpm4-8b:latest | 159.84 t/s | 0.96s |
| Jul 26, 03:09 PM | qwen3:30b-a3b | 106.93 t/s | 1.15s |
| Jul 26, 03:03 PM | qwen3:30b-a3b | 107.70 t/s | 0.67s |
| Jul 26, 02:49 PM | qwen3:30b-a3b | 106.46 t/s | 1.29s |
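
The individual runs listed above can be rolled up per model to see how stable each one is across tests. The following is a minimal sketch, not the method this page uses to compute its own figures: it copies the rows of the per-run table into a string, and the names `RECENT_RUNS`, `parse_rows`, and `summarize` are hypothetical helpers introduced only for illustration.

```python
from collections import defaultdict
from statistics import mean

# Rows copied verbatim from the per-run table (Time | Model | Speed | Latency).
RECENT_RUNS = """\
| Oct 7, 12:05 PM | qwen3:32b | 37.26 t/s | 0.88s |
| Oct 7, 10:15 AM | qwen3:32b | 37.38 t/s | 2.04s |
| Oct 7, 09:59 AM | qwen3:32b | 37.23 t/s | 2.26s |
| Aug 23, 03:22 PM | qwen3:32b | 36.90 t/s | 2.63s |
| Aug 23, 03:14 PM | qwen3:32b | 37.52 t/s | 9.29s |
| Aug 10, 09:43 AM | qwen3:32b | 35.44 t/s | 8.61s |
| Aug 1, 09:34 AM | minicpm4-8b:latest | 159.84 t/s | 0.96s |
| Jul 26, 03:09 PM | qwen3:30b-a3b | 106.93 t/s | 1.15s |
| Jul 26, 03:03 PM | qwen3:30b-a3b | 107.70 t/s | 0.67s |
| Jul 26, 02:49 PM | qwen3:30b-a3b | 106.46 t/s | 1.29s |
"""


def parse_rows(table_text):
    """Yield (model, tokens_per_second, latency_seconds) for each table row."""
    for line in table_text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        _time, model, speed, latency = cells
        # "37.26 t/s" -> 37.26 and "0.88s" -> 0.88
        yield model, float(speed.split()[0]), float(latency.rstrip("s"))


def summarize(rows):
    """Group runs by model and return run count, mean speed, and mean latency."""
    grouped = defaultdict(list)
    for model, speed, latency in rows:
        grouped[model].append((speed, latency))
    return {
        model: {
            "runs": len(samples),
            "mean_speed": mean(s for s, _ in samples),
            "mean_latency": mean(l for _, l in samples),
        }
        for model, samples in grouped.items()
    }


for model, stats in summarize(parse_rows(RECENT_RUNS)).items():
    print(
        f"{model}: {stats['runs']} run(s), "
        f"{stats['mean_speed']:.2f} t/s, {stats['mean_latency']:.2f}s"
    )
```

Grouping this way makes one pattern in the listed runs easy to see: for qwen3:32b the speed stays within a narrow band (35.44 to 37.52 t/s) while latency ranges from 0.88s to 9.29s, so a single latency reading says little on its own.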