A unified LLM API gateway providing access to multiple AI models through a single endpoint with competitive pricing and no subscription required.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| Qwen/Qwen2-1.5B-Instruct | 213.84 t/s | 0.67s | 5 |
| Pro/Qwen/Qwen2-1.5B-Instruct | 204.04 t/s | 0.60s | 5 |
| 免费Qwen2-1.5B (Free Qwen2-1.5B) | 203.36 t/s | 0.68s | 5 |
| 免费Grok3-mini (Free Grok3-mini) | 180.93 t/s | 3.99s | 5 |
| deepseek-ai/deepseek-vl2 | 125.77 t/s | 0.75s | 5 |
| Pro/Qwen/Qwen2-7B-Instruct | 96.25 t/s | 0.63s | 5 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | 94.25 t/s | 5.51s | 5 |
| Qwen/Qwen2-7B-Instruct | 93.64 t/s | 0.68s | 5 |
| Pro/Qwen/Qwen2-VL-7B-Instruct | 93.23 t/s | 0.71s | 5 |
| 免费Qwen2-7B (Free Qwen2-7B) | 93.01 t/s | 0.85s | 10 |
| 免费DS-VL2 (Free DS-VL2) | 86.80 t/s | 1.86s | 5 |
| 免费Grok3 (Free Grok3) | 85.51 t/s | 1.39s | 5 |
| Pro/Qwen/Qwen2.5-VL-7B-Instruct | 85.51 t/s | 0.76s | 5 |
| 免费Qwen2.5-14B (Free Qwen2.5-14B) | 77.95 t/s | 0.69s | 5 |
| Qwen/Qwen2.5-14B-Instruct | 77.92 t/s | 0.69s | 5 |

| Time | Model | Speed | Latency |
|---|---|---|---|
| Feb 15, 03:21 AM | 翻译 (Translation) | 60.49 t/s | 0.97s |
| Dec 19, 05:02 PM | Unknown | - | - |
| Jun 19, 03:51 PM | gpt-4.1-mini | 0.00 t/s | 0.00s |
| Jun 19, 03:50 PM | 沉浸式翻译 (Immersive Translate) | 0.00 t/s | 0.00s |
| Jun 19, 03:49 PM | 沉浸式翻译 (Immersive Translate) | 0.00 t/s | 0.00s |
| Jun 11, 06:54 AM | 沉浸式翻译 (Immersive Translate) | 0.00 t/s | 0.00s |
| Jun 11, 06:54 AM | 沉浸式翻译 (Immersive Translate) | 0.00 t/s | 0.00s |
| Jun 8, 11:16 PM | 沉浸式翻译 (Immersive Translate) | 72.24 t/s | 1.01s |
| May 3, 03:43 PM | Qwen/Qwen3-30B-A3B | 63.72 t/s | 19.32s |
| May 3, 03:41 PM | Qwen/Qwen3-30B-A3B | 84.79 t/s | 9.54s |
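
The Speed and Latency columns can be combined into a rough end-to-end estimate for a response of a given length. The sketch below is a minimal illustration in Python, under the assumption (not stated on this page) that Latency is the delay before the first token and Speed is the sustained output rate. The helper names `parse_metrics` and `estimate_seconds` are hypothetical. Rows reporting `-` or `0.00 t/s` are treated as having no usable measurement.

```python
from typing import Optional, Tuple


def parse_metrics(speed_cell: str, latency_cell: str) -> Optional[Tuple[float, float]]:
    """Return (tokens_per_second, latency_seconds) from two table cells.

    Returns None when the row has no usable measurement.
    """
    try:
        speed = float(speed_cell.replace("t/s", "").strip())
        latency = float(latency_cell.strip().rstrip("s").strip())
    except ValueError:
        return None  # placeholder cells such as "-"
    if speed <= 0:
        return None  # a 0.00 t/s row carries no throughput information
    return speed, latency


def estimate_seconds(tokens_per_second: float, latency_seconds: float, output_tokens: int) -> float:
    """Estimate total response time, ASSUMING latency is time to first token."""
    return latency_seconds + output_tokens / tokens_per_second


if __name__ == "__main__":
    # Cell values copied from the tables above.
    rows = [
        ("Qwen/Qwen2-1.5B-Instruct", "213.84 t/s", "0.67s"),
        ("Qwen/Qwen3-30B-A3B", "63.72 t/s", "19.32s"),
        ("gpt-4.1-mini", "0.00 t/s", "0.00s"),
    ]
    for model, speed_cell, latency_cell in rows:
        metrics = parse_metrics(speed_cell, latency_cell)
        if metrics is None:
            print(f"{model}: no usable measurement")
            continue
        total = estimate_seconds(metrics[0], metrics[1], output_tokens=500)
        print(f"{model}: about {total:.1f}s for 500 output tokens")
```

Under that assumption, a 500-token reply from Qwen/Qwen2-1.5B-Instruct would take about 3.0 seconds, while the May 3, 03:43 PM Qwen/Qwen3-30B-A3B run would take about 27 seconds, most of it spent waiting on the 19.32s latency. With only a handful of test runs per model, these figures are indicative at best.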