A unified LLM API gateway providing access to multiple AI models with competitive pricing and stability.
| Model | Speed | Latency | Tests |
|---|---|---|---|
| Qwen/QwQ-32B-Preview | 23.63 t/s | 0.27s | 15 |
| Qwen/QwQ-32B | 14.02 t/s | 14.80s | 10 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Mar 26, 01:06 PM | Qwen/QwQ-32B | 28.04 t/s | 29.60s |
| Mar 26, 01:04 PM | Qwen/QwQ-32B-Preview | 70.89 t/s | 0.80s |
| Mar 26, 01:03 PM | Qwen/QwQ-32B | 0.00 t/s | 0.00s |
| Mar 26, 01:02 PM | Qwen/QwQ-32B-Preview | 0.00 t/s | 0.00s |
| Mar 26, 01:02 PM | Qwen/QwQ-32B-Preview | 0.00 t/s | 0.00s |