A unified LLM API gateway providing access to multiple AI models through standardized endpoints.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| gemini-2.5-flash | 17911.37 t/s | 14.53s | 5 |
| qwen3 | 59.87 t/s | 5.39s | 5 |
| llama-4-maverick | 0.00 t/s | 0.00s | 5 |

| Time | Model | Speed | Latency |
|---|---|---|---|
| Aug 12, 04:58 PM | qwen3 | 59.87 t/s | 5.39s |
| Aug 12, 04:58 PM | llama-4-maverick | 0.00 t/s | 0.00s |
| Aug 12, 04:53 PM | gemini-2.5-flash | 17911.37 t/s | 14.53s |
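
The rows above can be read programmatically. The following is a minimal sketch, not part of the gateway itself: `Reading`, `parse_row`, and `has_no_measurement` are hypothetical names introduced for illustration. It assumes that `t/s` is a throughput figure and that a row showing both `0.00 t/s` and `0.00s` may mean no measurement was recorded. The source does not confirm that interpretation.

```python
from dataclasses import dataclass


@dataclass
class Reading:
    """One benchmark row (hypothetical structure, for illustration only)."""
    model: str
    tokens_per_second: float
    latency_seconds: float


def parse_row(row: str) -> Reading:
    """Parse a table row whose last three cells are model, speed, latency."""
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    model, speed, latency = cells[-3:]
    return Reading(
        model=model,
        tokens_per_second=float(speed.split()[0]),  # "59.87 t/s" -> 59.87
        latency_seconds=float(latency.rstrip("s")),  # "5.39s" -> 5.39
    )


def has_no_measurement(reading: Reading) -> bool:
    """Assumption: zero speed together with zero latency means no data."""
    return reading.tokens_per_second == 0.0 and reading.latency_seconds == 0.0


if __name__ == "__main__":
    log_rows = [
        "| Aug 12, 04:58 PM | qwen3 | 59.87 t/s | 5.39s |",
        "| Aug 12, 04:58 PM | llama-4-maverick | 0.00 t/s | 0.00s |",
        "| Aug 12, 04:53 PM | gemini-2.5-flash | 17911.37 t/s | 14.53s |",
    ]
    for reading in map(parse_row, log_rows):
        if has_no_measurement(reading):
            print(f"{reading.model}: no measurement recorded")
```

Flagging all-zero rows separately keeps them from being read as a real speed of zero when comparing models.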