A unified LLM API gateway offering access to multiple AI models with OpenAI-compatible endpoints.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| gpt-oss-120b | 671.95 t/s | 2.33s | 5 |
| moonshotai/kimi-k2-instruct-0905 | 240.47 t/s | 1.79s | 10 |
| inclusionAI/Ling-1T | 18.55 t/s | 1.27s | 5 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Nov 9, 10:17 AM | inclusionAI/Ling-1T | 18.55 t/s | 1.27s |
| Nov 9, 10:17 AM | gpt-oss-120b | 671.95 t/s | 2.33s |
| Nov 9, 09:29 AM | moonshotai/kimi-k2-instruct-0905 | 245.29 t/s | 1.75s |
| Nov 9, 09:26 AM | moonshotai/kimi-k2-instruct-0905 | 235.64 t/s | 1.83s |