Provides API relay services for AI models, including Azure OpenAI, Gemini, Claude, and Grok, with flexible resource grouping and multi-cloud routing.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| gpt-4o | 129.13 t/s | 5.78s | 10 |
| gpt-4o-mini | 79.07 t/s | 1.28s | 10 |

| Time | Model | Speed | Latency |
|---|---|---|---|
| Feb 19, 08:11 AM | gpt-4o-mini | 70.65 t/s | 1.52s |
| Feb 19, 08:10 AM | gpt-4o-mini | 87.50 t/s | 1.03s |
| Feb 19, 08:08 AM | gpt-4o | 70.54 t/s | 3.59s |
| Feb 19, 08:06 AM | gpt-4o | 187.71 t/s | 7.96s |
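
The per-model figures in the first table are consistent with a plain arithmetic mean of the timestamped runs listed above. The following is a minimal sketch of that aggregation, assuming a simple mean per model; the page does not state how its summary is actually computed, and the `runs` list and `summarize` function are hypothetical names used only for illustration.

```python
from collections import defaultdict
from statistics import mean

# Runs copied from the timestamped table: (model, speed in t/s, latency in s).
runs = [
    ("gpt-4o-mini", 70.65, 1.52),
    ("gpt-4o-mini", 87.50, 1.03),
    ("gpt-4o", 70.54, 3.59),
    ("gpt-4o", 187.71, 7.96),
]


def summarize(rows):
    """Return {model: (mean speed, mean latency, run count)}.

    Assumption: a plain arithmetic mean; the real aggregation is not documented.
    """
    grouped = defaultdict(list)
    for model, speed, latency in rows:
        grouped[model].append((speed, latency))
    return {
        model: (
            mean(s for s, _ in vals),
            mean(l for _, l in vals),
            len(vals),
        )
        for model, vals in grouped.items()
    }


summary = summarize(runs)
for model, (speed, latency, count) in summary.items():
    print(f"{model}: {speed:.2f} t/s, {latency:.2f}s over {count} runs")
```

The summary table reports 10 tests per model while only two runs per model are listed here, so the remaining runs may simply not be shown, and the agreement with a two-run mean should be read with that in mind.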