A serverless platform for deploying and scaling applications, including AI inference, on high-performance infrastructure.

| Model | Speed | Latency | Tests |
|---|---|---|---|
| deepseek-chat | 20.87 t/s | 1.12s | 5 |

| Time | Model | Speed | Latency |
|---|---|---|---|
| Mar 15, 03:55 PM | deepseek-chat | 20.87 t/s | 1.12s |