ngrok provides a unified gateway for AI models and secure connectivity solutions for development and production environments.
| Model | Speed | Latency | Tests |
|---|---|---|---|
| auto_chat | 26226.28 t/s | 2.31s | 10 |
| QWEN | 24489.11 t/s | 2.40s | 5 |
| CEREBRAS | 1473.52 t/s | 3.54s | 5 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Sep 23, 03:16 PM | CEREBRAS | 1473.52 t/s | 3.54s |
| Sep 23, 03:07 PM | QWEN | 24489.11 t/s | 2.40s |
| Sep 23, 03:06 PM | auto_chat | 26318.21 t/s | 2.23s |
| Sep 23, 03:06 PM | auto_chat | 26134.35 t/s | 2.39s |