API Provider Performance Test Results
Model Name | Test Count | Average Output Speed | Average First Token Latency |
---|---|---|---|
allam-2-7b | 5 | 337.15 t/s | 0.27s |
gemini-2.5-flash-lite-ts | 5 | 214.83 t/s | 1.17s |
lgai/exaone-3-5-32b-instruct | 5 | 100.84 t/s | 1.08s |
Qwen/Qwen3-32B-FP8 | 5 | 75.12 t/s | 1.01s |
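
To compare the two averages, they can be combined into a rough end-to-end estimate for a response of a given length. The sketch below is an assumption rather than part of the test tool: it treats output speed as the rate after the first token arrives, and the helper name `estimated_response_time` and the 500-token example length are hypothetical.

```python
# Averages copied from the table above:
# model -> (average output speed in tokens/s, average first token latency in s)
RESULTS = {
    "allam-2-7b": (337.15, 0.27),
    "gemini-2.5-flash-lite-ts": (214.83, 1.17),
    "lgai/exaone-3-5-32b-instruct": (100.84, 1.08),
    "Qwen/Qwen3-32B-FP8": (75.12, 1.01),
}


def estimated_response_time(speed_tps: float, first_token_latency_s: float,
                            output_tokens: int) -> float:
    """Rough estimate: wait for the first token, then stream at the average speed.

    Assumes the reported speed excludes the first-token wait (not confirmed
    by the source).
    """
    if speed_tps <= 0:
        raise ValueError("speed_tps must be positive")
    return first_token_latency_s + output_tokens / speed_tps


if __name__ == "__main__":
    # Hypothetical response length of 500 output tokens.
    for model, (speed, latency) in RESULTS.items():
        seconds = estimated_response_time(speed, latency, 500)
        print(f"{model}: about {seconds:.2f}s for 500 tokens")
```

For short responses the first token latency dominates the estimate, while for long responses the output speed does, so the ranking between two models can change with response length.
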
Output Speed Range: 19.95 - 517.78 t/s
First Token Latency Range: 0.23 - 2.31s
Test Time | Model | Average Output Speed | Average First Token Latency | Total Tokens |
---|---|---|---|---|
8/9/2025, 8:16:15 AM | gemini-2.5-flash-lite-ts | 214.83 t/s | 1.17s | 4875 |
8/9/2025, 8:14:54 AM | lgai/exaone-3-5-32b-instruct | 100.84 t/s | 1.08s | 2953 |
8/9/2025, 8:11:07 AM | Qwen/Qwen3-32B-FP8 | 75.12 t/s | 1.01s | 5706 |
8/9/2025, 8:10:29 AM | allam-2-7b | 337.15 t/s | 0.27s | 1842 |
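
The page does not say how the averages are computed. As a minimal sketch of one common approach, the code below derives both metrics from per-token arrival timestamps of a streamed response and averages them over several runs. The `TestRun` type, its field names, and the choice to measure speed from the first token to the last are assumptions for illustration, not the test tool's actual method.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List, Tuple


@dataclass
class TestRun:
    """Timing data for one streamed request (hypothetical structure)."""
    request_sent: float        # timestamp in seconds, e.g. from time.perf_counter()
    token_times: List[float]   # arrival timestamp of each output token


def first_token_latency(run: TestRun) -> float:
    """Seconds between sending the request and receiving the first token."""
    return run.token_times[0] - run.request_sent


def output_speed(run: TestRun) -> float:
    """Tokens per second over the window from the first token to the last."""
    if len(run.token_times) < 2:
        raise ValueError("need at least two tokens to measure speed")
    window = run.token_times[-1] - run.token_times[0]
    # n tokens span n - 1 inter-token intervals inside the window.
    return (len(run.token_times) - 1) / window


def summarize(runs: List[TestRun]) -> Tuple[float, float]:
    """Return (average output speed in t/s, average first token latency in s)."""
    return (mean(output_speed(r) for r in runs),
            mean(first_token_latency(r) for r in runs))
```

With this definition, a run that sends its request at t = 0.0 and receives tokens at 0.5, 1.0, 1.5, 2.0, and 2.5 seconds has a first token latency of 0.5s and an output speed of 2 t/s.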