API Provider Performance Test Results
Model Name | Test Count | Average Output Speed | Average First Token Latency |
---|---|---|---|
deepseek-ai/DeepSeek-V3-0324 | 5 | 12.46 t/s | 1.81s |
google/gemma-3-27b-it | 5 | 37.62 t/s | 1.01s |
gpt-4.1-mini | 5 | 46.53 t/s | 17.48s |
gpt-4o-all | 5 | 75.93 t/s | 21.98s |
o1-mini | 5 | 60.85 t/s | 19.57s |
o4-mini | 5 | 139.60 t/s | 18.99s |
o4-mini-high | 5 | 118.19 t/s | 14.36s |
Total Tests
10.37 - 204.39 t/s
0.81 - 38.82s
Last Test Time
o4-mini
Test Count: 5
139.60 t/s
Average Speed
o4-mini-high
Test Count: 5
118.19 t/s
Average Speed
gpt-4o-all
Test Count: 5
75.93 t/s
Average Speed
o1-mini
Test Count: 5
60.85 t/s
Average Speed
gpt-4.1-mini
Test Count: 5
46.53 t/s
Average Speed
Test Time | Model | Average Output Speed | Average First Token Latency | Total Tokens |
---|---|---|---|---|
7/21/2025, 5:50:47 AM | o1-mini | 60.85 t/s | 19.57s | 1420 |
7/21/2025, 5:47:41 AM | o4-mini-high | 118.19 t/s | 14.36s | 2804 |
7/21/2025, 5:44:48 AM | o4-mini | 139.60 t/s | 18.99s | 3466 |
7/21/2025, 5:39:48 AM | gpt-4.1-mini | 46.53 t/s | 17.48s | 1547 |
7/21/2025, 5:03:59 AM | gpt-4o-all | 75.93 t/s | 21.98s | 2194 |
7/21/2025, 4:05:54 AM | google/gemma-3-27b-it | 37.62 t/s | 1.01s | 4446 |
7/21/2025, 3:56:06 AM | deepseek-ai/DeepSeek-V3-0324 | 12.46 t/s | 1.81s | 2274 |