qwen3
Qwen3 is available through 200 API providers on LMSpeed. Compare API pricing from $0.0010 to $75.00 per million input tokens across providers. Free API access is offered by 5 providers. In speed benchmarks, the fastest provider reaches 1214 tok/s.
Qwen3 is free to use through 5 providers with no per-token charges. These providers offer free API credits or a free tier:
| Provider | Speed (t/s) |
|---|---|
| — | |
| — | |
| — | |
| 乐天图书馆 | — |
| GOU API | — |
Compare speed and latency performance across all API providers.
Showing 1-36 of 36 providers
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| Qwen/Qwen3-32B | 1214.07 tok/s | 0.91s | 5 |
| Qwen/Qwen3-235B | 234.72 tok/s | 1.71s | 25 |
| qwen/qwen3-32b | 143.07 tok/s | 13.67s | 15 |
| Qwen/Qwen3-4B | 126.44 tok/s | 4.27s | 5 |
| Qwen/Qwen3-235B-A22B | 116.62 tok/s | 21.27s | 30 |
| qwen3-8b | 103.87 tok/s | 7.64s | 15 |
| qwen/qwen3-30b-a3b | 88.00 tok/s | 13.53s | 60 |
| qwen/qwen3-30b-a3b:free | 83.19 tok/s | 22.71s | 5 |
| unsloth/qwen3:30b-a3b-q8_0 | 82.71 tok/s | 2.21s | 5 |
| qwen3-8b | 78.65 tok/s | 10.01s | 5 |
| qwen3-8b | 76.04 tok/s | 0.98s | 5 |
| qwen3-14b | 75.24 tok/s | 9.23s | 5 |
| qwen/qwen3-14b | 69.57 tok/s | 18.89s | 5 |
| Qwen/Qwen3-30B-A3B | 64.95 tok/s | 18.06s | 45 |
| qwen3-30b-a3b | 64.07 tok/s | 8.79s | 25 |
| qwen3:30b-a3b-q8_0 | 63.96 tok/s | 0.58s | 10 |
| qwen3:30b-a3b | 63.68 tok/s | 1.76s | 80 |
| unsloth/qwen3:14b-q8_0 | 61.79 tok/s | 1.51s | 5 |
| qwen3 | 59.87 tok/s | 5.39s | 5 |
| Qwen/Qwen3-14B | 59.12 tok/s | 8.14s | 15 |
| qwen/qwen3-235b-a22b | 44.48 tok/s | 21.25s | 9 |
| qwen3:30b | 44.38 tok/s | 0.67s | 40 |
| Qwen3-235B-A22B | 40.69 tok/s | 7.58s | 10 |
| qwen3-235b-a22b | 39.18 tok/s | 15.96s | 35 |
| qwen3-235b | 37.44 tok/s | 17.65s | 10 |
| qwen3:32b | 36.77 tok/s | 3.71s | 40 |
| Qwen/Qwen3-8B | 36.69 tok/s | 35.75s | 65 |
| Qwen/Qwen3-235B-A22B:novita | 34.12 tok/s | 1.12s | 5 |
| Qwen/Qwen3-32B | 29.04 tok/s | 29.09s | 15 |
| qwen/qwen3-8b | 28.21 tok/s | 5.41s | 164 |
| Qwen3-32B | 27.00 tok/s | 39.79s | 15 |
| Qwen3-235B-A22B | 24.42 tok/s | 31.52s | 5 |
| Qwen/Qwen3-8B | 22.98 tok/s | 0.55s | 10 |
| qwen3-32b | 22.93 tok/s | 0.81s | 5 |
| bailian/qwen3-235b-a22b:free | 13.60 tok/s | 34.41s | 5 |
| Qwen3-30B-A3B | 12.40 tok/s | 38.51s | 5 |
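
The table above is ordered by speed, but latency does not follow the same order: some slower entries respond in under a second. As a minimal sketch, assuming the rows are copied as pipe-delimited text, they could be parsed and re-ranked by latency like this. The names `BenchmarkRow` and `parse_row` are hypothetical helpers for illustration, not part of any LMSpeed API, and the sample rows are taken from the table.

```python
from typing import NamedTuple


class BenchmarkRow(NamedTuple):
    """One row of the speed table (hypothetical helper type)."""
    model_id: str
    speed_tok_s: float
    latency_s: float
    tests: int


def parse_row(line: str) -> BenchmarkRow:
    """Parse a pipe-delimited row, with or without a leading pipe."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    model_id, speed, latency, tests = cells
    return BenchmarkRow(
        model_id=model_id,
        speed_tok_s=float(speed.split()[0]),   # "44.38 tok/s" -> 44.38
        latency_s=float(latency.rstrip("s")),  # "0.67s" -> 0.67
        tests=int(tests),
    )


# Four rows copied from the table above.
SAMPLE_ROWS = [
    "Qwen/Qwen3-32B | 1214.07 tok/s | 0.91s | 5 |",
    "qwen/qwen3-32b | 143.07 tok/s | 13.67s | 15 |",
    "qwen3:30b-a3b-q8_0 | 63.96 tok/s | 0.58s | 10 |",
    "qwen3:30b | 44.38 tok/s | 0.67s | 40 |",
]

rows = [parse_row(line) for line in SAMPLE_ROWS]

# Re-rank by latency, lowest first.
by_latency = sorted(rows, key=lambda row: row.latency_s)

for row in by_latency:
    print(f"{row.model_id}: {row.latency_s:.2f}s, {row.speed_tok_s:.2f} tok/s, {row.tests} tests")
```

Sorting on `tests` as a secondary key, or filtering out rows with few tests, may be worthwhile, since several entries are based on only 5 runs.
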
Latest benchmark results measuring API response speed and first-token latency.
| Time | Model | Speed | Latency |
|---|---|---|---|
| 04/15/2026, 14:17 | qwen/qwen3-8b | 30.82 tok/s | 10.51s |
| 04/15/2026, 14:17 | qwen/qwen3-8b | 35.87 tok/s | 20.40s |
| 04/15/2026, 14:17 | qwen/qwen3-8b | 34.99 tok/s | 37.72s |
| 04/15/2026, 14:17 | qwen/qwen3-8b | 34.55 tok/s | 5.91s |
| 04/15/2026, 14:17 | qwen/qwen3-8b | 35.61 tok/s | 16.45s |
| 04/10/2026, 13:59 | qwen/qwen3-8b | 41.37 tok/s | 14.07s |
| 04/10/2026, 13:59 | qwen/qwen3-8b | 27.00 tok/s | 9.84s |
| 04/10/2026, 13:59 | qwen/qwen3-8b | 29.96 tok/s | 52.80s |
| 04/10/2026, 13:59 | qwen/qwen3-8b | 5.16 tok/s | 22.04s |
| 04/10/2026, 13:59 | qwen/qwen3-8b | 24.58 tok/s | 34.60s |
| 04/10/2026, 05:18 | qwen/qwen3-8b | 23.57 tok/s | 14.55s |
| 04/10/2026, 05:18 | qwen/qwen3-8b | 26.81 tok/s | 21.98s |
| 04/10/2026, 05:18 | qwen/qwen3-8b | 24.13 tok/s | 29.71s |
| 04/10/2026, 05:18 | qwen/qwen3-8b | 25.25 tok/s | 9.11s |
| 04/10/2026, 05:18 | qwen/qwen3-8b | 24.75 tok/s | 32.78s |
| 04/08/2026, 06:54 | qwen/qwen3-8b | 24.38 tok/s | 17.90s |
| 04/08/2026, 06:54 | qwen/qwen3-8b | 26.17 tok/s | 13.33s |
| 04/08/2026, 06:54 | qwen/qwen3-8b | 24.58 tok/s | 32.49s |
| 04/08/2026, 06:54 | qwen/qwen3-8b | 24.70 tok/s | 11.08s |
| 04/08/2026, 06:54 | qwen/qwen3-8b | 25.87 tok/s | 32.03s |
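
Each timestamp above groups five runs, and latency within a single batch varies widely (5.91s to 37.72s in the most recent one). One way to summarize a batch is the median, which is less sensitive to outliers such as the 5.16 tok/s run. The sketch below is illustrative only: `summarize_batches` is a hypothetical helper, the data is copied from the first two batches in the table, and nothing here reflects how LMSpeed itself aggregates results.

```python
from collections import defaultdict
from statistics import median

# (timestamp, speed in tok/s, latency in seconds), copied from the table above.
RUNS = [
    ("04/15/2026, 14:17", 30.82, 10.51),
    ("04/15/2026, 14:17", 35.87, 20.40),
    ("04/15/2026, 14:17", 34.99, 37.72),
    ("04/15/2026, 14:17", 34.55, 5.91),
    ("04/15/2026, 14:17", 35.61, 16.45),
    ("04/10/2026, 13:59", 41.37, 14.07),
    ("04/10/2026, 13:59", 27.00, 9.84),
    ("04/10/2026, 13:59", 29.96, 52.80),
    ("04/10/2026, 13:59", 5.16, 22.04),
    ("04/10/2026, 13:59", 24.58, 34.60),
]


def summarize_batches(runs):
    """Return {timestamp: (median speed, median latency)} per batch."""
    batches = defaultdict(list)
    for timestamp, speed, latency in runs:
        batches[timestamp].append((speed, latency))
    return {
        timestamp: (
            median(speed for speed, _ in values),
            median(latency for _, latency in values),
        )
        for timestamp, values in batches.items()
    }


summary = summarize_batches(RUNS)
for timestamp, (speed, latency) in summary.items():
    print(f"{timestamp}: median {speed:.2f} tok/s, median latency {latency:.2f}s")
```
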