qwq
Developer: Alibaba
Also known as
Qwq by Alibaba is available through 60 API providers on LMSpeed. Compare API pricing from $0.0003 to $75.00 per million input tokens across providers. Free API access is offered by 3 providers. In speed benchmarks, the fastest provider reaches 71 tok/s.
Compare Qwq API pricing across 57 providers. Input prices range from $0.0003 to $75.00 per million input. 91VIP offers the lowest rate at $0.0003/M.
| Provider | Model Variant | Input ($/M) | Output ($/M) | Speed (t/s) |
|---|---|---|---|---|
| Qwen/QwQ-32B | Free | Free | 42.3 t/s | |
| qwen/qwq-32b | Free | Free | — | |
| 91VIP API | qwen/qwq-32b | Free | Free | — |
| 91VIP | qwq-32b | $0.0003 | $0.0002 | — |
| Futureppo | qwq-32b | $0.0003 | $0.0002 | — |
| 素墨API | qwen/qwq-32b | $0.010 | $0.010 | — |
| 素墨API | qwq-32b | $0.010 | $0.010 | — |
| 素墨API | qwq-32b-preview | $0.010 | $0.010 | — |
| 英伟达/qwen/qwq-32b | $0.100 | $0.100 | — | |
| 对空六课 API | 英伟达/qwen/qwq-32b | $0.100 | $0.100 | — |
| Qwen/QwQ-32B | $0.150 | $0.580 | — | |
| qwen/qwq-32b | $0.150 | $0.400 | — | |
| qwen/qwq-32b | $0.150 | $0.400 | — | |
| Seamee API | Qwen/QwQ-32B | $0.150 | $0.580 | — |
| Seamee API | qwen/qwq-32b | $0.150 | $0.400 | — |
| Qwen/QwQ-32B | $0.150 | $0.580 | 52.2 t/s | |
| qwen/qwq-32b | $0.150 | $0.400 | — | |
| Seamee API | Qwen/QwQ-32B-Preview | $0.200 | $0.020 | — |
| Qwen/QwQ-32B | $0.240 | $0.360 | — | |
| Qwen/QwQ-32B-Preview | $0.240 | $0.360 | — |
Pricing data from provider public APIs
Qwq is free to use through 3 providers with no per-token charges. These providers offer free API credits or a free tier:
Compare speed and latency performance across all API providers.
Showing 1-6 of 6 providers
| Provider | Speed | Latency | Tests |
|---|---|---|---|
Qwen/QwQ-32B-Preview | 71.07 tok/s | 0.74s | 15 |
ALMZBH API QwQ-32B | 58.39 tok/s | 3.84s | 75 |
Qwen/QwQ-32B | 52.21 tok/s | 11.23s | 5 |
Qwen/QwQ-32B | 42.34 tok/s | 25.11s | 105 |
102417 API Qwen/QwQ-32B | 40.91 tok/s | 19.18s | 5 |
qwq-32b | 38.05 tok/s | 17.85s | 5 |
Latest benchmark results measuring API response speed and first-token latency.
| Time | Model | Speed | Latency |
|---|---|---|---|
| 04/12/2026, 11:13 | Qwen/QwQ-32B | 38.32 tok/s | 21.37s |
| 04/12/2026, 11:13 | Qwen/QwQ-32B | 38.92 tok/s | 29.27s |
| 04/12/2026, 11:13 | Qwen/QwQ-32B | 42.94 tok/s | 40.26s |
| 04/12/2026, 11:13 | Qwen/QwQ-32B | 41.19 tok/s | 10.66s |
| 04/12/2026, 11:13 | Qwen/QwQ-32B | 43.61 tok/s | 25.77s |
| 04/04/2026, 21:04 | Qwen/QwQ-32B | 107.79 tok/s | 6.31s |
| 04/04/2026, 21:04 | Qwen/QwQ-32B | 80.22 tok/s | 7.10s |
| 04/04/2026, 21:04 | Qwen/QwQ-32B | 56.52 tok/s | 14.90s |
| 04/04/2026, 21:04 | Qwen/QwQ-32B | 93.29 tok/s | 6.71s |
| 04/04/2026, 21:04 | Qwen/QwQ-32B | 84.69 tok/s | 10.00s |
| 03/24/2026, 19:16 | Qwen/QwQ-32B | 70.74 tok/s | 8.62s |
| 03/24/2026, 19:16 | Qwen/QwQ-32B | 41.73 tok/s | 15.97s |
| 03/24/2026, 19:16 | Qwen/QwQ-32B | 52.44 tok/s | 11.57s |
| 03/24/2026, 19:16 | Qwen/QwQ-32B | 43.82 tok/s | 22.75s |
| 03/24/2026, 19:16 | Qwen/QwQ-32B | 67.50 tok/s | 13.77s |
| 03/21/2026, 16:26 | Qwen/QwQ-32B | 42.23 tok/s | 11.69s |
| 03/21/2026, 16:26 | Qwen/QwQ-32B | 45.15 tok/s | 5.79s |
| 03/21/2026, 16:26 | Qwen/QwQ-32B | 43.92 tok/s | 24.25s |
| 03/21/2026, 16:26 | Qwen/QwQ-32B | 44.20 tok/s | 7.38s |
| 03/21/2026, 16:26 | Qwen/QwQ-32B | 60.86 tok/s | 24.18s |
