qwen3-next-thinking
Developer: Alibaba
Alibaba Qwen3 Next Thinking is a reasoning model in the Qwen3 Next line, optimized for deliberate multi-step problem solving.
Qwen3 Next Thinking by Alibaba is available through 38 API providers on LMSpeed. Compare API pricing from $0.0010 to $1998.00 per million input tokens across providers. Free API access is offered by 4 providers. In speed benchmarks, the fastest provider reaches 107 tok/s.
Compare speed and latency performance across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| NVIDIA NIM qwen/qwen3-next-80b-a3b-thinking | 106.59 tok/s | 9.02s | 20 |
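
The speed and latency figures can be combined into a rough estimate of how long a full response takes. The sketch below assumes that the Latency column is the wait before the first token arrives and that Speed is the steady output rate after that. The page does not define either column, so treat both as assumptions. The function name `estimate_response_seconds` and the 1000-token response size are hypothetical, chosen only for illustration.

```python
def estimate_response_seconds(latency_s: float, speed_tok_s: float, output_tokens: int) -> float:
    """Rough wall-clock estimate: wait for the first token, then stream the rest.

    Assumes latency_s is time to first token and speed_tok_s is the
    sustained output rate; neither definition is confirmed by the listing.
    """
    if speed_tok_s <= 0:
        raise ValueError("speed_tok_s must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must not be negative")
    return latency_s + output_tokens / speed_tok_s


# Figures from the NVIDIA NIM row in the table; 1000 tokens is an arbitrary example size.
estimate = estimate_response_seconds(latency_s=9.02, speed_tok_s=106.59, output_tokens=1000)
print(f"Estimated time for 1000 output tokens: {estimate:.1f} s")
```

Under these assumptions, latency and streaming time contribute about equally for a response of this size, so for shorter responses the latency figure matters more than the tok/s figure when comparing providers.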
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
kimi-k2-5
Moonshot Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.
gpt-oss
GPT-OSS is an open-source language model offering advanced reasoning, code generation, and multimodal capabilities.
gpt-5-4
OpenAI GPT-5.4 is a large language model in the GPT-5 series, providing enhanced reasoning, coding, and multimodal capabilities.
minimax-m2-7
MiniMax M2.7 is a large language model in the MiniMax series, offering advanced reasoning, code generation, and multimodal capabilities.
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.