qwen3-next
Alibaba's Qwen3-Next is a next-generation architecture (80B total parameters, 3B active) that combines hybrid attention with a sparse mixture-of-experts (MoE) design, built to reduce the cost of long-context training and inference.
Qwen3 Next is available through 7 API providers on LMSpeed. Compare API pricing from $0.037 to $75.00 per million input tokens across providers.
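To put that price range in concrete terms, here is a minimal sketch that estimates input-token cost at the two ends of the quoted range. The function name `input_cost_usd` and the 2-million-token workload are illustrative assumptions, not something LMSpeed provides, and output-token pricing is not covered.

```python
def input_cost_usd(input_tokens: int, price_per_million_usd: float) -> float:
    """Cost in USD for a number of input tokens at a per-million-token price."""
    return input_tokens / 1_000_000 * price_per_million_usd


# Price range quoted above, in USD per million input tokens.
LOW_PRICE = 0.037
HIGH_PRICE = 75.00

# Hypothetical workload: 2 million input tokens.
tokens = 2_000_000
low = input_cost_usd(tokens, LOW_PRICE)
high = input_cost_usd(tokens, HIGH_PRICE)

print(f"cheapest: ${low:.3f}, most expensive: ${high:.2f}")
```

The same workload differs in cost by roughly three orders of magnitude between the cheapest and the most expensive listing, so it is worth checking the per-provider price before routing traffic.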
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.