qwen3-instruct
Developer: Alibaba
Alibaba Qwen3 Instruct is an instruction-tuned variant in the Qwen3 family, optimized for chat, tool use, and general assistant applications.
Also known as
Qwen3 Instruct by Alibaba is available through 48 API providers on LMSpeed. Compare API pricing from $0.0010 to $75.00 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 910 tok/s.
Compare speed and latency performance across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
GPT Load (PP.UA) qwen-3-235b-a22b-instruct-2507 | 910.44 tok/s | 0.50s | 5 |
qwen-3-235b-a22b-instruct-2507 | 910.42 tok/s | 0.63s | 5 |
qwen-3-235b-a22b-instruct-2507 | 572.89 tok/s | 2.84s | 70 |
accounts/fireworks/models/qwen3-235b-a22b-instruct-2507 | 78.64 tok/s | 0.62s | 5 |
qwen3-235b-a22b-instruct-2507 | 65.96 tok/s | 1.19s | 5 |
Qwen3-235B-A22B-Instruct-2507 | 43.21 tok/s | 1.05s | 5 |
qwen3-235b-a22b-instruct-2507 | 43.21 tok/s | 0.95s | 15 |
qwen3-235b-a22b-instruct | 24.36 tok/s | 0.39s | 10 |
Qwen/Qwen3-235B-A22B-Instruct-2507 | 20.96 tok/s | 1.49s | 15 |
Showing 1-9 of 9 providers
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-v3
DeepSeek V3 is a large language model in the DeepSeek series, offering advanced reasoning and general-purpose text generation capabilities.
gemini-2-5-flash
Google Gemini 2.5 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
gpt-oss
GPT-OSS is an open-source language model offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-v3-1
DeepSeek V3.1 is an open-weights style frontier model from DeepSeek with strong math, coding, and Chinese-English bilingual reasoning.
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.