Qwen2 Instruct by Alibaba is available through 30 API providers on LMSpeed. Compare API pricing from $0.0003 to $75.00 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 84 tok/s.
Alibaba Qwen2 Instruct is an instruction-tuned language model in the Qwen2 series, widely used for chat, coding, and open-weight deployments.
Также известна как
Сравните производительность по скорости и задержке у всех API-провайдеров.
| Провайдер | Скорость | Задержка | Тесты |
|---|---|---|---|
联无所AI Qwen/Qwen2-7B-Instruct | 83.88 tok/s | 0.59s | 30 |
Pro/Qwen/Qwen2-7B-Instruct | 83.77 tok/s | 0.60s | 10 |
qwen2-instruct | 47.93 tok/s | 0.61s | 10 |
Показано 1–3 из 3 провайдеров
deepseek-v3-1
DeepSeek V3.1 is an open-weights style frontier model from DeepSeek with strong math, coding, and Chinese-English bilingual reasoning.
deepseek-v3-2
DeepSeek V3.2 is an upgraded V3-series MoE model with stronger reasoning, coding, and math performance, widely available through OpenAI-compatible API relays.
gpt-oss
GPT-OSS is an open-weight language model family designed for self-hosted inference, research, and cost-efficient alternatives to proprietary GPT-class models.
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
deepseek-v3
DeepSeek V3 is DeepSeek flagship MoE language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.