Qwen3.1 by Alibaba is available through 10 API providers on LMSpeed. Compare API pricing from $0.0037 to $2.00 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 171 tok/s.
Alibaba Qwen3.1 is a Qwen3 series model with improved reasoning and coding over Qwen3.0, suitable for general chat and lightweight agent tasks.
Также известна как
Сравните производительность по скорости и задержке у всех API-провайдеров.
| Провайдер | Скорость | Задержка | Тесты |
|---|---|---|---|
N1N qwen3-1.7b | 171.33 tok/s | 3.66s | 10 |
qwen3-1.7b | 140.08 tok/s | 4.85s | 10 |
qwen3-1.7b:free | 133.77 tok/s | 5.08s | 5 |
Показано 1–3 из 3 провайдеров
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
deepseek-v3-2
DeepSeek V3.2 is an upgraded V3-series MoE model with stronger reasoning, coding, and math performance, widely available through OpenAI-compatible API relays.
glm-5
Zhipu GLM-5 is Zhipu flagship GLM series model with enhanced reasoning, agent capabilities, and strong performance on Chinese enterprise and coding scenarios.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
glm-4-7
Zhipu GLM-4.7 is a flagship GLM release from Zhipu AI with advanced Chinese-English reasoning, coding, and agent features.
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.