Qwen3.5 Flash is available through 38 API providers on LMSpeed. Compare API pricing from $0.010 to $150.00 per million input tokens across providers. Free API access is offered by 2 providers. Context window: 1,000,000 tokens. In speed benchmarks, the fastest provider reaches 136 tok/s.
Alibaba Qwen3.5 Flash is a fast multimodal language model in the Qwen series, delivering cost-efficient text, image, and video understanding with a 1M-token context window.
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| AI Claw API Qwen3.5-Flash | 136.18 tok/s | 3.39s | 5 |
| qwen3.5-flash | 114.78 tok/s | 8.57s | 30 |
| qwen3.5-flash | 98.67 tok/s | 8.55s | 5 |
Showing 1–3 of 3 providers
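Speed and latency pull in different directions for short and long responses, so it can help to combine them into one rough figure. The sketch below is illustrative only: it assumes the latency column is a one-time delay before output starts and that tokens then arrive at the listed speed, which the page does not state. The function names and the 1,000-token response size are hypothetical choices.

```python
# Figures copied from the table above: (label, tokens per second, latency in seconds).
# The labels for the two unnamed rows are made up here to tell them apart.
providers = [
    ("AI Claw API Qwen3.5-Flash", 136.18, 3.39),
    ("qwen3.5-flash (30 tests)", 114.78, 8.57),
    ("qwen3.5-flash (5 tests)", 98.67, 8.55),
]


def estimated_seconds(speed_tok_s, latency_s, output_tokens):
    """Assumed model: latency is paid once, then tokens stream at a constant rate."""
    return latency_s + output_tokens / speed_tok_s


def rank_by_estimate(rows, output_tokens):
    """Return (label, estimated seconds) pairs, fastest first."""
    estimates = [
        (name, estimated_seconds(speed, latency, output_tokens))
        for name, speed, latency in rows
    ]
    return sorted(estimates, key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, seconds in rank_by_estimate(providers, 1000):
        print(f"{name}: ~{seconds:.1f}s for 1000 output tokens")
```

Under these assumptions the ordering for a 1,000-token response matches the speed ranking, but for very short responses the latency term dominates, so the gap between the first row and the other two widens in relative terms.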
gpt-5-4
OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.
qwen3-6-plus
Alibaba Qwen3.6 Plus is an enhanced Qwen3.6-tier model optimized for reasoning, coding, and long-context tasks with balanced cost and performance.
qwen3-5-plus
Alibaba Qwen3.5 Plus is an enhanced Qwen3.5 model with stronger reasoning, tool use, and multilingual generation for demanding production workloads.
gpt-5-3-codex
OpenAI GPT-5.3 Codex is a code-specialized variant in the GPT-5 series, optimized for code generation, debugging, and software development tasks.
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
claude-sonnet-4-6
Anthropic Claude Sonnet 4.6 extends the Sonnet line with improved tool use, coding reliability, and long-context performance for everyday production workloads.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and are not official data.