Qwen3 VL Thinking by Alibaba is available through 58 API providers on LMSpeed. Compare API pricing from $0.0061 to $75.00 per million input tokens across providers. Free API access is offered by 1 provider. Context window: 256,000 tokens. In speed benchmarks, the fastest provider reaches 52 tok/s.
Alibaba Qwen3 VL Thinking is a vision-language reasoning model in the Qwen3 family, combining image inputs with extended deliberation for complex tasks.
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| Fireworks AI (accounts/fireworks/models/qwen3-vl-235b-a22b-thinking) | 51.54 tok/s | 1.25 s | 5 |
| Qwen/Qwen3-VL-32B-Thinking | 38.59 tok/s | 19.90 s | 79 |
Showing 1–2 of 2 providers
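The speed and latency figures in the table above can be combined into a rough estimate of total response time. The sketch below assumes that latency means the delay before the first token and that speed is a steady output rate. The page does not define either metric, so treat both as assumptions. The function name `estimated_seconds` and the `providers` dictionary are hypothetical and only serve the illustration.

```python
# Rough estimate only: assumes "latency" is time to first token and
# "speed" is a constant output rate. Neither definition is confirmed by the page.

def estimated_seconds(latency_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Return the approximate wall-clock time to receive output_tokens tokens."""
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return latency_s + output_tokens / tokens_per_s


# (latency in seconds, speed in tok/s), copied from the table above.
# The second row lists no provider name, so its model ID is used as the key.
providers = {
    "accounts/fireworks/models/qwen3-vl-235b-a22b-thinking": (1.25, 51.54),
    "Qwen/Qwen3-VL-32B-Thinking": (19.90, 38.59),
}


def rank(output_tokens: int) -> list[str]:
    """Order the table entries from fastest to slowest for a given output length."""
    return sorted(
        providers,
        key=lambda name: estimated_seconds(*providers[name], output_tokens),
    )


if __name__ == "__main__":
    for name in rank(1000):
        latency_s, tokens_per_s = providers[name]
        total = estimated_seconds(latency_s, tokens_per_s, 1000)
        print(f"{name}: about {total:.1f} s for 1000 output tokens")
```

Because thinking models often emit long reasoning traces before the final answer, the output length passed in should include those reasoning tokens if the provider streams them.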
deepseek-r1
DeepSeek R1 is a reasoning-focused language model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
gemini-3-flash
Google Gemini 3 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
gemini-2-5-pro
Google Gemini 2.5 Pro is Google's advanced multimodal model with a 1M-token context window, strong STEM reasoning, and native support for image, audio, and video understanding.
deepseek-v3
DeepSeek V3 is DeepSeek's flagship MoE language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
gpt-5-4
OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and do not constitute official data.