qwen3-vl-instruct
Developer: Alibaba
Alibaba Qwen3 VL Instruct is a multimodal model in the Qwen3 series, combining vision understanding with instruction-tuned text generation.
Qwen3 VL Instruct by Alibaba is available through 57 API providers on LMSpeed. Compare API pricing from $0.0061 to $75.00 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 106 tok/s.
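To make the per-million-token pricing concrete, here is a minimal sketch of the arithmetic. The function name `input_cost_usd` and the 250,000-token workload are hypothetical and chosen only for illustration. The two prices are the extremes quoted above. Output-token pricing is not covered here.

```python
def input_cost_usd(num_tokens: int, price_per_million_usd: float) -> float:
    """Cost in USD of sending `num_tokens` input tokens at a per-million-token price."""
    return num_tokens / 1_000_000 * price_per_million_usd


# Hypothetical workload size, used only to illustrate the spread between providers.
WORKLOAD_TOKENS = 250_000

low = input_cost_usd(WORKLOAD_TOKENS, 0.0061)   # cheapest quoted input price
high = input_cost_usd(WORKLOAD_TOKENS, 75.00)   # most expensive quoted input price

print(f"cheapest: ${low:.6f}, most expensive: ${high:.2f}")
# prints "cheapest: $0.001525, most expensive: $18.75"
```

For the same workload, the quoted price range spans roughly four orders of magnitude, so it is worth checking the price next to the speed figures.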
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| SiliconFlow Qwen/Qwen3-VL-8B-Instruct | 106.50 tok/s | 0.92s | 10 |
| qwen/qwen3-vl-32b-instruct | 44.94 tok/s | 0.76s | 5 |
| qwen3-vl-235b-a22b-instruct | 41.54 tok/s | 3.23s | 5 |
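The speed and latency columns can point to different entries depending on response length. The sketch below hard-codes the three benchmark rows above and estimates total response time as latency plus generation time. It assumes that "latency" means the delay before the first token and that throughput stays constant, neither of which the page states. The names `Benchmark`, `estimated_seconds`, and `best_for` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Benchmark:
    label: str          # provider/model label as listed in the table
    speed_tok_s: float  # measured output speed, tokens per second
    latency_s: float    # measured latency in seconds (assumed: time to first token)
    tests: int          # number of community tests behind the figures


ROWS = [
    Benchmark("SiliconFlow Qwen/Qwen3-VL-8B-Instruct", 106.50, 0.92, 10),
    Benchmark("qwen/qwen3-vl-32b-instruct", 44.94, 0.76, 5),
    Benchmark("qwen3-vl-235b-a22b-instruct", 41.54, 3.23, 5),
]


def estimated_seconds(row: Benchmark, output_tokens: int) -> float:
    """Rough end-to-end time: latency plus time to generate the output tokens."""
    return row.latency_s + output_tokens / row.speed_tok_s


def best_for(output_tokens: int) -> Benchmark:
    """Entry with the lowest estimated time for a response of the given length."""
    return min(ROWS, key=lambda r: estimated_seconds(r, output_tokens))


fastest = max(ROWS, key=lambda r: r.speed_tok_s)
lowest_latency = min(ROWS, key=lambda r: r.latency_s)

print("highest throughput:", fastest.label)
print("lowest latency:", lowest_latency.label)
print("best for 5 output tokens:", best_for(5).label)
print("best for 500 output tokens:", best_for(500).label)
```

Under these assumptions, the lower-latency 32B entry wins only for very short replies (about a dozen tokens or fewer), and the 8B entry wins beyond that. The figures rest on 5 to 10 tests each, so treat the comparison as indicative.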
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-r1
DeepSeek R1 is a reasoning-focused language model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.
gemini-3-flash
Google Gemini 3 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
gemini-2-5-pro
Google Gemini 2.5 Pro is Google's advanced multimodal model with a 1M-token context window, strong STEM reasoning, and native support for image, audio, and video understanding.
deepseek-v3
DeepSeek V3 is DeepSeek's flagship MoE language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
kimi-k2-5
Moonshot Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and are not official data.