Qwen3 S2S Flash Realtime is available through 10 API providers on LMSpeed. Compare API pricing from $0.050 to $75.00 per million input tokens across providers.
Alibaba Qwen3 S2S Flash Realtime is a realtime audio model in the Qwen series, supporting low-latency speech and conversational interactions.
Also known as
kimi-k2-5
Moonshot Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
glm-5-1
Zhipu GLM-5.1 is a next-generation GLM model aimed at frontier reasoning, coding, and bilingual agent applications.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
glm-5
Zhipu GLM-5 is Zhipu's flagship GLM-series model, with enhanced reasoning, agent capabilities, and strong performance in Chinese enterprise and coding scenarios.
deepseek-v3
DeepSeek V3 is DeepSeek's flagship Mixture-of-Experts (MoE) language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and do not constitute official data.