LFM 2.5 1.2B Instruct is available through 12 API providers on LMSpeed. Input pricing across these providers ranges from $0.0010 to $75.00 per million tokens, and 2 providers offer free API access.
LFM 2.5 1.2B Instruct is a compact language model in the LFM series, optimized for low-latency responses and efficient inference.
Also known as
gpt-oss
GPT-OSS is an open-weight language model family designed for self-hosted inference, research, and cost-efficient alternatives to proprietary GPT-class models.
glm-4-5-air
Zhipu AI GLM-4.5 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
deepseek-v4-flash
DeepSeek V4 Flash is a fast, cost-efficient language model in the DeepSeek V4 family, optimized for low-latency chat, coding assistance, and high-throughput API workloads while retaining strong reasoning quality.
qwen3-5
Alibaba Qwen3.5 is a Qwen3 generation model with improved reasoning, multilingual support, and efficient inference for chat, coding, and agent applications.
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with state-of-the-art programming and agentic performance, improved token efficiency, and fast, high-throughput (high tokens-per-second) API deployment.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and do not constitute official data.