Free is available through 10 API providers on LMSpeed. Compare API pricing from $0.0050 to $75.00 per million input tokens across providers. Free API access is offered by 6 providers. In speed benchmarks, the fastest provider reaches 70 tok/s.
Также известна как
Сравните производительность по скорости и задержке у всех API-провайдеров.
| Провайдер | Скорость | Задержка | Тесты |
|---|---|---|---|
Kilo kilo-auto/free | 70.28 tok/s | 11.23s | 7 |
free:QwQ-32B | 27.71 tok/s | 26.13s | 6399 |
free:QwQ-32B | 26.42 tok/s | 17.17s | 55 |
free:Qwen3-30B-A3B | 19.74 tok/s | 25.30s | 6532 |
free:Qwen3-30B-A3B | 19.62 tok/s | 7.50s | 30 |
Показано 1–5 из 5 провайдеров
gpt-oss
GPT-OSS is an open-source language model offering advanced reasoning, code generation, and multimodal capabilities.
glm-4-5-air
Zhipu AI GLM-4.5 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
step-3-5-flash
Step 3.5 Flash is a fast and efficient language model, optimized for quick responses and high throughput.
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.