Longcat Flash is available through 13 API providers on LMSpeed. Compare API pricing from $0.010 to $749250.00 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 109 tok/s.
Также известна как
Сравните производительность по скорости и задержке у всех API-провайдеров.
| Провайдер | Скорость | Задержка | Тесты |
|---|---|---|---|
天絮 API LongCat-Flash-Chat | 109.48 tok/s | 6.18s | 10 |
LongCat-Flash-Chat | 83.18 tok/s | 8.72s | 45 |
LongCat-Flash-Chat-2602-Exp | 82.41 tok/s | 7.31s | 5 |
meituan/longcat-flash-chat | 56.31 tok/s | 3.87s | 16 |
meituan/longcat-flash-chat:free | 52.09 tok/s | 3.05s | 5 |
Показано 1–5 из 5 провайдеров
gpt-5-4
OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.
deepseek-v4-flash
DeepSeek V4 Flash is a fast, cost-efficient language model in the DeepSeek V4 family, optimized for low-latency chat, coding assistance, and high-throughput API workloads while retaining strong reasoning quality.
gpt-oss
GPT-OSS is an open-source language model offering advanced reasoning, code generation, and multimodal capabilities.
kimi-k2-5
Moonshot Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.
deepseek-v3-2
DeepSeek V3.2 is a large language model in the DeepSeek V3 series, offering advanced reasoning, code generation, and multimodal capabilities.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.