LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Model Pricing
  • Model Speed
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
Back to models

Longcat Flash

/longcat-flash
Avg speed79.15t/s
First token7.01s
Total tests81
Providers24
Variants30

Longcat Flash is available through 13 API providers on LMSpeed. Compare API pricing from $0.010 to $749250.00 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 109 tok/s.

Also known as

LongCat-Flash-ChatLongCat-Flash-Chat-2602-Explongcat-flashlongcat-flash-chatmeituan/longcat-flash-chat

Pricing Comparison

Compare Longcat Flash API pricing across 12 providers. Input prices range from $0.010 to $749250.00 per million input. 素墨API offers the lowest rate at $0.010/M. 1 provider offers free API credits or a free tier.

ProviderModel VariantAuditInput ($/M)Output ($/M)Speed (t/s)First token
天絮 API100%
LongCat-Flash-Chat—$5.71$5.71109.5 t/s6.18 s
星见雅 API100%
李/LongCat-Flash-Chat—FreeFree——
MapleLeaf API100%
LongCat-Flash-Chat—$75.00$75.00——
LongCat-Flash-Chat-2602-Exp—$75.00$75.00——
素墨API100%
longcat-flash—$0.010$0.010——
GLM BigModel Relay100%
longcat-flash—$0.200$0.800——
meituan/longcat-flash-chat—$0.200$0.800——
钱多多 API100%
meituan/longcat-flash-chat—$0.734$3.67——
WSocket AI99.3%
LongCat-Flash-Chat-2602-Exp—$749250.00$749250.00——
91VIP6.1%
longcat-flash-chat—$0.400$0.320——
LLM API99.9%
LongCat-Flash-Chat—$0.146$0.146——
Synapse94.2%
LongCat-Flash-Chat—$1.00$1.00——
Futureppo6%
longcat-flash-chat—$0.400$0.320——

Pricing data from provider public APIs

API Speed Benchmarks by Provider

Compare speed and latency performance across all API providers.

ProviderSpeedLatencyTests
天絮 API

LongCat-Flash-Chat

109.48 tok/s
6.18s
10
LongCat API

LongCat-Flash-Chat

83.18 tok/s
8.72s
45
LongCat API

LongCat-Flash-Chat-2602-Exp

82.41 tok/s
7.31s
5
AI Tools

meituan/longcat-flash-chat

56.31 tok/s
3.87s
16
MN API

meituan/longcat-flash-chat:free

52.09 tok/s
3.05s
5

Showing 1-5 of 5 providers

Frequently Asked Questions

Is Longcat Flash API free?
Yes, Longcat Flash is available for free through 1 API provider on LMSpeed, including 星见雅 API. These providers offer free API credits or a free tier with no per-token charges.
How much does Longcat Flash API cost?
Longcat Flash API pricing ranges from $0.010 to $749250.00 per million input tokens across 13 providers. 素墨API offers the cheapest rate at $0.010/M. Output pricing varies by provider.
Which provider has the cheapest Longcat Flash API pricing?
The cheapest Longcat Flash API pricing is offered by 素墨API at $0.010 per million input tokens. Compare all 13 providers above to find the best pricing per token for your use case.

Alternatives & Similar Models

OpenAIGPT-5.4

gpt-5-4

OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.

20 shared providers

DeepSeekDeepSeek V4 Flash

deepseek-v4-flash

DeepSeek V4 Flash is a fast, cost-efficient language model in the DeepSeek V4 family, optimized for low-latency chat, coding assistance, and high-throughput API workloads while retaining strong reasoning quality.

20 shared providers

OpenAIGPT-OSS

gpt-oss

GPT-OSS is an open-weight language model family designed for self-hosted inference, research, and cost-efficient alternatives to proprietary GPT-class models.

20 shared providers

MoonshotAIKimi K2.5

kimi-k2-5

Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.

19 shared providers

DeepSeekDeepSeek V3.2

deepseek-v3-2

DeepSeek V3.2 is an upgraded V3-series MoE model with stronger reasoning, coding, and math performance, widely available through OpenAI-compatible API relays.

19 shared providers

MinimaxMiniMax M2.5

minimax-m2-5

MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.

19 shared providers

Data as of Jun 13, 2026, 04:18 PM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.