DeepSeek V4 Flash is a fast, cost-efficient language model in the DeepSeek V4 family, optimized for low-latency chat, coding assistance, and high-throughput API workloads while retaining strong reasoning quality.
Compare DeepSeek V4 Flash API pricing across 178 providers. Input prices range from $0.0007 to $1398.60 per million input tokens. VSLLM offers the lowest rate at $0.0007/M. 15 providers offer free API credits or a free tier.
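As a rough illustration of how per-million-token prices translate into the cost of a single request, the sketch below multiplies token counts by the listed rates. The function name `request_cost` and the token counts are hypothetical; the $0.140 and $0.280 rates are taken from the Seamee API row of the table on this page.

```python
TOKENS_PER_MILLION = 1_000_000


def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimate the USD cost of one request from $/M-token prices.

    Hypothetical helper: it assumes billing is strictly linear in tokens,
    with no minimum charge, caching discount, or rounding by the provider.
    """
    input_cost = input_tokens / TOKENS_PER_MILLION * input_price_per_m
    output_cost = output_tokens / TOKENS_PER_MILLION * output_price_per_m
    return input_cost + output_cost


# Example: 10,000 input tokens and 2,000 output tokens
# at $0.140/M input and $0.280/M output.
cost = request_cost(10_000, 2_000, 0.140, 0.280)
print(f"${cost:.5f}")
```

Actual invoices may differ, since providers can apply their own rounding, minimums, or credit schemes.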
| Provider | Model Variant | Audit | Input ($/M) | Output ($/M) | Speed (t/s) | First token (s) |
|---|---|---|---|---|---|---|
| Seamee API 100% | deepseek-v4-flash | — | $0.140 | $0.280 | 143.0 | 1.76 |
| Fengsili API 98.7% | deepseek-v4-flash | — | $75.00 | $75.00 | 86.3 | 4.08 |
| 钠 API 100% | deepseek-v4-flash | — | $0.137 (-2%) | $0.274 (-2%) | 84.8 | 6.39 |
| | deepseek-ai/DeepSeek-V4-Flash | — | $0.0027 (-98%) | $0.0027 (-99%) | — | — |
| Tokeness 99.9% | deepseek-v4-flash | 648468100 | $0.084 (-40%) | $0.168 (-40%) | 73.1 | 2.52 |
| PICO API 100% | deepseek-v4-flash | 10068100100 | $0.137 (-2%) | $0.274 (-2%) | 62.1 | 2.19 |
| ChooseC API 99.6% | deepseek-v4-flash | — | $1.00 | $2.00 | 56.4 | 3.93 |
| PICO AI 1.7% | deepseek-v4-flash | — | $1.00 | $2.00 | 49.2 | 35.50 |
| 星见雅 API 100% | deepseek-v4-flash | — | Free | Free | 46.9 | 10.09 |
| | Deepseek-V4-Flash | — | Free | Free | — | — |
| | deepseek-ai/deepseek-v4-flash | — | Free | Free | — | — |
| | 英伟达/deepseek-ai/deepseek-v4-flash | — | Free | Free | — | — |
| CatClaw API 100% | deepseek-v4-flash | 766886100 | $0.0080 (-94%) | $0.0080 (-97%) | 42.3 | 3.08 |
| 6i20% | deepseek-v4-flash | — | $0.500 | $1.00 | 31.3 | 6.18 |
| 9527 API 99.6% | deepseek-v4-flash | — | $0.120 (-14%) | $0.210 (-25%) | 31.1 | 7.88 |
| | deepseek-ai/deepseek-v4-flash | — | $0.140 | $0.280 | — | — |
| 涵冰API 99.7% | deepseek-v4-flash | — | $0.020 (-86%) | $0.030 (-89%) | 21.1 | 2.79 |
| V-API 100% | DeepSeek-V4-Flash | — | $0.020 (-86%) | $0.020 (-93%) | — | — |
| | deepseek-v4-flash | — | $1.00 | $2.00 | — | — |
| Thorbase 100% | deepseek/deepseek-v4-flash | — | $0.098 (-30%) | $0.196 (-30%) | — | — |
Pricing data is drawn from providers' public APIs.
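The percentage figures attached to some prices and the two latency columns can be combined into quick back-of-the-envelope comparisons. The sketch below rests on two assumptions that the page does not state: that the percentages are discounts relative to a reference price of $0.140/M input and $0.280/M output (the listed figures are consistent with this), and that the speed column measures output tokens per second after the first token arrives. The function names are hypothetical.

```python
# Assumed reference prices in $/M tokens; an inference from the table,
# not something the page states.
REFERENCE_INPUT = 0.140
REFERENCE_OUTPUT = 0.280


def discount_pct(price: float, reference: float) -> int:
    """Percentage difference of a price from the reference, rounded to a whole percent."""
    return round((price / reference - 1) * 100)


def estimated_response_seconds(first_token_s: float, speed_tps: float,
                               output_tokens: int) -> float:
    """Rough total response time: time to first token plus generation time.

    Assumes a constant generation speed, which real providers rarely sustain.
    """
    return first_token_s + output_tokens / speed_tps


# Tokeness row: $0.084 input, $0.168 output.
print(discount_pct(0.084, REFERENCE_INPUT))    # -40
print(discount_pct(0.168, REFERENCE_OUTPUT))   # -40

# Seamee API row: 1.76 s to first token, 143.0 t/s, for a 500-token reply.
print(f"{estimated_response_seconds(1.76, 143.0, 500):.2f} s")
```

Because the figures come from community tests and periodic probes, such estimates are best used for ranking providers against each other, not for predicting exact latency.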
deepseek-v4-pro
DeepSeek V4 Pro is the professional-tier DeepSeek V4 model, targeting frontier reasoning, coding, and agent workflows with maximum capability.
glm-5-1
Zhipu GLM-5.1 is a next-generation GLM model aimed at frontier reasoning, coding, and bilingual agent applications.
gpt-5-4
OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
deepseek-v3-2
DeepSeek V3.2 is an upgraded V3-series MoE model with stronger reasoning, coding, and math performance, widely available through OpenAI-compatible API relays.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only, not official data. Standard benchmark data may include BenchLM and other public sources.