毫秒API is an API forwarding service that offers enterprise-grade, high-bandwidth access to AI models. It supports OpenAI-compatible interfaces, including GPT-3.5, GPT-4, Claude, Midjourney, and other models, with real-time updates. Key features include:

Global server deployment across 8 regions (e.g., US, Japan, Singapore) using CN2 lines and load balancing for low latency and high stability.
Transparent, usage-based pricing with a conversion rate of approximately 1.2 RMB to 1 USD, and VIP/SVIP discounts available.
Support for multiple model groups (e.g., reverse-engineered, Azure, pure OpenAI) to cater to different needs, with high concurrency handling up to 1 million TPM.
Full compatibility with OpenAI API protocols, enabling seamless integration into existing applications.
Additional services include Midjourney proxy support, 7*24-hour self-service recharge, and proxy distribution options.
Use cases include AI chat, file analysis, multimodal tasks, text-to-speech, and image generation via Midjourney.

Model

Input ($/M)

Output ($/M)

Audit

Speed

Latency

gemini-3.1-flash-image-previewdefault

$0.100/M

—

gemini-3.1-flash-image-preview-2kdefault

$0.100/M

—

gemini-3.1-flash-image-preview-4kdefault

$0.137/M

—

gemini-3-pro-image-previewdefault

$0.200/M

—

gemini-3-pro-image-preview-2kdefault

$0.200/M

—

gemini-3-pro-image-preview-4kdefault

$0.275/M

—

claude-opus-4-7default

$5.00/M

$25.00/M

—

gemini-3.1-flash-lite-previewdefault

$0.250/M

$0.750/M

—

gpt-4o-mini-transcribedefault

$1.25/M

$5.00/M

—

gpt-4o-transcribedefault

$2.50/M

$10.00/M

—

chatgpt-4o-latest

—

100.7 t/s

3.47 s

gemini-2.0-pro-exp

—

70.1 t/s

7.13 s

Time

Model

Speed

Latency

Feb 20, 03:23 AM

gemini-2.0-pro-exp

70.15 tok/s

7.13s

Feb 20, 03:21 AM

chatgpt-4o-latest

100.72 tok/s

3.47s

Provider

Why compare

Models

Free

Avg price

Speed

30d uptime

毫秒API

haomiao-api

毫秒API provides a stable, high-bandwidth API forwarding service for OpenAI-compatible models, including GPT, Claude, and Midjourney, with global server deployment and transparent pricing.

Current provider baseline

729

1,037

N/A

82 tok/s

99.5%

向量引擎

api-vectorengine-ai

Vector Engine provides an API platform aggregating access to over 500 AI large models with OpenAI API compatibility and global deployment.

Faster measured speed
Higher 30-day availability

182

$5.75/M

89 tok/s

99.6%

api-n1n-ai

N1N provides API access to a wide range of AI models including GPT-4, Claude 3, Gemini, and others for text, image, and video generation.

Faster measured speed
Higher 30-day availability

177

$8.65/M

90 tok/s

99.7%

api-kr777-top

CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.

Faster measured speed

194

$88.30/M

93 tok/s

99%

api-futureppo-top

Future Hub is a large-scale OpenAI-compatible API gateway at api.futureppo.top with extensive model coverage and pricing data.

Faster measured speed

132

223

N/A

263 tok/s

0.6%

meta-api

Provides API services with a model marketplace and developer tools.

Higher 30-day availability

$31.81/M

N/A

99.8%

ccll-xyz

An OpenAI-compatible API relay providing access to multiple AI models.

Higher 30-day availability

$134.86/M

N/A

99.6%

Notes

Health checks: Scope: the 72-hour chart and recent availability measure API connectivity only. Each bar summarizes one hour of checks. Targets: LMSpeed tries the configured health check URL and provider status URL first, then API endpoints derived from known API hosts and recent speed-test base URLs. A website host is considered only when it looks like an API endpoint. Probe steps: each candidate goes through DNS lookup, TCP connection, TLS handshake for HTTPS, and an HTTP HEAD request with redirects followed. Probing stops after the first reachable candidate. Reachable criteria: every required network step must succeed. An HTTP response below 500 is treated as reachable, including 401 because it confirms that an authenticated API endpoint responded, except for statuses classified as blocked. Blocked results: HTTP 403, 429, 521, 525, and 530, plus detected WAF or Cloudflare challenges, are shown as blocked and excluded from availability calculations because LMSpeed cannot determine whether the API itself is down. Model availability: when a dedicated test key is configured, LMSpeed sends an authenticated GET request to a derived /models endpoint and compares returned model IDs with this provider's listed models. These per-model results appear in Models & Pricing and are not included in the provider connectivity percentage. Timeouts: TCP connection, TLS handshake, HTTP connectivity, and model requests each use a 5-second timeout. A full run can take longer when several candidates are tried. Frequency: a background worker checks all providers every 5 minutes by default. The 72-hour chart combines those samples into hourly bars, and the schedule may be changed by the service operator. Limit: automated samples are not an SLA and do not guarantee account quota, every model, every region, or successful completion requests. Check the provider's own status page before making operational decisions.

Domain Rating data is sourced from Ahrefs. It is a 0–100 backlink-based domain strength signal and does not measure API speed or reliability.

Announcements and FAQ are read from this provider's NewAPI status snapshot when available. LMSpeed stores the original content and optional English translations from the provider status source, then shows the localized fields on this page.

毫秒API

毫秒API

Features

API Endpoints

Health Check

API Benchmarks & Pricing

Recent Test Records

Similar API Provider Alternatives to Compare

Notes

Similar API Provider Alternatives to Compare

Provider	Why compare	Models	Free	Avg price	Speed	30d uptime
毫秒API haomiao-api 毫秒API provides a stable, high-bandwidth API forwarding service for OpenAI-compatible models, including GPT, Claude, and Midjourney, with global server deployment and transparent pricing.	Current provider baseline	729	1,037	N/A	82 tok/s	99.5%
向量引擎 api-vectorengine-ai Vector Engine provides an API platform aggregating access to over 500 AI large models with OpenAI API compatibility and global deployment.	Faster measured speed Higher 30-day availability	182	6	$5.75/M	89 tok/s	99.6%
api-n1n-ai N1N provides API access to a wide range of AI models including GPT-4, Claude 3, Gemini, and others for text, image, and video generation.	Faster measured speed Higher 30-day availability	177	6	$8.65/M	90 tok/s	99.7%
api-kr777-top CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.	Faster measured speed	194	4	$88.30/M	93 tok/s	99%
api-futureppo-top Future Hub is a large-scale OpenAI-compatible API gateway at api.futureppo.top with extensive model coverage and pricing data.	Faster measured speed	132	223	N/A	263 tok/s	0.6%
meta-api Provides API services with a model marketplace and developer tools.	Higher 30-day availability	34	0	$31.81/M	N/A	99.8%
ccll-xyz An OpenAI-compatible API relay providing access to multiple AI models.	Higher 30-day availability	23	0	$134.86/M	N/A	99.6%

毫秒API

毫秒API

Features

API Endpoints

About 毫秒API

Health Check

API Benchmarks & Pricing

Recent Test Records

Similar API Provider Alternatives to Compare

Notes

Similar API Provider Alternatives to Compare