A unified API gateway providing access to multiple large language models with direct connectivity in China.
YUNWU API offers 711 LLM API models.
Speed benchmark average: 113 tok/s.
YUNWU API is an API aggregator, offering models from multiple vendors.

https://yunwu.aiUS high-defense load balancing site cluster
https://api.apiplus.orgGlobal CDN, average domestic access speed
https://api3.wlai.vipDomestic high-defense server line
https://api.zhongzhuan.chatConvenient for reselling KEYs
https://yunwu.zeabur.appRankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.
| Model | Speed | Latency | Tests |
|---|---|---|---|
82.67 tok/s | 6.57s | 5 | |
147.00 tok/s | 4.37s | 10 | |
| Time | Model | Speed | Latency |
|---|---|---|---|
| May 15, 04:00 PM | gpt-5-mini | 82.67 tok/s | 6.57s |
| Apr 24, 11:02 AM | gpt-5.4-nano | 19.82 tok/s | 7.05s |
| Apr 24, 11:00 AM | deepseek-v4-flash | 67.17 tok/s | 2.68s |
| Apr 24, 05:00 AM | deepseek-v4-flash | 80.40 tok/s | 2.96s |
| Apr 1, 11:04 AM | gpt-5-nano-2025-08-07 | 111.99 tok/s | 10.26s |
| Apr 1, 11:00 AM | gpt-5-nano | 91.01 tok/s | 14.07s |
| Mar 26, 05:04 AM | gpt-5.4-mini | 200.70 tok/s | 1.44s |
| Mar 26, 05:02 AM | gpt-5.4-nano | 274.17 tok/s | 1.68s |
| Mar 26, 05:00 AM | mimo-v2-flash | 49.16 tok/s | 2.81s |
| Mar 21, 01:48 AM | grok-4-1-fast-non-reasoning | 111.23 tok/s | 3.38s |
1. Offers multiple billing models: by request count, by token quantity, etc. 2. Real-time display of user's API usage and costs.
Common reasons for API request failures and solutions: 1. Authentication error: Check if the API key is correct 2. Insufficient balance: Please recharge your account in time 3. Parameter error: Refer to the documentation to check request parameters 4. Model unavailable: Try switching to another available model 5. Request timeout: May be due to network issues or high service load, please retry later If unresolved, contact online customer service.
After logging in, you can view detailed API call records on the "Usage Log" page, including time, model, consumed token quantity, and cost information.
1. We do not store your request content and response data 2. All API requests use TLS encrypted transmission 3. Strict access control and permission management 4. Regular security audits and vulnerability scans.
We promise 99.9% service availability, ensured through global distributed deployment and load balancing for stable service. For enterprise users, we provide Service Level Agreement (SLA) guarantees.
73.79 tok/s |
2.82s |
| 10 |
111.99 tok/s | 10.26s | 5 |
91.01 tok/s | 14.07s | 5 |
200.70 tok/s | 1.44s | 5 |
mimo-v2-flash | 60.50 tok/s | 2.09s | 10 |
grok-4-1-fast-non-reasoning | 111.23 tok/s | 3.38s | 5 |
57.59 tok/s | 2.58s | 5 |
105.66 tok/s | 9.62s | 10 |
1. Consult the detailed development documentation 2. Contact online customer service support.
We provide example code and SDKs for multiple programming languages, including Python, Node.js, Java, etc., see the "Documentation" section at the top for details.
api.kr777.top
CaMeL AI provides an OpenAI-compatible API gateway with extensive model coverage and pricing options.
api2.aigcbest.top
Provides AI-generated content APIs for various applications, including text and image generation.
api.vectorengine.ai
Vector Engine provides an API platform aggregating access to over 500 AI large models with OpenAI API compatibility and global deployment.
yansd666.com
An API platform providing unified access to over 500 AI models, including OpenAI, Claude, and Gemini.
api.n1n.ai
N1N provides API access to a wide range of AI models including GPT-4, Claude 3, Gemini, and others for text, image, and video generation.
api.bltcy.cn
柏拉图AI (api.bltcy.cn) is a multi-dimensional API integration platform providing access to over 600 AI models, serving as an alternate endpoint.