KZW API offers an OpenAI-compatible API gateway for accessing multiple AI models at competitive rates.
KZW API offers 35 LLM API models.
API pricing per token ranges from $0.300 to $75.00/M (input).
Speed benchmark average: 117 tok/s.

https://newapi.kzwbelieve.tophttp://newapi.kzwbelieve.topRankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.
| Model | Speed | Latency | Tests |
|---|---|---|---|
48.66 tok/s | 1.94s | 5 | |
185.65 tok/s | 10.41s | 5 |
| Model | Audit | Input ($/M) | Output ($/M) |
|---|---|---|---|
| — | $0.300 | $2.50 | |
| — | $0.750 | $6.00 | |
| — | $1.00 | $5.00 | |
| — | $1.50 |
| Time | Model | Speed | Latency |
|---|---|---|---|
| Feb 28, 04:33 AM | claude-sonnet-4-6 | 48.66 tok/s | 1.94s |
| Feb 28, 04:30 AM | gemini-2.5-flash | 185.65 tok/s | 10.41s |
| $12.00 |
| — | $1.50 | $12.00 |
| — | $2.50 | $20.00 |
| — | $2.50 | $20.00 |
| — | $3.00 | $15.00 |
gpt-5.5(xhigh) | — | $5.00 | $40.00 |
| — | $5.00 | $40.00 |
api.newagiai.com
Provides API access to a wide range of AI models including text, image, and video generation from multiple providers.
vip.zen-ai.top
Provides API relay services for AI models including Azure OpenAI, Gemini, Claude, and Grok with flexible resource grouping and multi-cloud routing.
api.holdai.top
毫秒API provides a stable, high-bandwidth API forwarding service for OpenAI-compatible models, including GPT, Claude, and Midjourney, with global server deployment and transparent pricing.
api.aigpt4.top
Provides access to multiple AI models including GPT, Claude, DeepSeek, and Grok through a unified API interface.
api.bltcy.cn
柏拉图AI (api.bltcy.cn) is a multi-dimensional API integration platform providing access to over 600 AI models, serving as an alternate endpoint.
api.linkapi.org
Xiaodoubao API is a Chinese AI API relay platform hosted at api.linkapi.org, advertising low-cost and high-concurrency access to OpenAI, Claude, Gemini, Grok, Flux, and agent workflows.