
毫秒API provides a stable, high-bandwidth API forwarding service for OpenAI-compatible models, including GPT, Claude, and Midjourney, with global server deployment and transparent pricing.
Categories
毫秒API offers 4 LLM API models.
API pricing per token ranges from $0.0001 to $350.00/M (input).
Speed benchmark average: 85 tok/s.

api.holdai.top| Model | Speed | Latency | Tests |
|---|---|---|---|
98.49 tok/s | 10.83s | 5 | |
93.21 tok/s | 1.42s | 5 | |
| Model | Input ($/M) | Output ($/M) |
|---|---|---|
| $0.0000 | $0.0000 | |
| $0.0000 | $0.0000 | |
| $0.0000 | $0.0000 | |
| $0.0000 | $0.0000 | |
| $0.0000 | $0.0000 | |
| Time | Model | Speed | Latency |
|---|---|---|---|
| 02/20/2025, 03:27 | gpt-4o-mini | 93.21 tok/s | 1.42s |
| 02/20/2025, 03:26 | gemini-2.0-flash-exp | 98.49 tok/s | 10.83s |
| 02/20/2025, 03:23 |
74.13 tok/s |
8.39s |
| 5 |
72.95 tok/s | 2.66s | 5 |
| $0.0000 |
| $0.0000 |
| $0.0001 | $0.0001 |
| $0.0001 | $0.0001 |
| $0.0001 | $0.0001 |
| $0.0001 | $0.0001 |
74.13 tok/s |
8.39s |
| 02/20/2025, 03:21 | chatgpt-4o-latest | 72.95 tok/s | 2.66s |