glm-4-flash
Developer: Zhipu AI
Also known as
GLM-4 Flash by Zhipu AI is available through 36 API providers on LMSpeed. Compare API pricing from $0.0001 to $75.00 per million input tokens across providers. Free API access is offered by 6 providers. In speed benchmarks, the fastest provider reaches 78 tok/s.
Compare GLM-4 Flash API pricing across 30 providers. Input prices range from $0.0001 to $75.00 per million input. ChatST API offers the lowest rate at $0.0001/M.
| Provider | Model Variant | Input ($/M) | Output ($/M) | Speed (t/s) |
|---|---|---|---|---|
| glm-4-flash | Free | Free | — | |
| glm-4-flash | Free | Free | — | |
| glm-4-flash-250414 | Free | Free | — | |
| SWT-API | glm-4-flash | Free | Free | — |
| glm-4-flash | Free | Free | — | |
| Chlink API | glm-4-flash | Free | Free | — |
| ChatST API | glm-4-flash-250414 | $0.0001 | $0.0001 | — |
| glm-4-flash | $0.0020 | $0.0020 | — | |
| glm-4-flash-250414 | $0.0020 | $0.0002 | — | |
| glm-4-flash | $0.010 | $0.010 | — | |
| glm-4-flash | $0.010 | $0.010 | — | |
| 素墨API | GLM-4-Flash | $0.010 | $0.010 | — |
| 素墨API | glm-4-flash | $0.010 | $0.010 | — |
| 素墨API | glm-4-flash-250414 | $0.010 | $0.010 | — |
| EasyMore | glm-4-flash | $0.014 | $0.014 | — |
| 钱多多 API | glm-4-flash | $0.020 | $0.020 | — |
| glm-4-flash | $0.027 | $0.027 | — | |
| glm-4-flash | $0.030 | $0.030 | — | |
| glm-4-flash | $0.100 | $0.100 | — | |
| Seamee API | glm-4-flash | $0.100 | $0.100 | — |
Pricing data from provider public APIs
GLM-4 Flash is free to use through 6 providers with no per-token charges. These providers offer free API credits or a free tier:
Compare speed and latency performance across all API providers.
Showing 1-11 of 11 providers
| Provider | Speed | Latency | Tests |
|---|---|---|---|
glm-4-flash | 78.17 tok/s | 1.57s | 20 |
glm-4-flash-250414 | 49.95 tok/s | 1.78s | 15 |
GLM-4-Flash | 47.16 tok/s | 0.36s | 5 |
glm-4-flash-250414 | 46.30 tok/s | 0.36s | 40 |
QYES AI GLM-4-Flash-250414 | 39.37 tok/s | 0.86s | 10 |
GLM-4-Flash-250414 | 38.23 tok/s | 0.78s | 5 |
zhipu/glm-4-flash | 34.86 tok/s | 0.92s | 5150 |
Fitue API glm-4-flash | 34.44 tok/s | 0.79s | 35 |
QYES AI zhipu/glm-4-flash | 31.45 tok/s | 1.81s | 5 |
AI Tools zhipu/glm-4-flash | 29.33 tok/s | 0.91s | 6185 |
glm-4-flash | 27.97 tok/s | 0.51s | 5 |
Latest benchmark results measuring API response speed and first-token latency.
| Time | Model | Speed | Latency |
|---|---|---|---|
| 04/17/2026, 24:50 | zhipu/glm-4-flash | 29.60 tok/s | 3.32s |
| 04/17/2026, 24:50 | zhipu/glm-4-flash | 39.14 tok/s | 0.52s |
| 04/17/2026, 24:50 | zhipu/glm-4-flash | 37.36 tok/s | 0.55s |
| 04/17/2026, 24:50 | zhipu/glm-4-flash | 32.97 tok/s | 0.84s |
| 04/17/2026, 24:50 | zhipu/glm-4-flash | 34.17 tok/s | 0.54s |
| 04/16/2026, 21:05 | zhipu/glm-4-flash | 40.10 tok/s | 0.94s |
| 04/16/2026, 21:05 | zhipu/glm-4-flash | 34.10 tok/s | 0.56s |
| 04/16/2026, 21:05 | zhipu/glm-4-flash | 30.12 tok/s | 0.68s |
| 04/16/2026, 21:05 | zhipu/glm-4-flash | 28.91 tok/s | 0.58s |
| 04/16/2026, 21:05 | zhipu/glm-4-flash | 33.16 tok/s | 1.46s |
| 04/16/2026, 21:01 | zhipu/glm-4-flash | 38.32 tok/s | 0.67s |
| 04/16/2026, 21:01 | zhipu/glm-4-flash | 36.75 tok/s | 0.67s |
| 04/16/2026, 21:01 | zhipu/glm-4-flash | 38.26 tok/s | 0.59s |
| 04/16/2026, 21:01 | zhipu/glm-4-flash | 44.54 tok/s | 0.96s |
| 04/16/2026, 21:01 | zhipu/glm-4-flash | 38.37 tok/s | 0.53s |
| 04/16/2026, 20:18 | zhipu/glm-4-flash | 42.90 tok/s | 0.55s |
| 04/16/2026, 20:18 | zhipu/glm-4-flash | 32.54 tok/s | 0.69s |
| 04/16/2026, 20:18 | zhipu/glm-4-flash | 27.61 tok/s | 0.55s |
| 04/16/2026, 20:18 | zhipu/glm-4-flash | 30.10 tok/s | 0.68s |
| 04/16/2026, 20:18 | zhipu/glm-4-flash | 39.54 tok/s | 0.70s |
