GLM-4.1v Thinking Flash by Zhipu AI is available through 13 API providers on LMSpeed. Compare API pricing from $0.0002 to $75.00 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 96 tok/s.
Avg speed: 87.46 tok/s
First token: 7.46s
Total tests: 230
Providers: 13
Variants: 15
Pricing Comparison
Compare GLM-4.1v Thinking Flash API pricing across 13 providers. Input prices range from $0.0002 to $75.00 per million input tokens. IXIOCCAPI offers the lowest rate at $0.0002/M.
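To put the per-million-token prices in context, the short sketch below converts a price into the cost of a concrete workload. It uses only the lowest and highest input prices quoted above; the function name `input_cost_usd` and the 2,000,000-token workload are illustrative assumptions, and output-token charges (which vary by provider) are not included.

```python
def input_cost_usd(input_tokens: int, price_per_million_usd: float) -> float:
    """Return the input-token cost in USD for a per-million-token price."""
    return input_tokens / 1_000_000 * price_per_million_usd


# Lowest and highest input prices listed on this page (USD per million tokens).
LOWEST_PRICE = 0.0002
HIGHEST_PRICE = 75.00

# Hypothetical workload size, chosen only for illustration.
workload_tokens = 2_000_000

low = input_cost_usd(workload_tokens, LOWEST_PRICE)
high = input_cost_usd(workload_tokens, HIGHEST_PRICE)
print(f"Cheapest listed rate: ${low:.4f}")
print(f"Most expensive listed rate: ${high:.2f}")
```

Because the listed range spans several orders of magnitude, the same workload can cost a fraction of a cent or well over a hundred dollars depending on the provider.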
Latest benchmark results measuring API response speed and first-token latency.
| Time | Model | Speed | Latency |
|---|---|---|---|
| 04/10/2026, 05:13 | zhipu/glm-4.1v-thinking-flash | 118.23 tok/s | 4.15s |
| 04/10/2026, 05:13 | zhipu/glm-4.1v-thinking-flash | 111.85 tok/s | 8.40s |
| 04/10/2026, 05:13 | zhipu/glm-4.1v-thinking-flash | 46.19 tok/s | 8.41s |
| 04/10/2026, 05:13 | zhipu/glm-4.1v-thinking-flash | 6.48 tok/s | 17.10s |
| 04/10/2026, 05:13 | zhipu/glm-4.1v-thinking-flash | 94.01 tok/s | 12.79s |
| 04/08/2026, 07:14 | zhipu/glm-4.1v-thinking-flash | 116.89 tok/s | 2.19s |
| 04/08/2026, 07:14 | zhipu/glm-4.1v-thinking-flash | 104.32 tok/s | 6.37s |
| 04/08/2026, 07:14 | zhipu/glm-4.1v-thinking-flash | 116.34 tok/s | 7.16s |
| 04/08/2026, 07:14 | zhipu/glm-4.1v-thinking-flash | 12.94 tok/s | 9.20s |
| 04/08/2026, 07:14 | zhipu/glm-4.1v-thinking-flash | 107.07 tok/s | 2.56s |
| 04/08/2026, 07:05 | zhipu/glm-4.1v-thinking-flash | 98.72 tok/s | 8.97s |
| 04/08/2026, 07:05 | zhipu/glm-4.1v-thinking-flash | 92.84 tok/s | 7.52s |
| 04/08/2026, 07:05 | zhipu/glm-4.1v-thinking-flash | 105.63 tok/s | 7.86s |
| 04/08/2026, 07:05 | zhipu/glm-4.1v-thinking-flash | 16.63 tok/s | 5.25s |
| 04/08/2026, 07:05 | zhipu/glm-4.1v-thinking-flash | 80.83 tok/s | 4.63s |
| 04/07/2026, 04:14 | zhipu/glm-4.1v-thinking-flash | 67.55 tok/s | 5.53s |
| 04/07/2026, 04:14 | zhipu/glm-4.1v-thinking-flash | 98.55 tok/s | 9.47s |
| 04/07/2026, 04:14 | zhipu/glm-4.1v-thinking-flash | 119.00 tok/s | 6.59s |
| 04/07/2026, 04:14 | zhipu/glm-4.1v-thinking-flash | 61.72 tok/s | 5.17s |
| 04/07/2026, 04:14 | zhipu/glm-4.1v-thinking-flash | 122.37 tok/s | 2.89s |
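Speed and latency together determine how long a full response takes. The sketch below gives a rough estimate, assuming the latency figure is the time to first token and the speed figure is the generation rate after that point; the page does not spell out either definition, so treat both as assumptions. The function name `estimated_total_seconds` and the 500-token response length are hypothetical.

```python
def estimated_total_seconds(output_tokens: int, tokens_per_second: float,
                            first_token_latency_s: float) -> float:
    """Rough end-to-end time: wait for the first token, then stream the rest."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency_s + output_tokens / tokens_per_second


# Two (speed, latency) pairs taken from the benchmark results above.
fast_run = (118.23, 4.15)
slow_run = (6.48, 17.10)

# Hypothetical response length, chosen only for illustration.
response_tokens = 500

for speed, latency in (fast_run, slow_run):
    total = estimated_total_seconds(response_tokens, speed, latency)
    print(f"{speed:7.2f} tok/s, {latency:5.2f}s latency -> ~{total:.1f}s total")
```

For short responses the first-token latency dominates the total, while for long responses the tokens-per-second figure matters more.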
Frequently Asked Questions
Is GLM-4.1v Thinking Flash API free?
GLM-4.1v Thinking Flash does not currently have a free API tier on LMSpeed. All 13 providers charge per token.
How much does GLM-4.1v Thinking Flash API cost?
GLM-4.1v Thinking Flash API pricing ranges from $0.0002 to $75.00 per million input tokens across 13 providers. IXIOCCAPI offers the cheapest rate at $0.0002/M. Output pricing varies by provider.
Which provider has the cheapest GLM-4.1v Thinking Flash API pricing?
The cheapest GLM-4.1v Thinking Flash API pricing is offered by IXIOCCAPI at $0.0002 per million input tokens. Compare all 13 providers above to find the best pricing per token for your use case.