LogoLMSpeed
  • Home
  • Free
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
首页交流 QQ 群:1034193296,欢迎中转站站长加入讨论 AI 最热话题、newapi、openclaw 等,获取最新测速动态与反馈支持。
Back to models
publish

GLM-4.1v Thinking Flash API Pricing & Performance

glm-4-1v-thinking-flash

Developer: Zhipu AI

Also known as

GLM-4.1V-Thinking-FlashZhipuAI/GLM-4.1V-Thinking-Flashglm-4.1v-thinking-flashzhipu/glm-4.1v-thinking-flash

GLM-4.1v Thinking Flash by Zhipu AI is available through 13 API providers on LMSpeed. Compare API pricing from $0.0002 to $75.00 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 96 tok/s.

Avg speed
87.46t/s
First token
7.46s
Total tests
230
Providers
13
Variants
15

Pricing Comparison

Compare GLM-4.1v Thinking Flash API pricing across 13 providers. Input prices range from $0.0002 to $75.00 per million input. IXIOCCAPI offers the lowest rate at $0.0002/M.

ProviderModel VariantInput ($/M)Output ($/M)Speed (t/s)
IXIOCCAPIChatGLMglm-4.1v-thinking-flash$0.0002$0.0002—
素墨APIChatGLMglm-4.1v-thinking-flash$0.010$0.010—
钱多多 APIChatGLMglm-4.1v-thinking-flash$0.200$0.200—
SWT-APIChatGLMglm-4.1v-thinking-flash$0.375$0.375—
柏拉图AIChatGLMglm-4.1v-thinking-flash$75.00$75.00—
毫秒APIChatGLMglm-4.1v-thinking-flash$75.00$75.00—
NewagiaiChatGLMglm-4.1v-thinking-flash$75.00$75.00—
天絮 APIChatGLMglm-4.1v-thinking-flash$75.00$75.00—
天絮 APIChatGLMzhipu/glm-4.1v-thinking-flash$75.00$75.00—
小豆包APIChatGLMglm-4.1v-thinking-flash$75.00$75.00—
Seamee APIChatGLMGLM-4.1V-Thinking-Flash$75.00$75.00—
Seamee APIChatGLMZhipuAI/GLM-4.1V-Thinking-Flash$75.00$75.00—
APDSMChatGLMzhipu/glm-4.1v-thinking-flash$75.00$75.0069.7 t/s

Pricing data from provider public APIs

API Speed Benchmarks by Provider

Compare speed and latency performance across all API providers.

Showing 1-3 of 3 providers

Most testedRecently testedA–Z
ProviderSpeedLatencyTests
智谱 AI智谱 AI

glm-4.1v-thinking-flash

95.69 tok/s
7.10s
5
AI Tools

zhipu/glm-4.1v-thinking-flash

87.68 tok/s
7.42s
220
APDSMAPDSM

zhipu/glm-4.1v-thinking-flash

69.74 tok/s
9.29s
5

Recent API Speed Tests

20 records

Latest benchmark results measuring API response speed and first-token latency.

TimeModelSpeedLatency
04/10/2026, 05:13
ChatGLMzhipu/glm-4.1v-thinking-flash
118.23 tok/s
4.15s
04/10/2026, 05:13
ChatGLMzhipu/glm-4.1v-thinking-flash
111.85 tok/s
8.40s
04/10/2026, 05:13
ChatGLMzhipu/glm-4.1v-thinking-flash
46.19 tok/s
8.41s
04/10/2026, 05:13
ChatGLMzhipu/glm-4.1v-thinking-flash
6.48 tok/s
17.10s
04/10/2026, 05:13
ChatGLMzhipu/glm-4.1v-thinking-flash
94.01 tok/s
12.79s
04/08/2026, 07:14
ChatGLMzhipu/glm-4.1v-thinking-flash
116.89 tok/s
2.19s
04/08/2026, 07:14
ChatGLMzhipu/glm-4.1v-thinking-flash
104.32 tok/s
6.37s
04/08/2026, 07:14
ChatGLMzhipu/glm-4.1v-thinking-flash
116.34 tok/s
7.16s
04/08/2026, 07:14
ChatGLMzhipu/glm-4.1v-thinking-flash
12.94 tok/s
9.20s
04/08/2026, 07:14
ChatGLMzhipu/glm-4.1v-thinking-flash
107.07 tok/s
2.56s
04/08/2026, 07:05
ChatGLMzhipu/glm-4.1v-thinking-flash
98.72 tok/s
8.97s
04/08/2026, 07:05
ChatGLMzhipu/glm-4.1v-thinking-flash
92.84 tok/s
7.52s
04/08/2026, 07:05
ChatGLMzhipu/glm-4.1v-thinking-flash
105.63 tok/s
7.86s
04/08/2026, 07:05
ChatGLMzhipu/glm-4.1v-thinking-flash
16.63 tok/s
5.25s
04/08/2026, 07:05
ChatGLMzhipu/glm-4.1v-thinking-flash
80.83 tok/s
4.63s
04/07/2026, 04:14
ChatGLMzhipu/glm-4.1v-thinking-flash
67.55 tok/s
5.53s
04/07/2026, 04:14
ChatGLMzhipu/glm-4.1v-thinking-flash
98.55 tok/s
9.47s
04/07/2026, 04:14
ChatGLMzhipu/glm-4.1v-thinking-flash
119.00 tok/s
6.59s
04/07/2026, 04:14
ChatGLMzhipu/glm-4.1v-thinking-flash
61.72 tok/s
5.17s
04/07/2026, 04:14
ChatGLMzhipu/glm-4.1v-thinking-flash
122.37 tok/s
2.89s

Frequently Asked Questions

Is GLM-4.1v Thinking Flash API free?
GLM-4.1v Thinking Flash does not currently have a free API tier on LMSpeed. All 13 providers charge per token.
How much does GLM-4.1v Thinking Flash API cost?
GLM-4.1v Thinking Flash API pricing ranges from $0.0002 to $75.00 per million input tokens across 13 providers. IXIOCCAPI offers the cheapest rate at $0.0002/M. Output pricing varies by provider.
Which provider has the cheapest GLM-4.1v Thinking Flash API pricing?
The cheapest GLM-4.1v Thinking Flash API pricing is offered by IXIOCCAPI at $0.0002 per million input tokens. Compare all 13 providers above to find the best pricing per token for your use case.

Alternatives & Similar Models

DeepSeek
DeepSeek V3deepseek-v3
12 shared providers
DeepSeek
DeepSeek R1deepseek-r1
12 shared providers
Qwen
Qwen3qwen3
12 shared providers
ChatGLM
GLM-4.7glm-4-7
12 shared providers
OpenAI
GPT-OSSgpt-oss
12 shared providers
ChatGLM
GLM-4.6glm-4-6
12 shared providers