GLM-Z1 Flash by Zhipu AI is available through 18 API providers on LMSpeed. Compare API pricing from $0.0060 to $75.00 per million input tokens across providers. Free API access is offered by 2 providers. In speed benchmarks, the fastest provider reaches 125 tok/s.
Zhipu AI GLM-Z1 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Also known as
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| 智谱 AI glm-z1-flash | 125.12 tok/s | 0.34s | 10 |
| GLM-Z1-Flash | 76.98 tok/s | 0.68s | 5 |
Showing 1–2 of 2 providers
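
As a rough way to read the speed and latency columns together, the sketch below estimates end-to-end response time as latency plus output length divided by throughput. It assumes the reported latency is the delay before the first token and that throughput stays constant during generation; neither is stated on this page. The `ProviderBenchmark` class and both function names are hypothetical, and the figures are copied from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderBenchmark:
    """One row of the benchmark table (hypothetical container)."""
    name: str
    tokens_per_second: float
    latency_seconds: float


# Values copied from the benchmark table on this page.
BENCHMARKS = [
    ProviderBenchmark("智谱 AI glm-z1-flash", 125.12, 0.34),
    ProviderBenchmark("GLM-Z1-Flash", 76.98, 0.68),
]


def estimated_response_seconds(bench: ProviderBenchmark, output_tokens: int) -> float:
    """Estimate total time: latency plus generation time at constant throughput.

    Assumption: latency means time to first token, which the page does not confirm.
    """
    return bench.latency_seconds + output_tokens / bench.tokens_per_second


def fastest_for(output_tokens: int, benchmarks=BENCHMARKS) -> ProviderBenchmark:
    """Return the provider with the lowest estimated total time."""
    return min(benchmarks, key=lambda b: estimated_response_seconds(b, output_tokens))


if __name__ == "__main__":
    for tokens in (50, 500, 2000):
        best = fastest_for(tokens)
        seconds = estimated_response_seconds(best, tokens)
        print(f"{tokens} tokens: {best.name} (~{seconds:.2f}s)")
```

With these two rows the first provider wins at every output length, since it has both the lower latency and the higher throughput. The comparison becomes more useful when one provider leads on latency and another on throughput.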
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
glm-4-7
Zhipu GLM-4.7 is a flagship GLM release from Zhipu AI with advanced Chinese-English reasoning, coding, and agent features.
glm-4-5-air
Zhipu AI GLM-4.5 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
glm-4-5
Zhipu AI GLM-4.5 is a 355B-parameter MoE agent foundation model that unifies reasoning, coding, and tool use with hybrid thinking modes and a 128K context window.
glm-4-5-flash
Zhipu AI GLM-4.5 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and are not official data.