GLM-4 FlashX by Zhipu AI is available through 6 API providers on LMSpeed. Compare API pricing from $0.0060 to $75.00 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 72 tok/s.
Zhipu AI GLM-4 FlashX is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| Chiban API glm-4-flashx | 71.76 tok/s | 0.33s | 8 |
| 智谱 AI GLM-4-FlashX | 65.32 tok/s | 0.43s | 10 |
| Seamee API GLM-4-FlashX | 64.39 tok/s | 0.47s | 5 |
| GLM-4-FlashX-250414 | 27.54 tok/s | 0.43s | 5 |

Showing 1–4 of 4 providers
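The figures in the table can be compared programmatically. The following is a minimal sketch, not anything LMSpeed provides: the `ProviderResult` class and the helper functions are hypothetical names introduced for illustration, and the response-time estimate assumes that latency is the wait before the first token and that the listed speed holds steadily afterwards.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderResult:
    """One row of the benchmark table (hypothetical container)."""
    name: str
    speed_tok_s: float  # output speed in tokens per second
    latency_s: float    # latency in seconds
    tests: int          # number of test runs behind the figures


# Values copied from the table above.
RESULTS = [
    ProviderResult("Chiban API glm-4-flashx", 71.76, 0.33, 8),
    ProviderResult("智谱 AI GLM-4-FlashX", 65.32, 0.43, 10),
    ProviderResult("Seamee API GLM-4-FlashX", 64.39, 0.47, 5),
    ProviderResult("GLM-4-FlashX-250414", 27.54, 0.43, 5),
]


def fastest(results):
    """Return the row with the highest listed speed."""
    return max(results, key=lambda r: r.speed_tok_s)


def estimated_seconds(result, output_tokens):
    """Rough time to receive a full response.

    Assumption: total time = latency + output_tokens / speed.
    """
    return result.latency_s + output_tokens / result.speed_tok_s


def weighted_mean_speed(results):
    """Mean speed across rows, weighted by each row's test count."""
    total_tests = sum(r.tests for r in results)
    return sum(r.speed_tok_s * r.tests for r in results) / total_tests


if __name__ == "__main__":
    # Rank providers by the estimated time for a 500-token response.
    for r in sorted(RESULTS, key=lambda r: estimated_seconds(r, 500)):
        print(f"{r.name}: ~{estimated_seconds(r, 500):.1f} s for 500 tokens")
```

Weighting by test count is one possible choice; rows backed by only a few tests may be less reliable, so treat any aggregate as indicative only.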
glm-4-7
Zhipu GLM-4.7 is a flagship GLM release from Zhipu AI with advanced Chinese-English reasoning, coding, and agent features.
glm-4-6
Zhipu AI GLM-4.6 builds on GLM-4.5 with a 200K context window, stronger real-world coding, advanced reasoning with tool use, and improved agentic performance for complex multi-step tasks.
glm-4-flash
Zhipu AI GLM-4 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
glm-5
Zhipu GLM-5 is Zhipu's flagship GLM-series model, with enhanced reasoning, agent capabilities, and strong performance on Chinese enterprise and coding scenarios.
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
glm-4-5-air
Zhipu AI GLM-4.5 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and are not official data.