GLM-4 is available through 49 API providers on LMSpeed. Compare API pricing from $0.050 to $75.00 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 74 tok/s.
Zhipu GLM-4 is a flagship bilingual model from Zhipu AI with strong Chinese-English reasoning, tool use, and long-context chat capabilities.
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| SiliconFlow THUDM/glm-4-9b-chat | 74.33 tok/s | 0.60s | 20 |
| Pro/THUDM/glm-4-9b-chat | 73.52 tok/s | 0.71s | 15 |
| THUDM/GLM-4-9B-0414 | 71.99 tok/s | 0.95s | 15 |
| zhipu/glm-4-9b | 57.34 tok/s | 0.63s | 75 |
| zhipu/glm-4-32b | 34.45 tok/s | 1.88s | 5 |
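
Speed and latency pull in different directions depending on response length, so it can help to fold them into one rough estimate. The sketch below assumes the latency column is the delay before the first token and that generation then proceeds at the listed speed; the page does not confirm either assumption. The names `Benchmark`, `estimated_seconds`, and `rank_by_estimated_time` are hypothetical and used for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Benchmark:
    model: str
    tokens_per_second: float
    latency_seconds: float
    tests: int


# Values copied from the benchmark table above.
BENCHMARKS = [
    Benchmark("SiliconFlow THUDM/glm-4-9b-chat", 74.33, 0.60, 20),
    Benchmark("Pro/THUDM/glm-4-9b-chat", 73.52, 0.71, 15),
    Benchmark("THUDM/GLM-4-9B-0414", 71.99, 0.95, 15),
    Benchmark("zhipu/glm-4-9b", 57.34, 0.63, 75),
    Benchmark("zhipu/glm-4-32b", 34.45, 1.88, 5),
]


def estimated_seconds(benchmark: Benchmark, output_tokens: int) -> float:
    """Rough response time: initial latency plus generation time.

    Assumption: latency is the time to the first token and the
    remaining tokens arrive at the listed steady speed.
    """
    return benchmark.latency_seconds + output_tokens / benchmark.tokens_per_second


def rank_by_estimated_time(benchmarks, output_tokens: int):
    """Return the benchmarks sorted from fastest to slowest estimate."""
    return sorted(benchmarks, key=lambda b: estimated_seconds(b, output_tokens))


if __name__ == "__main__":
    # Compare a short reply with a long one.
    for tokens in (10, 500):
        print(f"Estimated time for {tokens} output tokens:")
        for b in rank_by_estimated_time(BENCHMARKS, tokens):
            print(f"  {b.model}: {estimated_seconds(b, tokens):.2f}s ({b.tests} tests)")
```

Under these assumptions, latency dominates short replies and throughput dominates long ones, so the ordering can differ between the two cases. Entries with few tests, such as the 5-test row, are a weaker basis for any ranking.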
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
glm-5
Zhipu GLM-5 is Zhipu's flagship GLM-series model with enhanced reasoning, agent capabilities, and strong performance on Chinese enterprise and coding scenarios.
glm-4-7
Zhipu GLM-4.7 is a flagship GLM release from Zhipu AI with advanced Chinese-English reasoning, coding, and agent features.
gpt-5-4
OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.
deepseek-v3-2
DeepSeek V3.2 is an upgraded V3-series MoE model with stronger reasoning, coding, and math performance, widely available through OpenAI-compatible API relays.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and are not official data.