GLM-Z1 by Zhipu AI is available through 13 API providers on LMSpeed. Compare API pricing from $0.010 to $75.00 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 176 tok/s.
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| SiliconFlow THUDM/GLM-Z1-9B-0414 | 176.03 tok/s | 13.40s | 24 |
| GLM-Z1-0414 | 76.11 tok/s | 10.99s | 5 |
| zhipu/glm-z1-32b | 43.87 tok/s | 24.85s | 10 |
Showing 1–3 of 3 providers
deepseek-r1
DeepSeek R1 is a reasoning-focused language model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
gemini-2-5-pro
Google Gemini 2.5 Pro is Google's advanced multimodal model with a 1M-token context window, strong STEM reasoning, and native support for image, audio, and video understanding.
deepseek-v3
DeepSeek V3 is DeepSeek's flagship MoE language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
gpt-oss
GPT-OSS is an open-weight language model family designed for self-hosted inference, research, and cost-efficient alternatives to proprietary GPT-class models.
qwen3-coder
Alibaba Qwen3 Coder is a code-specialized variant in the Qwen series, optimized for code generation, debugging, and software development tasks.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and are not official data.