GLM-4 AirX by Zhipu AI is available through 8 API providers on LMSpeed. Compare API pricing from $0.180 to $6.71 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 95 tok/s.
Zhipu AI GLM-4 AirX is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Compare speed and latency across all API providers.
| Provider | Speed | Latency | Tests |
|---|---|---|---|
| AI98 glm-4-airx | 94.86 tok/s | 3.81 s | 5 |
gpt-5
OpenAI GPT-5 is OpenAI's frontier general-purpose model, with improved reasoning depth, coding reliability, and multimodal understanding for production assistants and agent workflows.
gpt-4o
OpenAI GPT-4o is a multimodal flagship model with fast text, vision, and audio understanding, optimized for real-time chat, coding assistants, and production API workloads.
gpt-4o-mini
OpenAI GPT-4o Mini is a compact language model in the GPT-4 series, optimized for low-latency responses and efficient inference.
claude-sonnet-4
Anthropic Claude Sonnet 4 balances speed and intelligence for coding, analysis, and enterprise automation, with strong instruction following and long-context performance.
gpt-5-mini
OpenAI GPT-5 Mini is a compact language model in the GPT-5 series, optimized for low-latency responses and efficient inference.
gpt-4-1-mini
OpenAI GPT-4.1 Mini is a compact language model in the GPT-4.1 series, optimized for low-latency responses and efficient inference.
Rankings are based on community-submitted tests and periodic health probes. They are advisory only and do not constitute official data.