Gemma 2 IT by Google is available through 18 API providers on LMSpeed. Compare API pricing from $0.010 to $199.80 per million input tokens across providers. Free API access is offered by 4 providers. Context window: 8,192. In speed benchmarks, the fastest provider reaches 45 tok/s.
Google Gemma 2 IT is an instruction-tuned open model in the Gemma 2 family, designed for efficient text generation on consumer hardware.
Также известна как
Сравните производительность по скорости и задержке у всех API-провайдеров.
| Провайдер | Скорость | Задержка | Тесты |
|---|---|---|---|
星见雅 API google/gemma-2-27b-it | 44.61 tok/s | 0.55s | 10 |
google/gemma-2-27b-it | 43.69 tok/s | 0.23s | 10 |
google/gemma-2-9b-it | 36.05 tok/s | 0.52s | 10 |
Показано 1–3 из 3 провайдеров
gpt-oss
GPT-OSS is an open-weight language model family designed for self-hosted inference, research, and cost-efficient alternatives to proprietary GPT-class models.
kimi-k2-5
Moonshot Kimi K2.5 is an open-weight multimodal agent model with native vision and text input, strong coding performance, and a 256K context window.
deepseek-v3-2
DeepSeek V3.2 is an upgraded V3-series MoE model with stronger reasoning, coding, and math performance, widely available through OpenAI-compatible API relays.
glm-5-1
Zhipu GLM-5.1 is a next-generation GLM model aimed at frontier reasoning, coding, and bilingual agent applications.
minimax-m2-5
MiniMax M2.5 is MiniMax's flagship text model for coding and agents, with SOTA-level programming and agentic performance, improved token efficiency, and fast high-TPS API deployment.
kimi-k2-thinking
Moonshot AI Kimi K2 Thinking is a reasoning model in the Kimi series, designed for complex reasoning, problem-solving, and analytical tasks.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.