Gemma 3 by Google is available through 0 API providers on LMSpeed. In speed benchmarks, the fastest provider reaches 50 tok/s.
Also known as
2.55s |
| 160 |
Showing 1-2 of 2 providers
deepseek-r1
DeepSeek R1 is a reasoning-focused language model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.
deepseek-v3
DeepSeek V3 is DeepSeek flagship MoE language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
gpt-oss
GPT-OSS is an open-weight language model family designed for self-hosted inference, research, and cost-efficient alternatives to proprietary GPT-class models.
qwen3
Alibaba Qwen3 is the Qwen family's flagship LLM series with dense and MoE variants, seamless thinking/non-thinking modes, and leading open-source performance in math, code, and agent tasks.
gemini-2-0-flash
Google Gemini 2.0 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
gpt-5-4
OpenAI GPT-5.4 extends the GPT-5 family with stronger instruction following, deeper tool use, and improved performance on coding, math, and long-document analysis.
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.