Gemini 2.0 Flash Thinking by Google is available through 19 API providers on LMSpeed. Compare API pricing from $0.016 to $547.50 per million input tokens across providers. In speed benchmarks, the fastest provider reaches 231 tok/s.
Также известна как
Сравните производительность по скорости и задержке у всех API-провайдеров.
| Провайдер | Скорость | Задержка | Тесты |
|---|---|---|---|
丸美小沐 gemini-2.0-flash-thinking-exp-01-21 | 230.82 tok/s | 7.47s | 5 |
gemini-2.0-flash-thinking-exp | 224.26 tok/s | 7.53s | 5 |
gemini-2.0-flash-thinking-exp-01-21 | 124.47 tok/s | 2.12s | 5 |
gemini-2.0-flash-thinking-exp-01-21 | 121.89 tok/s | 1.38s | 5 |
Показано 1–4 из 4 провайдеров
deepseek-r1
DeepSeek R1 is a reasoning-focused language model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.
gemini-2-5-pro
Google Gemini 2.5 Pro is Google advanced multimodal model with a 1M-token context window, strong STEM reasoning, and native support for images, audio, and video understanding.
deepseek-v3
DeepSeek V3 is DeepSeek flagship MoE language model with 671B total parameters, delivering strong performance in reasoning, coding, and multilingual tasks at competitive inference cost.
gemini-2-5-flash
Google Gemini 2.5 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
gpt-4o
OpenAI GPT-4o is a multimodal flagship model with fast text, vision, and audio understanding, optimized for real-time chat, coding assistants, and production API workloads.
gemini-2-0-flash
Google Gemini 2.0 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
Рейтинги основаны на тестах, предоставленных сообществом, и периодических зондах работоспособности. Носит рекомендательный характер, не является официальными данными.