Browse canonical models across providers with performance and coverage highlights.
A video generation model by MiniMax, producing short video clips from text or image prompts.
Input price
From $0.067/M
Avg speed
—
First token
—
Providers
14
Alibaba Qwen3.0 is a large language model in the Qwen series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price
From $0.0037/M
Avg speed
165 t/s
First token
1.77s
Providers
15
OpenAI GPT-4 is a large language model in the GPT-4 series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price
From $0.014/M
Avg speed
35 t/s
First token
0.74s
Providers
78
DeepSeek R1 MetaSearch is a reasoning model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.
Input price
—
Avg speed
—
First token
—
Providers
4
Ring Flash 2.0 is a fast and efficient language model, optimized for quick responses and high throughput.
Input price
From $0.140/M
Avg speed
107 t/s
First token
6.09s
Providers
12
Google Gemini 1.5 Flash 002 is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
Input price
From $0.150/M
Avg speed
155 t/s
First token
1.62s
Providers
5
Google Gemini 3.0 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
Input price
From $0.274/M
Avg speed
28 t/s
First token
20.33s
Providers
10
An open-weight image generation model by Black Forest Labs in the FLUX series, designed for development and experimentation.
Input price
From $0.0001/M
Avg speed
—
First token
—
Providers
21
MiniMax M2.1 is a large language model in the MiniMax series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price (+2 free)
From $0.0007/M
Avg speed
74 t/s
First token
4.19s
Providers
115
OpenAI GPT-5.3 is a language model in the GPT-5 series, offering general-purpose reasoning, code generation, and multimodal capabilities.
Input price
From $0.050/M
Avg speed
59 t/s
First token
2.57s
Providers
79
Alibaba Qwen3.6 Plus is a large language model in the Qwen series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price (+2 free)
From $0.0014/M
Avg speed
47 t/s
First token
24.53s
Providers
118
Xiaomi MiMo-V2-Flash is an open-source MoE language model with 309B total and 15B active parameters, optimized for fast reasoning and agentic workflows with up to 256K context.
Input price (+2 free)
From $0.010/M
Avg speed
107 t/s
First token
3.20s
Providers
57
Anthropic Claude 3 Opus is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.
Input price
From $1.29/M
Avg speed
—
First token
—
Providers
19
Google Gemini 1.5 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.
Input price
From $0.0005/M
Avg speed
200 t/s
First token
1.17s
Providers
24
BGE Large ZH V1.5 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.
Input price
From $0.0010/M
Avg speed
—
First token
—
Providers
26
Google Gemini Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.
Input price
From $0.014/M
Avg speed
—
First token
—
Providers
32
Google Gemini 2.0 Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.
Input price
From $0.014/M
Avg speed
62 t/s
First token
7.50s
Providers
18
Moonshot AI Kimi K2 Thinking Turbo is a reasoning model in the Kimi series, designed for complex reasoning, problem-solving, and analytical tasks.
Input price
From $1.10/M
Avg speed
—
First token
—
Providers
14
Google Gemini 2.0 Flash Lite is a lightweight and cost-efficient language model in the Gemini series, optimized for fast responses at reduced cost.
Input price
From $0.0090/M
Avg speed
167 t/s
First token
1.41s
Providers
42
Zhipu AI GLM-4.5V is a multimodal vision-language model in the GLM series, supporting both text and image understanding.
Input price
From $0.019/M
Avg speed
73 t/s
First token
6.15s
Providers
38
Google Gemini 2.0 Flash Lite 001 is a compact language model in the Gemini series, optimized for low-latency responses and efficient inference.
Input price
From $0.010/M
Avg speed
—
First token
—
Providers
22
Anthropic Claude 3 Sonnet is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.
Input price
From $0.411/M
Avg speed
—
First token
—
Providers
15
Anthropic Claude 3.7 Sonnet is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.
Input price
From $0.0010/M
Avg speed
45 t/s
First token
7.65s
Providers
51
Moonshot AI Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price (+4 free)
From $0.0007/M
Avg speed
150 t/s
First token
11.68s
Providers
200