Browse canonical models across providers with performance and coverage highlights.
Google Gemini Imagen is an image generation model, capable of producing images from text prompts.
Input price
From $0.014/M
Avg speed
—
First token
—
Providers
3
Zhipu AI GLM-4 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Input price
From $0.0097/M
Avg speed
66 t/s
First token
0.33s
Providers
29
An instruction-tuned language model by Upstage in the Solar series.
Input price
From $0.010/M
Avg speed
—
First token
—
Providers
9
WizardLM 2 8x22B is a large language model in the WizardLM series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price
From $0.986/M
Avg speed
—
First token
—
Providers
9
Huawei PanGu Pro MoE is a mixture-of-experts language model, offering advanced reasoning, code generation, and multimodal capabilities.
Input price
From $0.029/M
Avg speed
—
First token
—
Providers
9
Zhipu AI GLM-4 Plus is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.
Input price
From $0.0002/M
Avg speed
—
First token
—
Providers
19
Zhipu AI GLM-4.5 is a 355B-parameter MoE agent foundation model that unifies reasoning, coding, and tool use with hybrid thinking modes and a 128K context window.
Input price+2 free
From $0.0010/M
Avg speed
39 t/s
First token
7.22s
Providers
85
Zhipu AI GLM-4.6V Flash is a multimodal vision-language model in the GLM series, supporting both text and image understanding.
Input price+5 free
From $0.0010/M
Avg speed
111 t/s
First token
11.08s
Providers
35
TeleSpeechASR is a speech-to-text model, designed for accurate audio transcription and recognition.
Input price
From $0.0071/M
Avg speed
—
First token
—
Providers
19
Google Gemini 3.1 Pro is a large language model in the Gemini series, offering advanced reasoning, code generation, and multimodal capabilities.
Input price+2 free
From $0.0041/M
Avg speed
91 t/s
First token
15.81s
Providers
194
OpenAI GPT-5.4 Pro is a high-capability language model in the GPT-5 series, offering enhanced reasoning, code generation, and multimodal capabilities.
Input price
From $1.23/M
Avg speed
—
First token
—
Providers
64
Zhipu AI GLM-5.1 is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.
Input price+8 free
From $0.0001/M
Avg speed
45 t/s
First token
15.02s
Providers
189
Meta Llama 3.2 Nemoretriever 300m Embed v1 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.
Input price+1 free
From $0.0010/M
Avg speed
—
First token
—
Providers
20
Microsoft Phi 4 Multimodal Instruct is a multimodal instruction-tuned variant in the Phi series, optimized for following instructions and conversational tasks.
Input price+4 free
From $0.010/M
Avg speed
83 t/s
First token
0.39s
Providers
31
Zhipu AI GLM-4.7 FlashX is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Input price
From $0.210/M
Avg speed
45 t/s
First token
22.27s
Providers
7
An English embedding model by BAAI, designed for generating text embeddings for retrieval and similarity tasks.
Input price
From $0.010/M
Avg speed
—
First token
—
Providers
23
A language model by StepFun, offering general-purpose reasoning capabilities.
Input price
From $2.45/M
Avg speed
—
First token
—
Providers
6
AI21 Jamba 1.5 Large Instruct is a large instruction-tuned language model, optimized for following instructions and conversational tasks.
Input price+1 free
From $0.010/M
Avg speed
56 t/s
First token
0.29s
Providers
18
OpenAI GPT-4o Mini Search is a search-augmented language model in the GPT-4 series, integrating web retrieval to provide up-to-date answers.
Input price
From $0.0062/M
Avg speed
—
First token
—
Providers
36
Zhipu AI GLM-Z1 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Input price
From $0.068/M
Avg speed
53 t/s
First token
0.30s
Providers
12
Google Gemini Embedding 2 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.
Input price+1 free
From $0.0082/M
Avg speed
—
First token
—
Providers
33
Zhipu AI GLM-4.5 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.
Input price+3 free
From $0.0000/M
Avg speed
29 t/s
First token
17.14s
Providers
50
A high-resolution image generation model by Black Forest Labs in the FLUX.2 series, offering state-of-the-art quality and detail.
Input price+1 free
From $0.0055/M
Avg speed
—
First token
—
Providers
8
Google Gemini 1.0 Pro Vision is a multimodal vision-language model in the Gemini series, supporting both text and image understanding.
Input price
From $0.049/M
Avg speed
—
First token
—
Providers
3