Model Library

Browse canonical models across providers with performance and coverage highlights.

Visible models

301

Active models

301

Providers covered

554

Model variants

29836

Gemini Imagen

Google Gemini Imagen is an image generation model, capable of producing images from text prompts.

Input price

From $0.014/M

Avg speed

—

First token

—

Providers

GLM-4 Air

Zhipu AI GLM-4 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price

From $0.0097/M

Avg speed

66 t/s

First token

0.33s

Providers

Solar Instruct

An instruction-tuned language model by Upstage in the Solar series.

Input price

From $0.010/M

Avg speed

—

First token

—

Providers

WizardLM 2 8x22B

WizardLM 2 8x22B is a large language model in the WizardLM series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.986/M

Avg speed

—

First token

—

Providers

PanGu Pro MoE

Huawei PanGu Pro MoE is a mixture-of-experts language model, offering advanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.029/M

Avg speed

—

First token

—

Providers

GLM-4 Plus

Zhipu AI GLM-4 Plus is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.0002/M

Avg speed

—

First token

—

Providers

GLM-4.5

Zhipu AI GLM-4.5 is a 355B-parameter MoE agent foundation model that unifies reasoning, coding, and tool use with hybrid thinking modes and a 128K context window.

Input price+2 free

From $0.0010/M

Avg speed

39 t/s

First token

7.22s

Providers

GLM-4.6V Flash

Zhipu AI GLM-4.6V Flash is a multimodal vision-language model in the GLM series, supporting both text and image understanding.

Input price+5 free

From $0.0010/M

Avg speed

111 t/s

First token

11.08s

Providers

TeleSpeechASR

TeleSpeechASR is a speech-to-text model, designed for accurate audio transcription and recognition.

Input price

From $0.0071/M

Avg speed

—

First token

—

Providers

Gemini 3.1 Pro

Google Gemini 3.1 Pro is a large language model in the Gemini series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price+2 free

From $0.0041/M

Avg speed

91 t/s

First token

15.81s

Providers

194

GPT-5.4 Pro

OpenAI GPT-5.4 Pro is a high-capability language model in the GPT-5 series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $1.23/M

Avg speed

—

First token

—

Providers

GLM-5.1

Zhipu AI GLM-5.1 is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price+8 free

From $0.0001/M

Avg speed

45 t/s

First token

15.02s

Providers

189

Llama 3.2 Nemoretriever 300m Embed v1

Meta Llama 3.2 Nemoretriever 300m Embed v1 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price+1 free

From $0.0010/M

Avg speed

—

First token

—

Providers

Phi 4 Multimodal Instruct

Microsoft Phi 4 Multimodal Instruct is a multimodal instruction-tuned variant in the Phi series, optimized for following instructions and conversational tasks.

Input price+4 free

From $0.010/M

Avg speed

83 t/s

First token

0.39s

Providers

GLM-4.7 FlashX

Zhipu AI GLM-4.7 FlashX is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price

From $0.210/M

Avg speed

45 t/s

First token

22.27s

Providers

BGE Large EN V1.5

An English embedding model by BAAI, designed for generating text embeddings for retrieval and similarity tasks.

Input price

From $0.010/M

Avg speed

—

First token

—

Providers

Step3

A language model by StepFun, offering general-purpose reasoning capabilities.

Input price

From $2.45/M

Avg speed

—

First token

—

Providers

Jamba 1.5 Large Instruct

AI21 Jamba 1.5 Large Instruct is a large instruction-tuned language model, optimized for following instructions and conversational tasks.

Input price+1 free

From $0.010/M

Avg speed

56 t/s

First token

0.29s

Providers

GPT-4o Mini Search

OpenAI GPT-4o Mini Search is a search-augmented language model in the GPT-4 series, integrating web retrieval to provide up-to-date answers.

Input price

From $0.0062/M

Avg speed

—

First token

—

Providers

GLM-Z1 Air

Zhipu AI GLM-Z1 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price

From $0.068/M

Avg speed

53 t/s

First token

0.30s

Providers

Gemini Embedding 2

Google Gemini Embedding 2 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price+1 free

From $0.0082/M

Avg speed

—

First token

—

Providers

GLM-4.5 Flash

Zhipu AI GLM-4.5 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price+3 free

From $0.0000/M

Avg speed

29 t/s

First token

17.14s

Providers

FLUX.2 Max

A high-resolution image generation model by Black Forest Labs in the FLUX.2 series, offering state-of-the-art quality and detail.

Input price+1 free

From $0.0055/M

Avg speed

—

First token

—

Providers

Gemini 1.0 Pro Vision

Google Gemini 1.0 Pro Vision is a multimodal vision-language model in the Gemini series, supporting both text and image understanding.

Input price

From $0.049/M

Avg speed

—

First token

—

Providers

Model Library

GeminiGemini Imagen

ChatGLMGLM-4 Air

Solar Instruct

WizardLM 2 8x22B

PanGu Pro MoE

ChatGLMGLM-4 Plus

ChatGLMGLM-4.5

ChatGLMGLM-4.6V Flash

TeleSpeechASR

GeminiGemini 3.1 Pro

OpenAIGPT-5.4 Pro

ChatGLMGLM-5.1

MetaAILlama 3.2 Nemoretriever 300m Embed v1

Phi 4 Multimodal Instruct

ChatGLMGLM-4.7 FlashX

BGE Large EN V1.5

StepfunStep3

Jamba 1.5 Large Instruct

OpenAIGPT-4o Mini Search

ChatGLMGLM-Z1 Air

GeminiGemini Embedding 2

ChatGLMGLM-4.5 Flash

FLUX.2 Max

GeminiGemini 1.0 Pro Vision

Gemini Imagen

GLM-4 Air

GLM-4 Plus

GLM-4.5

GLM-4.6V Flash

Gemini 3.1 Pro

GPT-5.4 Pro

GLM-5.1

Llama 3.2 Nemoretriever 300m Embed v1

GLM-4.7 FlashX

Step3

GPT-4o Mini Search

GLM-Z1 Air

Gemini Embedding 2

GLM-4.5 Flash

Gemini 1.0 Pro Vision