Model Library

Browse canonical models across providers with performance and coverage highlights.

Visible models

301

Active models

301

Providers covered

554

Model variants

29885

MiMo-V2-Omni

Xiaomi MiMo-V2-Omni is the omnimodal model in the V2 series on the Xiaomi MiMo API platform, supporting text, image, video, and audio understanding within a unified architecture. Pricing: 1x token consumption (baseline).

Input price+2 free

From $0.014/M

Avg speed

83 t/s

First token

3.43s

Providers

GPT-3.5 Net

OpenAI GPT-3.5 Net is a language model in the GPT-3.5 series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $10.27/M

Avg speed

—

First token

—

Providers

MiniMax M2.5 HighSpeed

MiniMax M2.5 HighSpeed is a fast and efficient language model in the MiniMax series, optimized for quick responses and high throughput.

Input price

From $0.0001/M

Avg speed

59 t/s

First token

6.38s

Providers

DeepSeek Prover v2

DeepSeek Prover v2 is a reasoning model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.

Input price+1 free

From $1.00/M

Avg speed

—

First token

—

Providers

Gemini Pro Vision

Google Gemini Pro Vision is a multimodal vision-language model in the Gemini series, supporting both text and image understanding.

Input price

From $0.274/M

Avg speed

—

First token

—

Providers

GLM-4V Flash

Zhipu AI GLM-4V Flash is a multimodal vision-language model in the GLM series, supporting both text and image understanding.

Input price+1 free

From $0.010/M

Avg speed

56 t/s

First token

0.62s

Providers

Gemini Live 2.5 Flash

Google Gemini Live 2.5 Flash is a realtime audio model in the Gemini series, supporting low-latency speech and conversational interactions.

Input price

From $1.47/M

Avg speed

—

First token

—

Providers

Colosseum Instruct

Colosseum Instruct is an instruction-tuned language model, optimized for following instructions and conversational tasks.

Input price

From $0.010/M

Avg speed

—

First token

—

Providers

Arctic Embed L

Arctic Embed L is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price+1 free

From $0.010/M

Avg speed

—

First token

—

Providers

Nova Premier v1

A multimodal vision-language model by Amazon in the Nova series.

Input price

From $5.00/M

Avg speed

—

First token

—

Providers

GLM-4.1v Thinking FlashX

Zhipu AI GLM-4.1v Thinking FlashX is a reasoning model in the GLM series, designed for complex reasoning, problem-solving, and analytical tasks.

Input price

From $0.016/M

Avg speed

—

First token

—

Providers

Qwen3.5 Max

Alibaba Qwen3.5 Max is a high-capability language model in the Qwen series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price+1 free

From $0.086/M

Avg speed

31 t/s

First token

29.33s

Providers

Qwen3.5 Plus Thinking

Alibaba Qwen3.5 Plus Thinking is a reasoning-focused variant in the Qwen series, designed for complex reasoning and problem-solving tasks.

Input price+1 free

From $0.010/M

Avg speed

—

First token

—

Providers

Phi 3.5 MoE Instruct

Microsoft Phi 3.5 MoE Instruct is a mixture-of-experts instruction-tuned variant in the Phi series, optimized for following instructions and conversational tasks.

Input price+2 free

From $0.010/M

Avg speed

—

First token

—

Providers

GLM-4.5 X

Zhipu AI GLM-4.5 X is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.163/M

Avg speed

69 t/s

First token

12.39s

Providers

Qwen3.5 Flash

A fast and efficient language model by Alibaba in the Qwen 3.5 series.

Input price+2 free

From $0.010/M

Avg speed

115 t/s

First token

7.92s

Providers

Claude 3 Haiku

Anthropic Claude 3 Haiku is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.021/M

Avg speed

—

First token

—

Providers

Granite 4.0 H Micro

IBM Granite 4.0 H Micro is a compact language model in the Granite series, optimized for quick responses and high throughput.

Input price

From $0.034/M

Avg speed

—

First token

—

Providers

Claude Opus 4.6 Max

Anthropic Claude Opus 4.6 Max is a high-capability language model in the Claude series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.0014/M

Avg speed

—

First token

—

Providers

Italia Instruct

Italia Instruct is an instruction-tuned language model, optimized for following instructions and conversational tasks.

Input price

From $0.010/M

Avg speed

36 t/s

First token

0.49s

Providers

MiniMax M2.1 HighSpeed

MiniMax M2.1 HighSpeed is a fast and efficient language model in the MiniMax series, optimized for quick responses and high throughput.

Input price

From $0.575/M

Avg speed

—

First token

—

Providers

Kimi K2 Turbo

Moonshot AI Kimi K2 Turbo is a fast and efficient language model in the Kimi series, optimized for quick responses and high throughput.

Input price

From $1.10/M

Avg speed

83 t/s

First token

3.73s

Providers

GLM-4 Flash

Zhipu AI GLM-4 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price+7 free

From $0.0000/M

Avg speed

31 t/s

First token

0.92s

Providers

DeepSeek Coder Instruct

DeepSeek Coder Instruct is a code-specialized variant in the DeepSeek series, optimized for code generation, debugging, and software development tasks.

Input price+1 free

From $0.010/M

Avg speed

—

First token

—

Providers

Model Library

MiMo-V2-Omni

OpenAIGPT-3.5 Net

MinimaxMiniMax M2.5 HighSpeed

DeepSeekDeepSeek Prover v2

GeminiGemini Pro Vision

ChatGLMGLM-4V Flash

GeminiGemini Live 2.5 Flash

Colosseum Instruct

Arctic Embed L

Nova Premier v1

ChatGLMGLM-4.1v Thinking FlashX

QwenQwen3.5 Max

QwenQwen3.5 Plus Thinking

Phi 3.5 MoE Instruct

ChatGLMGLM-4.5 X

QwenQwen3.5 Flash

ClaudeClaude 3 Haiku

Granite 4.0 H Micro

ClaudeClaude Opus 4.6 Max

Italia Instruct

MinimaxMiniMax M2.1 HighSpeed

MoonshotAIKimi K2 Turbo

ChatGLMGLM-4 Flash

DeepSeekDeepSeek Coder Instruct

GPT-3.5 Net

MiniMax M2.5 HighSpeed

DeepSeek Prover v2

Gemini Pro Vision

GLM-4V Flash

Gemini Live 2.5 Flash

GLM-4.1v Thinking FlashX

Qwen3.5 Max

Qwen3.5 Plus Thinking

GLM-4.5 X

Qwen3.5 Flash

Claude 3 Haiku

Claude Opus 4.6 Max

MiniMax M2.1 HighSpeed

Kimi K2 Turbo

GLM-4 Flash

DeepSeek Coder Instruct