LMSpeed

The best API speed test tool

© 2026 LMSpeed. All Rights Reserved. Made by Nexmoe with ❤️

Model Library

Browse canonical models across providers with performance and coverage highlights.

Visible models
301
Active models
301
Providers covered
554
Model variants
29836
Showing 121-144 of 301 models

MiniMax Hailuo 02

A video generation model by MiniMax, producing short video clips from text or image prompts.

Input price

From $0.067/M

Avg speed

—

First token

—

Providers

14

Qwen3.0

Alibaba Qwen3.0 is a large language model in the Qwen series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.0037/M

Avg speed

165 t/s

First token

1.77s

Providers

15

GPT-4

OpenAI GPT-4 is a large language model in the GPT-4 series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.014/M

Avg speed

35 t/s

First token

0.74s

Providers

78

DeepSeek R1 MetaSearch

DeepSeek R1 MetaSearch is a reasoning model in the DeepSeek series, designed for complex reasoning, problem-solving, and analytical tasks.

Input price

—

Avg speed

—

First token

—

Providers

4

Ring Flash 2.0

Ring Flash 2.0 is a fast and efficient language model, optimized for quick responses and high throughput.

Input price

From $0.140/M

Avg speed

107 t/s

First token

6.09s

Providers

12

Gemini 1.5 Flash 002

Google Gemini 1.5 Flash 002 is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.150/M

Avg speed

155 t/s

First token

1.62s

Providers

5

Gemini 3.0 Flash

Google Gemini 3.0 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.274/M

Avg speed

28 t/s

First token

20.33s

Providers

10

FLUX Dev

An open-weight image generation model by Black Forest Labs in the FLUX series, designed for development and experimentation.

Input price

From $0.0001/M

Avg speed

—

First token

—

Providers

21

MiniMax M2.1

MiniMax M2.1 is a large language model in the MiniMax series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price (+2 free)

From $0.0007/M

Avg speed

74 t/s

First token

4.19s

Providers

115

GPT-5.3

OpenAI GPT-5.3 is a language model in the GPT-5 series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.050/M

Avg speed

59 t/s

First token

2.57s

Providers

79

Qwen3.6 Plus

Alibaba Qwen3.6 Plus is a large language model in the Qwen series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price (+2 free)

From $0.0014/M

Avg speed

47 t/s

First token

24.53s

Providers

118

MiMo-V2-Flash

Xiaomi MiMo-V2-Flash is an open-source MoE language model with 309B total and 15B active parameters, optimized for fast reasoning and agentic workflows with up to 256K context.

Input price (+2 free)

From $0.010/M

Avg speed

107 t/s

First token

3.20s

Providers

57

Claude 3 Opus

Anthropic Claude 3 Opus is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $1.29/M

Avg speed

—

First token

—

Providers

19

Gemini 1.5 Flash

Google Gemini 1.5 Flash is a fast and efficient language model in the Gemini series, optimized for quick responses and high throughput.

Input price

From $0.0005/M

Avg speed

200 t/s

First token

1.17s

Providers

24

BGE Large ZH V1.5

BGE Large ZH V1.5 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price

From $0.0010/M

Avg speed

—

First token

—

Providers

26

Gemini Pro

Google Gemini Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.014/M

Avg speed

—

First token

—

Providers

32

Gemini 2.0 Pro

Google Gemini 2.0 Pro is a high-capability language model in the Gemini series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.014/M

Avg speed

62 t/s

First token

7.50s

Providers

18

Kimi K2 Thinking Turbo

Moonshot AI Kimi K2 Thinking Turbo is a reasoning model in the Kimi series, designed for complex reasoning, problem-solving, and analytical tasks.

Input price

From $1.10/M

Avg speed

—

First token

—

Providers

14

Gemini 2.0 Flash Lite

Google Gemini 2.0 Flash Lite is a lightweight and cost-efficient language model in the Gemini series, optimized for fast responses at reduced cost.

Input price

From $0.0090/M

Avg speed

167 t/s

First token

1.41s

Providers

42

GLM-4.5V

Zhipu AI GLM-4.5V is a multimodal vision-language model in the GLM series, supporting both text and image understanding.

Input price

From $0.019/M

Avg speed

73 t/s

First token

6.15s

Providers

38

Gemini 2.0 Flash Lite 001

Google Gemini 2.0 Flash Lite 001 is a compact language model in the Gemini series, optimized for low-latency responses and efficient inference.

Input price

From $0.010/M

Avg speed

—

First token

—

Providers

22

Claude 3 Sonnet

Anthropic Claude 3 Sonnet is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.411/M

Avg speed

—

First token

—

Providers

15

Claude 3.7 Sonnet

Anthropic Claude 3.7 Sonnet is a language model in the Claude series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.0010/M

Avg speed

45 t/s

First token

7.65s

Providers

51

Kimi K2.5

Moonshot AI Kimi K2.5 is a large language model in the Kimi series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price (+4 free)

From $0.0007/M

Avg speed

150 t/s

First token

11.68s

Providers

200
