LMSpeed

The best API speed test tool

© 2026 LMSpeed. All Rights Reserved. Made by Nexmoe with ❤️

Model Library

Browse canonical models across providers with performance and coverage highlights.

Visible models
301
Active models
301
Providers covered
554
Model variants
29836
Showing 97-120 of 301 models

Gemini Imagen

Google Gemini Imagen is an image generation model, capable of producing images from text prompts.

Input price

From $0.014/M

Avg speed

—

First token

—

Providers

3

GLM-4 Air

Zhipu AI GLM-4 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price

From $0.0097/M

Avg speed

66 t/s

First token

0.33s

Providers

29

Solar Instruct

An instruction-tuned language model by Upstage in the Solar series.

Input price

From $0.010/M

Avg speed

—

First token

—

Providers

9

WizardLM 2 8x22B

WizardLM 2 8x22B is a large language model in the WizardLM series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.986/M

Avg speed

—

First token

—

Providers

9

PanGu Pro MoE

Huawei PanGu Pro MoE is a mixture-of-experts language model, offering advanced reasoning, code generation, and multimodal capabilities.

Input price

From $0.029/M

Avg speed

—

First token

—

Providers

9

GLM-4 Plus

Zhipu AI GLM-4 Plus is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price

From $0.0002/M

Avg speed

—

First token

—

Providers

19

GLM-4.5

Zhipu AI GLM-4.5 is a 355B-parameter MoE agent foundation model that unifies reasoning, coding, and tool use with hybrid thinking modes and a 128K context window.

Input price (+2 free)

From $0.0010/M

Avg speed

39 t/s

First token

7.22s

Providers

85

GLM-4.6V Flash

Zhipu AI GLM-4.6V Flash is a multimodal vision-language model in the GLM series, supporting both text and image understanding.

Input price (+5 free)

From $0.0010/M

Avg speed

111 t/s

First token

11.08s

Providers

35

TeleSpeechASR

TeleSpeechASR is a speech-to-text model, designed for accurate audio transcription and recognition.

Input price

From $0.0071/M

Avg speed

—

First token

—

Providers

19

Gemini 3.1 Pro

Google Gemini 3.1 Pro is a large language model in the Gemini series, offering advanced reasoning, code generation, and multimodal capabilities.

Input price (+2 free)

From $0.0041/M

Avg speed

91 t/s

First token

15.81s

Providers

194

GPT-5.4 Pro

OpenAI GPT-5.4 Pro is a high-capability language model in the GPT-5 series, offering enhanced reasoning, code generation, and multimodal capabilities.

Input price

From $1.23/M

Avg speed

—

First token

—

Providers

64

GLM-5.1

Zhipu AI GLM-5.1 is a language model in the GLM series, offering general-purpose reasoning, code generation, and multimodal capabilities.

Input price (+8 free)

From $0.0001/M

Avg speed

45 t/s

First token

15.02s

Providers

189

Llama 3.2 Nemoretriever 300m Embed v1

Meta Llama 3.2 Nemoretriever 300m Embed v1 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price (+1 free)

From $0.0010/M

Avg speed

—

First token

—

Providers

20

Phi 4 Multimodal Instruct

Microsoft Phi 4 Multimodal Instruct is a multimodal instruction-tuned variant in the Phi series, optimized for following instructions and conversational tasks.

Input price (+4 free)

From $0.010/M

Avg speed

83 t/s

First token

0.39s

Providers

31

GLM-4.7 FlashX

Zhipu AI GLM-4.7 FlashX is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price

From $0.210/M

Avg speed

45 t/s

First token

22.27s

Providers

7

BGE Large EN V1.5

An English embedding model by BAAI, designed for generating text embeddings for retrieval and similarity tasks.

Input price

From $0.010/M

Avg speed

—

First token

—

Providers

23

Step3

A language model by StepFun, offering general-purpose reasoning capabilities.

Input price

From $2.45/M

Avg speed

—

First token

—

Providers

6

Jamba 1.5 Large Instruct

AI21 Jamba 1.5 Large Instruct is a large instruction-tuned language model, optimized for following instructions and conversational tasks.

Input price (+1 free)

From $0.010/M

Avg speed

56 t/s

First token

0.29s

Providers

18

GPT-4o Mini Search

OpenAI GPT-4o Mini Search is a search-augmented language model in the GPT-4 series, integrating web retrieval to provide up-to-date answers.

Input price

From $0.0062/M

Avg speed

—

First token

—

Providers

36

GLM-Z1 Air

Zhipu AI GLM-Z1 Air is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price

From $0.068/M

Avg speed

53 t/s

First token

0.30s

Providers

12

Gemini Embedding 2

Google Gemini Embedding 2 is an embedding model, designed for generating vector representations of text for retrieval and semantic search.

Input price (+1 free)

From $0.0082/M

Avg speed

—

First token

—

Providers

33

GLM-4.5 Flash

Zhipu AI GLM-4.5 Flash is a fast and efficient language model in the GLM series, optimized for quick responses and high throughput.

Input price (+3 free)

From $0.0000/M

Avg speed

29 t/s

First token

17.14s

Providers

50

FLUX.2 Max

A high-resolution image generation model by Black Forest Labs in the FLUX.2 series, offering state-of-the-art quality and detail.

Input price (+1 free)

From $0.0055/M

Avg speed

—

First token

—

Providers

8

Gemini 1.0 Pro Vision

Google Gemini 1.0 Pro Vision is a multimodal vision-language model in the Gemini series, supporting both text and image understanding.

Input price

From $0.049/M

Avg speed

—

First token

—

Providers

3
