LogoLMSpeed
  • Home
  • Free
  • Categories
  • Models
  • Docs
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2025 LMSpeed All Rights Reserved.
BACK TO INDEX
G

GPT Load

Website
Updated 12/8/2025
GPT Load interface preview
Performance Stats
Avg Speed
1075.63t/s
Latency
1.12s
Total Tests
55
Models
10

About GPT Load

An intelligent load balancing platform for managing and distributing API requests to multiple AI providers.

MetaAILlama 4MetaAILlama 3.3OpenAIgpt-oss

GPT Load is an open-source intelligent load balancing platform designed to manage and distribute API requests across multiple AI providers. It provides a unified interface for accessing various AI models, helping developers optimize performance and reliability. Key features include load balancing, failover handling, and request routing. The platform supports integration with different AI APIs, allowing users to switch between providers seamlessly. Typical use cases include AI application development, API management, and ensuring high availability for AI services. The project is available on GitHub under the MIT license.

Supported Models

ModelSpeedLatencyTests
llama3.1-8b
2191.20 t/s
0.35s
10
llama-4-scout-17b-16e-instruct
1372.80 t/s
0.36s
5
llama-3.3-70b
1062.69 t/s
0.51s
5
llama-4-maverick-17b-128e-instruct
1052.78 t/s
0.41s
5
qwen-3-coder-480b
894.38 t/s
0.35s
5
gpt-oss-120b
846.32 t/s
0.70s
5
qwen-3-235b-a22b-instruct-2507
754.92 t/s
0.45s
5
qwen-3-32b
705.04 t/s
0.40s
5
qwen-3-235b-a22b-thinking-2507
579.82 t/s
0.44s
5
models/gemini-2.5-flash
180.81 t/s
7.98s
5

Recent Test Records

TimeModelSpeedLatency
Sep 21, 06:22 PMllama3.1-8b
2264.49 t/s
0.35s
Sep 21, 06:21 PMllama-4-maverick-17b-128e-instruct
1052.78 t/s
0.41s
Sep 21, 06:21 PMllama-4-scout-17b-16e-instruct
1372.80 t/s
0.36s
Sep 21, 06:19 PMllama-3.3-70b
1062.69 t/s
0.51s
Sep 21, 06:18 PMqwen-3-235b-a22b-thinking-2507
579.82 t/s
0.44s
Sep 21, 06:18 PMqwen-3-coder-480b
894.38 t/s
0.35s
Sep 21, 06:17 PMllama3.1-8b
2117.91 t/s
0.34s
Sep 21, 06:16 PMqwen-3-32b
705.04 t/s
0.40s
Sep 21, 06:16 PMgpt-oss-120b
846.32 t/s
0.70s
Sep 21, 06:14 PMqwen-3-235b-a22b-instruct-2507
754.92 t/s
0.45s