LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Input Price
  • Output Price
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️

GPT Load (Shiho)

GPT Load (Shiho) is an OpenAI-compatible API load balancing service hosted at gpt-load.shiho.top, distributing requests across multiple AI model providers for improved reliability.

OpenAIGPT-OSSMetaAILlama3 1MetaAILlama 3.3MetaAILlama 4 Maverick 128e Instruct

GPT Load (Shiho) offers 22 LLM API models.

Speed benchmark average: 1164 tok/s.

GPT Load (Shiho) interface preview
OverviewPerformance22HealthEmbed
Avg Speed1164.47 tok/s
Latency0.41 s
Updated4/29/2026
Created At12/7/2025
Website

API Endpoints

  • gpt-load.shiho.top

Data as of Apr 29, 2026, 05:51 AM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.

About GPT Load (Shiho)

Health Check

100%Recent availability
History (72 pts)
PastNow

API Speed Benchmarks

ModelSpeedLatencyTests
OpenAI
481.90 tok/s
0.43s
5
MetaAI
1629.33 tok/s
0.36s
25
MetaAI

Recent Test Records

TimeModelSpeedLatency
Jun 3, 06:58 PM
OpenAIopenai/gpt-oss-120b
481.90 tok/s
0.43s
Dec 25, 02:06 PM
MetaAIllama3.1-8b
2142.09 tok/s
0.19s
Dec 25, 02:02 PM
MetaAIllama-3.3-70b
1374.74 tok/s
0.25s
Sep 21, 06:22 PM
MetaAIllama3.1-8b
1834.10 tok/s
0.35s
Sep 21, 06:21 PM
MetaAIllama-4-maverick-17b-128e-instruct
825.70 tok/s
0.41s
Sep 21, 06:19 PM
MetaAIllama-3.3-70b
890.30 tok/s
0.51s
Sep 21, 06:18 PM
Qwenqwen-3-235b-a22b-thinking-2507
579.82 tok/s
0.44s
Sep 21, 06:17 PM
MetaAIllama3.1-8b
2117.91 tok/s
0.34s
Jun 8, 11:00 PM
MetaAIllama3.1-8b
806.21 tok/s
0.47s
Jun 8, 10:59 PM
MetaAIllama-3.3-70b
794.97 tok/s
0.52s
984.90 tok/s
0.45s
20
MetaAIllama-4-maverick-17b-128e-instruct
825.70 tok/s
0.41s
5
Qwenqwen-3-235b-a22b-thinking-2507
579.82 tok/s
0.44s
5
View all 22 models
openai/gpt-oss-120b
llama3.1-8b
llama-3.3-70b

Similar API Providers to Compare

IXIOCCAPI

newapi.ixio.cc

IXIOCCAPI is a unified API gateway for large language models, providing standardized endpoints for accessing multiple AI model providers.

10 shared models

天絮 API

chat-api4.087654.xyz

天絮 API provides an AI model relay service with multiple access points and stable connectivity.

10 shared models

SSynapse

newapi.exynos.top:8443

Synapse is an OpenAI-compatible API relay service providing access to multiple AI models with unified endpoints.

9 shared models

Seamee API

napi.seaya.link

Seamee API provides an AI model relay for accessing multiple LLMs through OpenAI-compatible endpoints.

9 shared models

素墨API

apifree.rensumo.top

A non-profit AI infrastructure offering free access to integrated large language models with privacy-focused, no-logging policies.

9 shared models

HHotaruAPI

api.hotaruapi.top

HotaruAPI provides API access to AI models for developers, including a model marketplace and self-service options.

8 shared models