GPT Load (Shiho) is an OpenAI-compatible API load balancing service hosted at gpt-load.shiho.top, distributing requests across multiple AI model providers for improved reliability.
GPT Load (Shiho) offers 22 LLM API models.
Speed benchmark average: 1164 tok/s.

gpt-load.shiho.topRankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.