LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Input Price
  • Output Price
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️

TokenPony

An AI model aggregation platform providing unified API access to multiple large language models with cost optimization features.

DeepSeekDeepSeek V3.2HunyuanHunyuan InstructMoonshotAIKimi K2 InstructQwenQwen3

TokenPony offers 4 LLM API models.

Speed benchmark average: 85 tok/s.

TokenPony interface preview
OverviewPerformance4HealthEmbed
Avg Speed84.62 tok/s
Latency4.74 s
Updated4/19/2026
Created At12/7/2025
Website

API Endpoints

  • api.tokenpony.cn

About TokenPony

Health Check

100%Recent availability
History (72 pts)
PastNow

API Speed Benchmarks

ModelSpeedLatencyTests
DeepSeekdeepseek-v3.2-exp
39.89 tok/s
0.58s
5
Hunyuanhunyuan-a13b-instruct
143.03 tok/s
3.85s
5
MoonshotAI
13.20 tok/s
1.09s
5
Qwen
103.87 tok/s
7.64s
15

Recent Test Records

TimeModelSpeedLatency
Nov 14, 10:21 AM
DeepSeekdeepseek-v3.2-exp
39.89 tok/s
0.58s
Nov 3, 12:17 PM
Hunyuanhunyuan-a13b-instruct
143.03 tok/s
3.85s
Sep 23, 04:24 AM
MoonshotAIkimi-k2-instruct-0905
13.20 tok/s
1.09s
Sep 23, 03:44 AM
Qwenqwen3-8b
150.65 tok/s
5.56s
Aug 12, 12:34 PM
Qwenqwen3-8b
80.04 tok/s
9.19s
Jun 11, 02:54 AM
Qwenqwen3-8b
80.91 tok/s
8.15s
kimi-k2-instruct-0905
qwen3-8b

Data as of Apr 19, 2026, 04:05 AM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.

Similar API Providers to Compare

Seamee API

napi.seaya.link

Seamee API provides an AI model relay for accessing multiple LLMs through OpenAI-compatible endpoints.

6 shared models

SSynapse

newapi.exynos.top:8443

Synapse is an OpenAI-compatible API relay service providing access to multiple AI models with unified endpoints.

6 shared models

ZEN-AI VIP

vip.zen-ai.top

Provides API relay services for AI models including Azure OpenAI, Gemini, Claude, and Grok with flexible resource grouping and multi-cloud routing.

6 shared models

钱多多 API

api2.aigcbest.top

Provides AI-generated content APIs for various applications, including text and image generation.

6 shared models

天絮 API

chat-api4.087654.xyz

天絮 API provides an AI model relay service with multiple access points and stable connectivity.

6 shared models

SiliconFlow

cloud.siliconflow.cn

Provides cost-effective generative AI cloud services based on open-source models for text, image, video, and audio generation.

5 shared models