LogoLMSpeed
  • Home
  • Free
  • Models
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
  • Input Price
  • Output Price
  • Reasoning
  • Coding
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️

QWQ Chat API

QWQ Chat API appears to be an OpenAI-compatible API gateway at api.qwq.chat. The public site is currently protected by a Cloudflare challenge.

QWQ Chat API offers 13 LLM API models.

API pricing per token ranges from $0.140 to $5.00/M (input).

Speed benchmark average: 92 tok/s.

OverviewPerformance4Pricing13HealthEmbed
Avg Speed92.07 tok/s
Latency8.70 s
Updated6/2/2026
Created At8/13/2025
Recharge Rate¥7.30 per $1 quota

Features

TaskData Export
Website

API Endpoints

  • Endpoint 1
    https://qwq.chat
  • Endpoint 2
    https://api.qwq.chat

Supported Models

ModelSpeedLatencyTests
OpenAI(NeoJ)gpt-4.1
83.90 tok/s
11.30s
20
OpenAI(NeoJ)gpt-4o
80.18 tok/s
7.56s
15
Gemini(gold)gemini-2.5-flash
158.80 tok/s
9.01s
5
OpenAI(ynpl)gpt-4o-mini
93.74 tok/s
1.38s
5

Data as of Jun 2, 2026, 04:01 PM·Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.