LogoLMSpeed
  • Home
  • Free
  • Providers
  • Docs
LogoLMSpeed
LogoLMSpeed

The best API speed test tool

GitHubGitHubTwitterX (Twitter)Email
Product
  • Features
  • Pricing
  • FAQ
Leaderboard
  • Overview
  • Speed Ranking
  • Latency Ranking
  • Health Ranking
Models
  • All Models
  • GPT
  • Claude
  • Gemini
  • DeepSeek
  • Llama
  • Qwen
Free Models
  • All Free Models
  • Free GPT
  • Free Claude
  • Free Gemini
  • Free DeepSeek
  • Free Llama
  • Free Qwen
Resources
  • Speed Test
  • Provider Directory
  • Documentation
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 LMSpeed All Rights Reserved.Made by Nexmoe with ❤️
首页交流 QQ 群:1034193296,欢迎中转站站长加入讨论 AI 最热话题、newapi、openclaw 等,获取最新测速动态与反馈支持。
Back to models
pending

Qwen3 VL Thinking API Pricing & Performance

qwen3-vl-thinking

Developer: Alibaba

Also known as

Qwen/Qwen3-VL-235B-A22B-ThinkingQwen/Qwen3-VL-30B-A3B-ThinkingQwen/Qwen3-VL-32B-ThinkingQwen/Qwen3-VL-8B-ThinkingQwen3-VL-235B-A22B-thinking

Qwen3 VL Thinking by Alibaba is available through 77 API providers on LMSpeed. Compare API pricing from $0.010 to $75.00 per million input tokens across providers. Free API access is offered by 1 provider. In speed benchmarks, the fastest provider reaches 52 tok/s.

Avg speed
39.67t/s
First token
18.52s
Total tests
74
Providers
27
Variants
83

Pricing Comparison

Compare Qwen3 VL Thinking API pricing across 76 providers. Input prices range from $0.010 to $75.00 per million input. 素墨API offers the lowest rate at $0.010/M.

ProviderModel VariantInput ($/M)Output ($/M)Speed (t/s)
算了么 APIQwenQwen/Qwen3-VL-32B-ThinkingFreeFree38.8 t/s
素墨APIQwenqwen3-vl-235b-a22b-thinking$0.010$0.010—
素墨APIQwenqwen3-vl-8b-thinking$0.010$0.010—
ETOS APIQwenQwen/Qwen3-VL-30B-A3B-Thinking$0.070$0.280—
OpenRouter FansQwenqwen/qwen3-vl-8b-thinking$0.117$1.36—
OpenRouter FansQwenqwen/qwen3-vl-30b-a3b-thinking$0.130$1.56—
ZetaTechs APIQwenqwen3-vl-8b-thinking$0.140$1.40—
Seamee APIQwenQwen/Qwen3-VL-8B-Thinking$0.180$2.00—
OpenRouter FansQwenQwen/Qwen3-VL-8B-Thinking$0.180$2.00—
OpenRouter FansQwenQwen/Qwen3-VL-235B-A22B-Thinking$0.200$0.800—
OpenRouter FansQwenQwen/Qwen3-VL-32B-Thinking$0.200$1.50—
ZetaTechs APIQwenqwen3-vl-30b-a3b-thinking$0.210$2.10—
ETOS APIQwenQwen/Qwen3-VL-235B-A22B-Thinking$0.250$1.00—
OpenRouter FansQwenqwen/qwen3-vl-235b-a22b-thinking$0.260$2.60—
Seamee APIQwenQwen/Qwen3-VL-30B-A3B-Thinking$0.290$1.00—
OpenRouter FansQwenQwen/Qwen3-VL-30B-A3B-Thinking$0.290$1.00—
人人 APIQwenqwen3-vl-235b-a22b-thinking$0.300$0.300—
ChatGTPQwenqwen3-vl-30b-a3b-thinking$0.375$3.75—
AAAIQwenqwen3-vl-8b-thinking$0.500$5.00—
柏拉图AIQwenqwen3-vl-8b-thinking$0.500$5.00—
Showing 20 of 77 providers.

Pricing data from provider public APIs

Free Qwen3 VL Thinking API Tier & Credits

Qwen3 VL Thinking is free to use through 1 provider with no per-token charges. These providers offer free API credits or a free tier:

ProviderSpeed (t/s)
算了么 API38.8 tok/s

API Speed Benchmarks by Provider

Compare speed and latency performance across all API providers.

Showing 1-2 of 2 providers

Most testedRecently testedA–Z
ProviderSpeedLatencyTests
Fireworks AIFireworks AI

accounts/fireworks/models/qwen3-vl-235b-a22b-thinking

51.54 tok/s
1.25s
5
算了么 API算了么 API

Qwen/Qwen3-VL-32B-Thinking

38.81 tok/s
19.77s
69

Recent API Speed Tests

20 records

Latest benchmark results measuring API response speed and first-token latency.

TimeModelSpeedLatency
03/30/2026, 16:26
QwenQwen/Qwen3-VL-32B-Thinking
26.50 tok/s
54.45s
03/30/2026, 16:26
QwenQwen/Qwen3-VL-32B-Thinking
35.49 tok/s
20.31s
03/30/2026, 16:26
QwenQwen/Qwen3-VL-32B-Thinking
28.97 tok/s
25.33s
03/30/2026, 16:26
QwenQwen/Qwen3-VL-32B-Thinking
3.59 tok/s
30.18s
03/30/2026, 16:26
QwenQwen/Qwen3-VL-32B-Thinking
33.21 tok/s
29.26s
03/24/2026, 17:04
QwenQwen/Qwen3-VL-32B-Thinking
33.88 tok/s
21.22s
03/24/2026, 17:04
QwenQwen/Qwen3-VL-32B-Thinking
35.12 tok/s
10.10s
03/24/2026, 17:04
QwenQwen/Qwen3-VL-32B-Thinking
32.49 tok/s
21.67s
03/24/2026, 17:04
QwenQwen/Qwen3-VL-32B-Thinking
39.50 tok/s
9.95s
03/24/2026, 17:04
QwenQwen/Qwen3-VL-32B-Thinking
20.69 tok/s
21.71s
03/23/2026, 01:58
QwenQwen/Qwen3-VL-32B-Thinking
34.68 tok/s
14.42s
03/23/2026, 01:58
QwenQwen/Qwen3-VL-32B-Thinking
35.50 tok/s
8.57s
03/23/2026, 01:58
QwenQwen/Qwen3-VL-32B-Thinking
55.33 tok/s
12.08s
03/23/2026, 01:58
QwenQwen/Qwen3-VL-32B-Thinking
35.71 tok/s
18.73s
03/23/2026, 01:58
QwenQwen/Qwen3-VL-32B-Thinking
42.87 tok/s
16.08s
03/17/2026, 13:54
QwenQwen/Qwen3-VL-32B-Thinking
41.00 tok/s
21.78s
03/17/2026, 13:54
QwenQwen/Qwen3-VL-32B-Thinking
43.46 tok/s
7.96s
03/17/2026, 13:54
QwenQwen/Qwen3-VL-32B-Thinking
60.33 tok/s
7.85s
03/17/2026, 13:54
QwenQwen/Qwen3-VL-32B-Thinking
41.29 tok/s
10.65s
03/17/2026, 13:54
QwenQwen/Qwen3-VL-32B-Thinking
67.28 tok/s
10.46s
See all free LLM models →

Frequently Asked Questions

Is Qwen3 VL Thinking API free?
Yes, Qwen3 VL Thinking is available for free through 1 API provider on LMSpeed, including 算了么 API. These providers offer free API credits or a free tier with no per-token charges.
How much does Qwen3 VL Thinking API cost?
Qwen3 VL Thinking API pricing ranges from $0.010 to $75.00 per million input tokens across 77 providers. 素墨API offers the cheapest rate at $0.010/M. Output pricing varies by provider.
Which provider has the cheapest Qwen3 VL Thinking API pricing?
The cheapest Qwen3 VL Thinking API pricing is offered by 素墨API at $0.010 per million input tokens. Compare all 77 providers above to find the best pricing per token for your use case.

Alternatives & Similar Models

DeepSeek
DeepSeek R1deepseek-r1
26 shared providers
DeepSeek
DeepSeek V3deepseek-v3
25 shared providers
Gemini
Gemini 3 Flashgemini-3-flash
25 shared providers
DeepSeek
DeepSeek V3.2deepseek-v3-2
25 shared providers
Gemini
Gemini 2.5 Progemini-2-5-pro
24 shared providers
OpenAI
GPT-5.2gpt-5-2
23 shared providers