Name: SiliconFlow
Rating: 2.2 (959 reviews)

Supported Models

Model	Speed	Latency	Tests
THUDM/GLM-Z1-9B-0414	171.50 t/s	13.03s	25
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B	142.95 t/s	4.47s	5
Qwen/Qwen3-VL-8B-Instruct	142.73 t/s	0.64s	5
Qwen/Qwen3-VL-8B-Instruct	142.73 t/s	0.64s	5
Qwen/Qwen3-Next-80B-A3B-Instruct	86.82 t/s	0.62s	10
Qwen/Qwen3-Next-80B-A3B-Instruct	86.82 t/s	0.62s	10
Pro/MiniMaxAI/MiniMax-M2.5	85.18 t/s	6.00s	5
Qwen/Qwen2-7B-Instruct	81.93 t/s	0.57s	25
Qwen/Qwen3-14B	78.64 t/s	9.81s	5
Pro/THUDM/glm-4-9b-chat	76.25 t/s	0.63s	10
THUDM/glm-4-9b-chat	75.38 t/s	0.59s	15
Pro/zai-org/GLM-4.7	74.70 t/s	17.00s	15
zai-org/GLM-4.5V	73.00 t/s	6.15s	10
zai-org/GLM-4.5V	73.00 t/s	6.15s	10
zai-org/GLM-4.5V	73.00 t/s	6.15s	10
Pro/Qwen/Qwen2-7B-Instruct	71.29 t/s	0.56s	5
zai-org/GLM-4.6	70.24 t/s	1.60s	15
zai-org/GLM-4.6	70.24 t/s	1.60s	15
Qwen/QwQ-32B-Preview	69.75 t/s	0.62s	5
internlm/internlm2_5-7b-chat	68.17 t/s	0.56s	20

Showing 20 of 89 models.

Recent Test Records

Time	Model	Speed	Latency
Feb 28, 04:36 AM	deepseek-ai/DeepSeek-V3.2	21.69 t/s	0.77s
Feb 25, 06:45 PM	Pro/MiniMaxAI/MiniMax-M2.5	85.18 t/s	6.00s
Feb 25, 06:37 PM	Pro/deepseek-ai/DeepSeek-V3.2	47.80 t/s	32.32s
Feb 25, 06:32 PM	deepseek-ai/DeepSeek-V3.2	21.83 t/s	0.82s
Jan 24, 04:58 PM	Qwen/Qwen3-235B-A22B-Instruct-2507	15.27 t/s	0.71s
Jan 24, 04:56 PM	zai-org/GLM-4.6	67.71 t/s	2.88s
Jan 23, 11:46 PM	zai-org/GLM-4.6	75.43 t/s	0.61s
Jan 23, 02:16 PM	zai-org/GLM-4.6	67.59 t/s	1.32s
Jan 23, 02:13 PM	Pro/zai-org/GLM-4.7	78.69 t/s	15.95s
Jan 21, 08:52 AM	deepseek-ai/DeepSeek-V3.2	20.92 t/s	1.12s