SiliconFlow

Name: SiliconFlow
Rating: 2.3 (1197 reviews)

SiliconFlow provides cost-effective generative AI cloud services built on open-source models for text, image, video, and audio generation.

Supported Models

Model	Speed	Latency	Tests
PaddlePaddle/PaddleOCR-VL-1.5	279.58 tok/s	4.15s	3
THUDM/GLM-Z1-9B-0414	176.03 tok/s	13.40s	24
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B	142.95 tok/s	4.47s	5
Qwen/Qwen3-Omni-30B-A3B-Instruct	128.77 tok/s	0.47s	5
Qwen/Qwen3-VL-8B-Instruct	106.50 tok/s	0.92s	10
Qwen/Qwen3.5-4B	91.28 tok/s	23.54s	5
inclusionAI/Ring-flash-2.0	89.88 tok/s	6.43s	5
Qwen/Qwen3-Next-80B-A3B-Instruct	86.82 tok/s	0.62s	10
stepfun-ai/Step-3.5-Flash	84.22 tok/s	3.37s	5
Qwen/Qwen2-7B-Instruct	81.93 tok/s	0.57s	25
Qwen/Qwen3-14B	78.64 tok/s	9.81s	5
Pro/THUDM/glm-4-9b-chat	76.25 tok/s	0.63s	10
THUDM/glm-4-9b-chat	75.38 tok/s	0.59s	15
zai-org/GLM-4.7	74.72 tok/s	16.10s	5
zai-org/GLM-4.5V	73.00 tok/s	6.15s	10
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B	72.09 tok/s	8.93s	30
THUDM/GLM-4-9B-0414	71.99 tok/s	0.95s	15
Pro/MiniMaxAI/MiniMax-M2.5	71.37 tok/s	9.63s	25
Pro/Qwen/Qwen2-7B-Instruct	71.29 tok/s	0.56s	5
Qwen/QwQ-32B-Preview	69.75 tok/s	0.62s	5

Showing 20 of 67 models.
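
The Speed and Latency columns trade off against each other, so the quickest model for a short reply may not be the quickest for a long one. The sketch below is one rough way to compare two rows. It assumes (the listing does not say so) that Latency is the wait before the first token and Speed is the steady streaming rate afterwards. The function names and the output-token counts are hypothetical and chosen only for illustration.

```python
def estimated_total_seconds(latency_s: float, speed_tok_s: float, output_tokens: int) -> float:
    """Rough end-to-end time under the stated assumption:
    wait `latency_s` for the first token, then stream at `speed_tok_s`."""
    return latency_s + output_tokens / speed_tok_s


# Two rows copied from the table above: (latency in seconds, speed in tok/s).
rows = {
    "THUDM/GLM-Z1-9B-0414": (13.40, 176.03),
    "Qwen/Qwen2-7B-Instruct": (0.57, 81.93),
}


def fastest(rows: dict, output_tokens: int) -> str:
    """Return the model name with the lowest estimated total time."""
    return min(
        rows,
        key=lambda name: estimated_total_seconds(*rows[name], output_tokens),
    )


if __name__ == "__main__":
    # Hypothetical reply lengths, not taken from the listing.
    for n in (500, 5000):
        print(n, "tokens:", fastest(rows, n))
```

Under these assumptions the low-latency model comes out ahead for short replies and the high-throughput model for long ones; with real traffic the crossover point would depend on how the listing actually measures both columns.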