Updated 12/27/2025
Performance Stats
Avg Speed: 1292.13 t/s
Latency: 4.79s
Total Tests: 30
Models: 5

Cerebras

About Cerebras

Provides AI inference and training APIs leveraging Cerebras hardware for large-scale model deployment.

OpenAI gpt-oss, MetaAI Llama 3.3, ChatGLM GLM-4

Health Check

Recent availability: 100%
History (72 pts, past to now)

Supported Models

Model                            Speed         Latency   Tests
llama3.1-8b                      2142.09 t/s   0.19s     5
gpt-oss-120b                     1920.13 t/s   0.54s     5
llama-3.3-70b                    1532.55 t/s   0.25s     5
qwen-3-235b-a22b-instruct-2507   851.89 t/s    12.09s    10
zai-glm-4.7                      454.25 t/s    3.57s     5
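
The headline figures under Performance Stats appear to be consistent with a mean of the per-model rows weighted by each model's test count. The page does not state how they are computed, so the sketch below is only a check under that assumption; the `rows` list and the `weighted_mean` helper are illustrative names, not part of any Cerebras or benchmark API.

```python
# Per-model rows copied from the Supported Models table:
# (model, speed in tokens/s, latency in seconds, number of tests)
rows = [
    ("llama3.1-8b", 2142.09, 0.19, 5),
    ("gpt-oss-120b", 1920.13, 0.54, 5),
    ("llama-3.3-70b", 1532.55, 0.25, 5),
    ("qwen-3-235b-a22b-instruct-2507", 851.89, 12.09, 10),
    ("zai-glm-4.7", 454.25, 3.57, 5),
]


def weighted_mean(values, weights):
    """Return the mean of `values`, weighting each entry by `weights`."""
    total_weight = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total_weight


# Assumption: each model contributes in proportion to its test count.
tests = [r[3] for r in rows]
avg_speed = weighted_mean([r[1] for r in rows], tests)
avg_latency = weighted_mean([r[2] for r in rows], tests)
total_tests = sum(tests)

print(f"{avg_speed:.2f} t/s, {avg_latency:.2f}s, {total_tests} tests, {len(rows)} models")
# prints: 1292.13 t/s, 4.79s, 30 tests, 5 models
```

Under this weighting, the ten qwen-3-235b-a22b-instruct-2507 tests, with their roughly 12-second latency, account for most of the 4.79s overall latency; an unweighted mean across the five models would give different values.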

Recent Test Records

Time               Model                            Speed         Latency
Jan 13, 04:32 PM   zai-glm-4.7                      454.25 t/s    3.57s
Dec 25, 02:21 PM   qwen-3-235b-a22b-instruct-2507   848.10 t/s    12.10s
Dec 25, 02:12 PM   qwen-3-235b-a22b-instruct-2507   855.69 t/s    12.09s
Dec 25, 02:06 PM   llama3.1-8b                      2142.09 t/s   0.19s
Dec 25, 02:06 PM   gpt-oss-120b                     1920.13 t/s   0.54s
Dec 25, 02:02 PM   llama-3.3-70b                    1532.55 t/s   0.25s