Data points: 56
The readout for GLM-4.1v Thinking Flash and GLM-4.7, before the detailed comparison sheet.
Decision read
GLM-4.7
GLM-4.7 currently has the stronger profile, with 4 verified metric wins to 2 for GLM-4.1v Thinking Flash.
Evidence depth
56 data points
Includes 0 benchmark rows, 0 audit samples, and 9 provider examples.
Selection signal
Start with GLM-4.7
The charts below split 9 high-signal samples across speed, scores, and audit health.
| Model compare: GLM-4.1v Thinking Flash vs GLM-4.7 | Model A: GLM-4.1v Thinking Flash | Model B: GLM-4.7 |
|---|---|---|
| Overall leader | Contender | Leading |
| Verified metric wins | 2 wins | 4 wins |
| Where it leads | First-token latency, Recent tests | Cheapest input price, Average speed, Free providers, Provider coverage |
Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.
| Metric | GLM-4.1v Thinking Flash | GLM-4.7 |
|---|---|---|
| No shared benchmark metrics are available yet. | | |
Latest completed audits from shared providers, with four safety and integrity score groups plus report links.
| Provider | GLM-4.1v Thinking Flash | GLM-4.7 |
|---|---|---|
| No completed audits are available from shared providers yet. | | |
Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.
| Provider | GLM-4.1v Thinking Flash | GLM-4.7 |
|---|---|---|
| 30 tests | speed / latency: 96 tok/s / 7100ms; input / output: No data |
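To see how the speed and latency aggregates trade off for a given response length, the sketch below estimates wall-clock time per response. It assumes the "latency" figure is first-token latency and that tokens stream at the listed average rate afterwards; neither assumption is confirmed by the report. The function name and the 500-token response length are hypothetical, and the sample figures are aggregates quoted on this page.

```python
def estimated_response_seconds(first_token_ms: float,
                               tokens_per_second: float,
                               output_tokens: int) -> float:
    """Rough time to receive a full response.

    Assumption: total time = first-token latency + streaming time,
    where streaming time = output_tokens / tokens_per_second.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_ms / 1000.0 + output_tokens / tokens_per_second


# Aggregates quoted on this page: (tok/s, latency in ms).
samples = {
    "GLM-4.1v Thinking Flash (96 tok/s / 7100ms)": (96, 7100),
    "GLM-4.7 (42 tok/s / 24443ms)": (42, 24443),
    "GLM-4.7 glm-4.7-free (309 tok/s / 3722ms)": (309, 3722),
}

if __name__ == "__main__":
    output_tokens = 500  # hypothetical response length
    for label, (tok_s, latency_ms) in samples.items():
        seconds = estimated_response_seconds(latency_ms, tok_s, output_tokens)
        print(f"{label}: ~{seconds:.1f}s for {output_tokens} tokens")
```

For short responses the latency term dominates, so a provider with lower throughput but faster first token can still finish sooner; for long responses the tok/s figure matters more.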
This report only uses LMSpeed data for GLM-4.1v Thinking Flash and GLM-4.7: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.
| Guidance | GLM-4.1v Thinking Flash | GLM-4.7 |
|---|---|---|
| When to choose each model | GLM-4.1v Thinking Flash is stronger when you prioritize first-token latency and recent tests. | GLM-4.7 is stronger when you prioritize cheapest input price, average speed, free providers, and provider coverage. |
TL;DR: GLM-4.7 leads with 4 verified metric wins to 2 across 56 data points covering pricing, speed, latency, and provider examples; no shared benchmark metrics or completed audits are available yet.
| Model metadata | No OpenRouter metadata is available yet for this model. | GLM-4.7 exposes a 202.8K-token context window; notable signals: Text input, Text output, Tool calling, Structured outputs. |
|---|---|---|
| Developer | Zhipu AI | No data |
| Context window | No data | 202.8K tokens |
| Max output | No data | 131.1K tokens |
| Released | No data | Dec 2025 |
| Modalities | No data | Input: Text; Output: Text |
| Features | None listed | Text input, Text output, Tool calling, Structured outputs, JSON mode, Reasoning |
| Parameters | No data | No data |
| Tokenizer | No data | Other |
| Knowledge cutoff | No data | No data |
| OpenRouter ID | No data | z-ai/glm-4.7 |
| References | No data | No data |
GLM-4.7: speed / latency 42 tok/s / 24443ms; input / output: No data
| Provider | GLM-4.1v Thinking Flash | GLM-4.7 |
|---|---|---|
| APDSM (5 tests) | speed / latency: 70 tok/s / 9293ms; input / output: No data | speed / latency: N/A / N/A; input / output: No data |
| 素墨API (5 tests) | glm-4.1v-thinking-flash; speed / latency: N/A / N/A; input / output: $0.010/M / $0.010/M | glm-4.7-free; speed / latency: 309 tok/s / 3722ms; input / output: $0.010/M / $0.010/M |
| 6345ywz API (0 tests) | speed / latency: N/A / N/A; input / output: No data | speed / latency: N/A / N/A; input / output: No data |
| DMXAPI (0 tests) | speed / latency: N/A / N/A; input / output: No data | speed / latency: N/A / N/A; input / output: No data |
| | glm-4.1v-thinking-flash; speed / latency: No data; input / output: $0.0002/M / $0.0002/M | glm-4.7; speed / latency: No data; input / output: $75.00/M / $75.00/M |
| | glm-4.1v-thinking-flash; speed / latency: No data; input / output: $0.037/M / $0.037/M | glm-4.7; speed / latency: No data; input / output: $0.037/M / $0.037/M |
| | glm-4.1v-thinking-flash; speed / latency: No data; input / output: $0.037/M / $0.037/M | glm-4.7; speed / latency: No data; input / output: $0.037/M / $0.037/M |
| | glm-4.1v-thinking-flash; speed / latency: No data; input / output: $0.098/M / $0.098/M | glm-4.7; speed / latency: No data; input / output: $1.96/M / $7.83/M |
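For a migration cost check, the per-million-token prices in a provider row can be turned into a workload estimate. The sketch below is one way to do that arithmetic, assuming "$X/M" means US dollars per one million tokens and that input and output tokens are billed separately. The function name and the monthly token volumes are hypothetical; the prices are taken from the last provider row above.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)


def workload_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: str, output_price_per_m: str) -> Decimal:
    """Cost in dollars for a workload, given per-million-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_m)
    return (input_cost + output_cost) / MILLION


if __name__ == "__main__":
    # Hypothetical monthly volume: 50M input tokens, 10M output tokens.
    monthly_in, monthly_out = 50_000_000, 10_000_000

    # Prices from the last provider row (input / output, per million tokens).
    flash = workload_cost(monthly_in, monthly_out, "0.098", "0.098")
    glm47 = workload_cost(monthly_in, monthly_out, "1.96", "7.83")

    print(f"GLM-4.1v Thinking Flash: ${flash:.2f}")
    print(f"GLM-4.7: ${glm47:.2f}")
    print(f"Difference: ${glm47 - flash:.2f}")
```

Because output pricing can differ sharply from input pricing on the same row, the ratio of output to input tokens in the workload changes the result considerably, so it is worth running the estimate with real traffic figures.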
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.