Data points: 57
The readout for GPT-OSS and Phi 4 Multimodal Instruct, before the detailed comparison sheet.
Decision read
GPT-OSS
GPT-OSS currently has the stronger profile, with verified wins split 5 to 1.
Evidence depth
57 data points
Includes 0 benchmark rows, 0 audit samples, and 7 provider examples.
Selection signal
Start with GPT-OSS
The charts below split 7 high-signal samples across speed, scores, and audit health.
Model compare GPT-OSS vs Phi 4 Multimodal Instructgpt-oss-vs-phi-4-multimodal-instruct | Model A GPT-OSS | Model B Phi 4 Multimodal Instruct |
|---|---|---|
| Overall leader | Leading | Contender |
| Verified metric wins | 5 wins | 1 wins |
| Where it leads | Cheapest input price, Average speed, Free providers, Provider coverage, Recent tests |
Third-party benchmark profile synced into LMSpeed; only metrics available for both models are shown.
| Metric | GPT-OSS | Phi 4 Multimodal Instruct |
|---|---|---|
| No shared benchmark metrics are available yet. | ||
Latest completed audits from shared providers, with four safety and integrity score groups plus report links.
| Provider | GPT-OSS | Phi 4 Multimodal Instruct |
|---|---|---|
| No completed audits are available from shared providers yet. | ||
Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.
| Provider | GPT-OSS | Phi 4 Multimodal Instruct |
|---|---|---|
115 tests | GPT-OSS speed / latency 154 tok/s / 963ms input / output No data | Phi 4 Multimodal Instruct |
This report only uses LMSpeed data for GPT-OSS and Phi 4 Multimodal Instruct: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.
| Guidance | GPT-OSS | Phi 4 Multimodal Instruct |
|---|---|---|
| When to choose each model | GPT-OSS GPT-OSS is stronger when you prioritize Cheapest input price, Average speed, Free providers, Provider coverage, Recent tests. | Phi 4 Multimodal Instruct Phi 4 Multimodal Instruct is stronger when you prioritize First-token latency. |
TL;DR: GPT-OSS leads across 57 verifiable data points, including pricing, speed, latency, benchmarks, and provider examples.
Continue from GPT-OSS vs Phi 4 Multimodal Instruct into nearby model comparisons with enough verified LMSpeed data.
| First-token latency |
| Model metadata | GPT-OSS exposes 131.1K tokens; notable signals: Text input, Text output, Tool calling, Reasoning. | No OpenRouter metadata is available yet for this model. |
|---|---|---|
| Developer | No data | No data |
| Context window | 131.1K tokens | No data |
| Max output | 131.1K tokens | No data |
| Released | Aug 2025 | No data |
| Modalities | Input Text Output Text | No data |
| Features | Text inputText outputTool callingReasoning | None listed |
| Parameters | 120B | No data |
| Tokenizer | GPT | No data |
| Knowledge cutoff | 2024-06-30 | No data |
| OpenRouter ID | openai/gpt-oss-120b:free | No data |
| References | No data | No data |
speed / latency
83 tok/s / 393ms
input / output
No data
素墨API35 tests | GPT-OSS speed / latency 347 tok/s / 2670ms input / output No data | Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data |
|---|
星见雅 API20 tests | GPT-OSS speed / latency 164 tok/s / 1247ms input / output No data | Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data |
|---|
WSocket AI10 tests | GPT-OSS openai/gpt-oss-20b speed / latency 125 tok/s / 2302ms input / output $499.50/M / $1998.00/M | Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct speed / latency N/A / N/A input / output $0/M / $0/M |
|---|
6345ywz API5 tests | GPT-OSS speed / latency 127 tok/s / 787ms input / output No data | Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data |
|---|
GPT-OSS gpt-oss-120b speed / latency No data input / output $1.50/M / $7.50/M | Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct speed / latency No data input / output $0/M / $0/M |
GPT-OSS openai/gpt-oss-120b speed / latency No data input / output $0.039/M / $0.190/M | Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct speed / latency No data input / output $0/M / $0/M |
Rankings are based on community-submitted tests and periodic health probes. Advisory only, not official data.