Why is this comparison indexable?

It has 6 verifiable comparison points, and both models have pricing or benchmark data.

Are missing metrics invented?

No. Metrics without LMSpeed data are omitted from this report.

Sponsored byFusecodeEnterprise coding API for Claude Code, Codex, and model workflows.

LMSpeed

GPT-OSS vs Phi 4 Multimodal Instruct: Price and Speed | LMSpeed

Back to models

Data points: 57

Model compare

GPT-OSS vs Phi 4 Multimodal Instruct

The readout for GPT-OSS and Phi 4 Multimodal Instruct, before the detailed comparison sheet.

Model A

GPT-OSS

gpt-oss

Leading

Model B

Phi 4 Multimodal Instruct

phi-4-multimodal-instruct

Contender

Key Takeaways

Weighted outcome: GPT-OSS. Benchmark capability categories carry 80%, while price, API performance, and availability carry 20%.

Decision read

GPT-OSS

GPT-OSS has the higher weighted result; Model A / B score 80 to 20.

Evidence depth

57 data points

Includes 0 benchmark rows, 0 audit samples, and 8 provider examples.

Selection signal

Start with GPT-OSS

The charts below split 8 high-signal samples across speed, scores, and audit health.

Change comparison

Switch either side of this report to compare another model with the same LMSpeed data pipeline.

Model AModel B

Comparison sheet

This report only uses LMSpeed data for GPT-OSS and Phi 4 Multimodal Instruct: pricing, speed aggregates, third-party benchmark scores, and shared provider samples.

Model compare	GPT-OSS	Phi 4 Multimodal Instruct
Overall leader	Leading	Contender
Weighted overall score	80.0 pts	20.0 pts
Benchmark category leads	0 categories	0 categories
Operational advantages	Cheapest input price, Average speed, Free providers, Provider coverage	First-token latency
Context window	131.1K tokens	No data
Max output	131.1K tokens	No data
Modalities	Input Text Output Text	No data
Features

Model metadata

Model compare	GPT-OSS	Phi 4 Multimodal Instruct
Developer	No data	No data
Released	Aug 2025	No data
Parameters	120B	No data
Tokenizer	GPT	No data
Knowledge cutoff	2024-06-30	No data
OpenRouter ID	openai/gpt-oss-120b:free	No data
References	No data	No data

API audit comparison

Latest completed audits from shared providers, with four safety and integrity score groups plus report links.

Provider	GPT-OSS	Phi 4 Multimodal Instruct
No completed audits are available from shared providers yet.

Provider examples

Speed aggregates and input/output pricing share each provider row for real API selection and migration cost checks.

Provider	GPT-OSS	Phi 4 Multimodal Instruct
120 tests	GPT-OSS speed / latency 153 tok/s / 1011ms input / output No data	Phi 4 Multimodal Instruct speed / latency 83 tok/s / 393ms input / output No data
35 tests	GPT-OSS speed / latency 1061 tok/s / 857ms input / output No data	Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data
35 tests	GPT-OSS speed / latency 347 tok/s / 2670ms input / output No data	Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data
20 tests	GPT-OSS speed / latency 164 tok/s / 1247ms input / output No data	Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data
10 tests	GPT-OSS speed / latency 473 tok/s / 827ms input / output No data	Phi 4 Multimodal Instruct speed / latency N/A / N/A input / output No data
	GPT-OSS gpt-oss-120b speed / latency No data input / output $0.150/M/$0.600/M	Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct speed / latency No data input / output $0/M
	GPT-OSS gpt-oss-120b speed / latency No data input / output $0.150/M/$0.600/M	Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct speed / latency No data input / output $0/M
	GPT-OSS openai/gpt-oss-20b speed / latency No data input / output $0.290/M/$1.40/M	Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct speed / latency No data input / output $0/M

GPT-OSS vs Phi 4 Multimodal Instruct

GPT-OSS

Phi 4 Multimodal Instruct

Key Takeaways

Change comparison

Comparison sheet

Model metadata

When to choose each model

Benchmark score comparison

Category performance

Agents

Coding

Reasoning

Professional benchmark details

API audit comparison

Provider examples

FAQ

Knowledge

Math

Multilingual

Multimodal

Instruction following

Comparison sheet

Model metadata

When to choose each model

Benchmark score comparison

Category performance

Agents

Coding

Reasoning

Professional benchmark details

API audit comparison

Provider examples

FAQ

Related compare reports

Knowledge

Math

Multilingual

Multimodal

Instruction following