gpt-oss
Also known as
GPT-OSS is available through 148 API providers on LMSpeed. Compare API pricing from $0.0001 to $75.00 per million input tokens across providers. Free API access is offered by 10 providers. In speed benchmarks, the fastest provider reaches 1796 tok/s.
Compare GPT-OSS API pricing across 138 providers. Input prices range from $0.0001 to $75.00 per million input. 91VIP offers the lowest rate at $0.0001/M.
| Provider | Model Variant | Input ($/M) | Output ($/M) | Speed (t/s) |
|---|---|---|---|---|
| openai/gpt-oss-120b | Free | Free | — | |
| openai/gpt-oss-20b | Free | Free | — | |
| 乐天图书馆 | gpt-oss-120b-free | Free | Free | — |
| 91VIP API | openai/gpt-oss-20b | Free | Free | — |
| openai/gpt-oss-120b:free | Free | Free | — | |
| openai/gpt-oss-20b:free | Free | Free | — | |
| fffaa AI | openai/gpt-oss-120b:free | Free | Free | — |
| fffaa AI | openai/gpt-oss-20b:free | Free | Free | — |
| MapleLeaf API | openai/gpt-oss-120b:free | Free | Free | — |
| MapleLeaf API | openai/gpt-oss-20b:free | Free | Free | — |
| 91VIP | gpt-oss-20b | $0.0001 | $0.0000 | — |
| Futureppo | gpt-oss-20b | $0.0001 | $0.0000 | — |
| 91VIP | gpt-oss-120b | $0.0001 | $0.0000 | — |
| Futureppo | gpt-oss-120b | $0.0001 | $0.0000 | — |
| Xiao Wan | gpt-oss-120b | $0.0010 | $0.0010 | — |
| 素墨API | gpt-oss-120b | $0.010 | $0.010 | 970.8 t/s |
| 素墨API | openai/gpt-oss-120b | $0.010 | $0.010 | — |
| 素墨API | openai/gpt-oss-120b:free | $0.010 | $0.010 | — |
| 素墨API | openai/gpt-oss-20b | $0.010 | $0.010 | 234.4 t/s |
| 素墨API | openai/gpt-oss-20b:free | $0.010 | $0.010 | — |
Pricing data from provider public APIs
GPT-OSS is free to use through 10 providers with no per-token charges. These providers offer free API credits or a free tier:
| Provider | Speed (t/s) |
|---|---|
| — | |
| — | |
| 乐天图书馆 | — |
| 91VIP API | — |
| — | |
| — | |
| fffaa AI | — |
| fffaa AI | — |
| MapleLeaf API | — |
| MapleLeaf API | — |
Compare speed and latency performance across all API providers.
Showing 1-20 of 35 providers
| Provider | Speed | Latency | Tests |
|---|---|---|---|
玄黄 gpt-oss-120b | 1796.31 tok/s | 0.49s | 5 |
Medu Chat gpt-oss-120b | 1677.82 tok/s | 0.56s | 10 |
gpt-oss-120b | 1637.28 tok/s | 0.91s | 10 |
gpt-oss-120b | 1467.36 tok/s | 0.82s | 5 |
VSLLM gpt-oss-120b | 1319.02 tok/s | 0.61s | 10 |
素墨API gpt-oss-120b | 970.77 tok/s | 0.94s | 5 |
gpt-oss-120b | 516.58 tok/s | 1.41s | 75 |
accounts/fireworks/models/gpt-oss-20b | 359.18 tok/s | 1.14s | 10 |
素墨API gpt-oss-20b:free | 339.45 tok/s | 2.51s | 5 |
Rnglg2 API gpt-oss-120b-medium | 337.69 tok/s | 2.32s | 20 |
对空六课 API openai/gpt-oss-120b | 275.83 tok/s | 0.76s | 5 |
素墨API gpt-oss-120b:free | 257.64 tok/s | 2.50s | 5 |
gpt-oss-120b-medium | 255.84 tok/s | 1.45s | 5 |
小智API gpt-oss-20b | 246.32 tok/s | 2.13s | 20 |
玄黄 gpt-oss-120b-medium | 244.47 tok/s | 1.16s | 5 |
IPv4 Beta LM Studio gpt-oss:20b | 243.42 tok/s | 2.62s | 5 |
openai/gpt-oss-120b:novita | 240.32 tok/s | 1.26s | 5 |
素墨API openai/gpt-oss-20b | 234.37 tok/s | 1.48s | 15 |
openai/gpt-oss-20b | 227.95 tok/s | 1.47s | 5 |
GPT Load (AllAI) openai/gpt-oss-120b | 217.84 tok/s | 7.42s | 60 |
Latest benchmark results measuring API response speed and first-token latency.
| Time | Model | Speed | Latency |
|---|---|---|---|
| 04/12/2026, 15:40 | openai/gpt-oss-120b | 157.75 tok/s | 1.26s |
| 04/12/2026, 15:40 | openai/gpt-oss-120b | 160.02 tok/s | 1.23s |
| 04/12/2026, 15:40 | openai/gpt-oss-120b | 184.11 tok/s | 1.38s |
| 04/12/2026, 15:40 | openai/gpt-oss-120b | 160.23 tok/s | 1.01s |
| 04/12/2026, 15:40 | openai/gpt-oss-120b | 135.94 tok/s | 1.32s |
| 04/12/2026, 11:56 | openai/gpt-oss-120b:free | 30.03 tok/s | 3.30s |
| 04/12/2026, 11:56 | openai/gpt-oss-120b:free | 41.25 tok/s | 1.22s |
| 04/12/2026, 11:56 | openai/gpt-oss-120b:free | 41.77 tok/s | 1.00s |
| 04/12/2026, 11:56 | openai/gpt-oss-120b:free | 26.94 tok/s | 2.07s |
| 04/12/2026, 11:56 | openai/gpt-oss-120b:free | 33.62 tok/s | 1.16s |
| 04/11/2026, 07:44 | 英伟达/openai/gpt-oss-120b | 128.23 tok/s | 0.92s |
| 04/11/2026, 07:44 | 英伟达/openai/gpt-oss-120b | 145.35 tok/s | 0.89s |
| 04/11/2026, 07:44 | 英伟达/openai/gpt-oss-120b | 135.85 tok/s | 1.17s |
| 04/11/2026, 07:44 | 英伟达/openai/gpt-oss-120b | 156.62 tok/s | 0.93s |
| 04/11/2026, 07:44 | 英伟达/openai/gpt-oss-120b | 153.99 tok/s | 0.79s |
| 04/10/2026, 13:58 | openai/gpt-oss-20b | 29.48 tok/s | 1.93s |
| 04/10/2026, 13:58 | openai/gpt-oss-20b | 35.47 tok/s | 2.15s |
| 04/10/2026, 05:15 | openai/gpt-oss-20b | 42.54 tok/s | 1.51s |
| 04/10/2026, 05:15 | openai/gpt-oss-20b | 52.37 tok/s | 1.53s |
| 04/10/2026, 05:15 | openai/gpt-oss-20b | 45.04 tok/s | 2.79s |
