Skip to content
OpenAI

gpt-oss-20b

flagship
OpenAI · released 2025-08-04 · text->text
currently routing · 4.2k rpm
131K tokens
Context
$0.03 / 1M
Input
$0.14 / 1M
Output
— t/s
Speed
open
License
/ ABOUT

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

BENCHMARKS Artificial Analysis Index
Intelligence 24.5
Coding 18.5
Agentic 27.6

Providers for gpt-oss-20b

5 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
131K
bf16
0.00%
131K
bf16
0.00%
131K
bf16
0.00%
131K
bf16
0.00%