gpt-oss-120b

flagship

OpenAI · released 2025-08-04 · text->text

currently routing · 4.2k rpm

131K tokens

Context

$0.04 / 1M

Input

$0.19 / 1M

Output

— t/s

Speed

open

License

/ ABOUT

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.

BENCHMARKS Artificial Analysis Index

Intelligence 33.3

Coding 28.6

Agentic 37.9