Skip to content
OpenAI

o4-mini

flagship
OpenAI · released 2024-06-30 · text+image+file->text
currently routing · 4.2k rpm
200K tokens
Context
$1.10 / 1M
Input
$4.40 / 1M
Output
— t/s
Speed
proprietary
License
/ ABOUT

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning and coding performance across benchmarks like AIME (99.5% with Python) and SWE-bench, outperforming its predecessor o3-mini and even approaching o3 in some domains.

Despite its smaller size, o4-mini exhibits high accuracy in STEM tasks, visual problem solving (e.g., MathVista, MMMU), and code editing. It is especially well-suited for high-throughput scenarios where latency or cost is critical. Thanks to its efficient architecture and refined reinforcement learning training, o4-mini can chain tools, generate structured outputs, and solve multi-step tasks with minimal delay—often in under a minute.

BENCHMARKS Artificial Analysis Index
Intelligence 33.1
Coding 25.6
Agentic 36.1

Providers for o4-mini

3 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
200K
bf16
0.00%
200K
bf16
0.00%
200K
bf16
0.00%