Skip to content
OpenAI

GPT-4.1-nano

flagship
OpenAI · released 2024-06-30 · text+image+file->text
currently routing · 4.2k rpm
1.0M tokens
Context
$0.10 / 1M
Input
$0.40 / 1M
Output
— t/s
Speed
proprietary
License
/ ABOUT

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.

BENCHMARKS Artificial Analysis Index
Intelligence 13
Coding 11.2
Agentic 5.8

Providers for GPT-4.1-nano

2 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
1.0M
bf16
0.00%
1.0M
bf16
0.00%