Skip to content
NVIDIA

Nemotron Nano 9B V2

flagship
NVIDIA · released 2025-03-31 · text->text
currently routing · 4.2k rpm
131K tokens
Context
$0.04 / 1M
Input
$0.16 / 1M
Output
— t/s
Speed
open
License
/ ABOUT

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response.

The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

BENCHMARKS Artificial Analysis Index
Intelligence 14.8
Coding 8.3
Agentic 9.4

Providers for Nemotron Nano 9B V2

2 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
131K
bf16
0.00%
131K
bf16
0.00%