Skip to content
NVIDIA

Nemotron ASR Streaming

flagship
NVIDIA · released 2025-01-01 · text
currently routing · 4.2k rpm
1K tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
open
License
/ ABOUT

NVIDIA Nemotron ASR Streaming is an automatic speech recognition model optimized for real-time, streaming transcription. It processes audio in a continuous streaming fashion, providing low-latency partial transcriptions that are refined as more audio becomes available.

The model handles diverse accents, background noise conditions, and speaking styles with high accuracy. It supports word-level timestamps, punctuation prediction, and can be configured for different latency-accuracy tradeoffs depending on the application requirements.

Nemotron ASR Streaming is designed for live captioning, real-time meeting transcription, voice assistants, and any application requiring immediate speech-to-text conversion.

Providers for Nemotron ASR Streaming

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
bf16
0.00%