Nemotron ASR Streaming

flagship

NVIDIA · released 2025-01-01 · text

currently routing · 4.2k rpm

Context: 1K tokens

Input: — / 1M

Output: — / 1M

Speed: — t/s

License: open

/ ABOUT

NVIDIA Nemotron ASR Streaming is an automatic speech recognition model optimized for real-time, streaming transcription. It processes audio continuously as it arrives, emitting low-latency partial transcriptions that are refined as more audio becomes available.
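The partial-then-refined behavior described above can be illustrated from the client side. The following is a minimal sketch, not the model's actual API: `TranscriptEvent`, its `is_final` flag, and `LiveTranscript` are hypothetical names chosen for illustration, and the sketch assumes each partial result replaces the previous one for the same segment until a final result arrives.

```python
from dataclasses import dataclass, field
from typing import Iterable, List


@dataclass
class TranscriptEvent:
    """Hypothetical streaming result (assumed shape, not a documented schema)."""
    text: str        # transcript of the current segment so far
    is_final: bool   # True once the segment will no longer be revised


@dataclass
class LiveTranscript:
    """Keeps finalized segments plus the latest, still-revisable partial."""
    committed: List[str] = field(default_factory=list)
    partial: str = ""

    def apply(self, event: TranscriptEvent) -> None:
        if event.is_final:
            # The segment is settled: commit it and clear the partial.
            self.committed.append(event.text)
            self.partial = ""
        else:
            # A newer partial supersedes the previous one for this segment.
            self.partial = event.text

    def render(self) -> str:
        parts = self.committed + ([self.partial] if self.partial else [])
        return " ".join(parts)


def replay(events: Iterable[TranscriptEvent]) -> LiveTranscript:
    """Apply a sequence of events in order and return the resulting state."""
    transcript = LiveTranscript()
    for event in events:
        transcript.apply(event)
    return transcript
```

A caption display would typically call `render()` after every event, showing committed text as stable and the partial as provisional.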

The model handles diverse accents, background noise conditions, and speaking styles with high accuracy. It supports word-level timestamps and punctuation prediction, and can be configured for different latency-accuracy tradeoffs depending on application requirements.
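One common way a latency-accuracy tradeoff shows up in streaming clients is the size of the audio chunks sent to the recognizer: a chunk cannot be sent until it has been filled, so its duration is a floor on added latency. The sketch below only illustrates that arithmetic. The 16 kHz, 16-bit mono PCM format and the chunk durations are assumptions for the example, not documented settings of this model, and `chunk_pcm` is a hypothetical helper.

```python
from typing import Iterator


def chunk_pcm(
    pcm: bytes,
    chunk_ms: int,
    sample_rate: int = 16000,    # assumed sample rate, in Hz
    bytes_per_sample: int = 2,   # assumed 16-bit mono samples
) -> Iterator[bytes]:
    """Split raw PCM audio into fixed-duration chunks; the last may be shorter."""
    if chunk_ms <= 0:
        raise ValueError("chunk_ms must be positive")
    samples_per_chunk = sample_rate * chunk_ms // 1000
    chunk_bytes = samples_per_chunk * bytes_per_sample
    for start in range(0, len(pcm), chunk_bytes):
        yield pcm[start:start + chunk_bytes]


# One second of silence in the assumed format: 16000 samples * 2 bytes.
one_second = bytes(16000 * 2)
small_chunks = list(chunk_pcm(one_second, chunk_ms=80))
large_chunks = list(chunk_pcm(one_second, chunk_ms=160))
```

Smaller chunks mean more frequent requests and less audio per request; larger chunks mean fewer requests but a longer wait before each one can be sent.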

Nemotron ASR Streaming is designed for live captioning, real-time meeting transcription, voice assistants, and any application requiring immediate speech-to-text conversion.

Providers for Nemotron ASR Streaming

1 route · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider | Context | Quant | Uptime · 30d

NVIDIA NIM | — | bf16 | 0.00%
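The routing-with-fallback idea described for this provider list can be sketched generically. This is not ClosedRouter's implementation: `route_with_fallback`, `AllProvidersFailed`, and the `(name, uptime, call)` tuple shape are hypothetical, and the sketch assumes providers are simply tried in descending order of uptime until one succeeds.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical provider record: (name, uptime fraction, callable that transcribes audio).
Provider = Tuple[str, float, Callable[[bytes], str]]


class AllProvidersFailed(RuntimeError):
    """Raised when every provider in the list has been tried without success."""


def route_with_fallback(providers: Sequence[Provider], audio: bytes) -> Tuple[str, str]:
    """Try providers from highest to lowest uptime; return (provider name, result)."""
    errors: List[str] = []
    for name, _uptime, call in sorted(providers, key=lambda p: p[1], reverse=True):
        try:
            return name, call(audio)
        except Exception as exc:  # any failure triggers fallback to the next provider
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors) or "no providers configured")


# Usage with stand-in providers (no network calls are made).
def failing_provider(audio: bytes) -> str:
    raise TimeoutError("upstream timed out")


def working_provider(audio: bytes) -> str:
    return f"transcribed {len(audio)} bytes"


chosen, result = route_with_fallback(
    [("backup", 0.95, working_provider), ("primary", 0.99, failing_provider)],
    b"\x00" * 4,
)
```

With a single listed route, as here, there is nothing to fall back to, so a failure of that one provider surfaces directly to the caller.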