Whisper

flagship

OpenAI · released 2022-09-01 · text

currently routing · 4.2k rpm

1K tokens

Context

— / 1M

Input

— / 1M

Output

— t/s

Speed

proprietary

License

/ ABOUT

OpenAI Whisper is a general-purpose speech recognition model trained on 680,000 hours of multilingual and multitask supervised data. It supports transcription in over 50 languages, translation to English, language identification, and voice activity detection, making it one of the most versatile open speech recognition systems available.

The model uses a simple encoder-decoder Transformer architecture trained on diverse audio quality, accents, and background noise conditions, resulting in robust performance across real-world scenarios. It approaches human-level accuracy on many transcription benchmarks, particularly for English.

Whisper has become the foundation for countless speech recognition applications, from automated transcription services to voice assistants, and is available in multiple model sizes for different speed-accuracy tradeoffs.

Providers for Whisper

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider

Context

Quant

Uptime · 30d

Cloudflare Workers AI

—

bf16

0.00%