Skip to content
pipecat

Pipecat Smart Turn v2 (VAD)

flagship
pipecat · released 2024-06-01 · text
currently routing · 4.2k rpm
1K tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
open
License
/ ABOUT

Pipecat Smart Turn v2 is a Voice Activity Detection (VAD) model that determines when a speaker has finished their turn in a conversation. It goes beyond simple silence detection to understand conversational boundaries, distinguishing between pauses within speech and actual turn endings.

The model analyzes audio patterns to predict whether a speaker is likely to continue or has completed their thought, reducing premature interruptions in voice AI applications. It handles natural speech patterns including hesitations, filler words, and mid-sentence pauses more accurately than threshold-based VAD systems.

Smart Turn v2 is designed for real-time voice assistant and conversational AI applications where natural turn-taking is essential for a good user experience.

Providers for Pipecat Smart Turn v2 (VAD)

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
bf16
0.00%