Deepgram only. Other engines ignore this parameter.
100 ms.
Values
| Value | Behavior |
|---|---|
| Integer (ms) | Finalize after this many ms of silence. Lower = faster but more splits. |
"false" | Disable endpointing entirely. No automatic utterance boundaries. |
Trade-offs
Low values (50–100 ms) — Fast response. Utterances may split mid-sentence on short pauses. High values (300–1000 ms) — More complete sentences. Higher latency before finalization. Disabled ("false") — No automatic splits. Use Finalize control messages to manually trigger boundaries, or rely on CloseStream for a single final transcript.
Interaction With Utterance End
When endpointing triggers, Deepgram sends the final transcript followed by an utterance end event (ifutterance_end_ms is configured server-side — currently 1000 ms).