Text-to-Speech Overview

1. Choose your interface

Real-time streaming. Send text, receive audio chunks as they’re synthesized.

HTTP POST. Get audio back as binary, base64, or async URL. OpenAI SDK compatible.

TTS during live calls via Call Control speak or TeXML <Say>.

Natural, NaturalHD, Ultra, Kokoro, Qwen3TTS, xAI Grok.

AWS Polly, Azure, ElevenLabs, Minimax, MurfAI, Rime, Resemble, Inworld.

Clone and design custom voices. Available on select providers: Qwen3TTS, Minimax, ElevenLabs, Resemble.

⌘I