Skip to main content
Voice format: Telnyx.Ultra.<voice> Sub-100ms latency. 36 languages.
REST only — Ultra is not available over public WebSocket.

Voice Samples

VoiceLanguageGenderSample
Telnyx.Ultra.AsherenMale
Telnyx.Ultra.CallieenFemale
Telnyx.Ultra.Claraen-USFemale

SSML Emotions

Ultra supports inline SSML emotion tags. Place the tag before the text:
<emotion value="excited" />Great news — your order shipped early!
Primary emotions: angry, excited, content, sad, scared. Additional: happy, enthusiastic, curious, calm, grateful, affectionate, sarcastic, surprised, confident, hesitant, apologetic, determined, frustrated, disappointed. Omitting the tag = neutral delivery. Use sparingly — Ultra interprets emotional subtext from the text itself.

Nonverbal Cues

Insert [laughter] inline for natural laughing:
That's hilarious! [laughter] Anyway, let me check your account.

Language Support

Set language_boost to improve pronunciation for the target language: Arabic, Bengali, Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Gujarati, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Malay, Marathi, Māori, Norwegian, Polish, Portuguese, Punjabi, Romanian, Russian, Slovak, Spanish, Swedish, Tamil, Telugu, Thai, Turkish, Ukrainian, Vietnamese.

REST API

Fields

FieldTypeDefaultDescription
voice_speedfloat1.0Speech rate multiplier.
language_booststringTarget language hint.
volumefloatOutput volume.
emotionstringneutral, happy, sad, angry, fearful, disgusted, surprised.
sampling_rateintegerOutput sample rate in Hz.
output_typestringbinary_outputbinary_output, base64_output, or audio_id.

Response

Default (binary_output): chunked audio bytes with Content-Type: audio/mpeg. With output_type: "base64_output": JSON with base64-encoded audio. With output_type: "audio_id": JSON with an audio_url for deferred retrieval.