Voice format: Telnyx.Qwen3TTS.<clone_name>

Qwen3TTS is a voice cloning model that supports 11 languages: en, zh, fr, de, it, ja, ko, pt, ru, es, ar. The voice_id (the <clone_name> segment) is the name of a clone you created in the Voice Design Lab. Clones are scoped to your organization.

Voice Samples

| Voice | Gender |
| --- | --- |
| Telnyx.Qwen3TTS.Delta | Female |
| Telnyx.Qwen3TTS.Whiskey | Male |

WebSocket

Query Parameters

wss://api.telnyx.com/v2/text-to-speech/speech?voice=Telnyx.Qwen3TTS.Delta
| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| audio_format | string | mp3 | mp3, linear16. |
| sample_rate | integer | 24000 | 24000. |
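
As a minimal sketch, the connection URL can be assembled from the voice format and the query parameters listed above. The helper name `build_tts_url` and its client-side validation are illustrative assumptions, not part of the Telnyx API; only the base URL, parameter names, and allowed values come from this page.

```python
from urllib.parse import urlencode

BASE_URL = "wss://api.telnyx.com/v2/text-to-speech/speech"
AUDIO_FORMATS = ("mp3", "linear16")  # values listed for audio_format
SAMPLE_RATES = (24000,)              # only value listed for sample_rate


def build_tts_url(clone_name, audio_format="mp3", sample_rate=24000):
    """Hypothetical helper: build the WebSocket URL for a Qwen3TTS clone."""
    if not clone_name.strip():
        raise ValueError("clone_name must not be empty")
    if audio_format not in AUDIO_FORMATS:
        raise ValueError(f"audio_format must be one of {AUDIO_FORMATS}")
    if sample_rate not in SAMPLE_RATES:
        raise ValueError(f"sample_rate must be one of {SAMPLE_RATES}")
    params = {
        "voice": f"Telnyx.Qwen3TTS.{clone_name}",
        "audio_format": audio_format,
        "sample_rate": sample_rate,
    }
    return f"{BASE_URL}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_tts_url("Delta", audio_format="linear16"))
```

Rejecting unsupported values before connecting is a design choice of this sketch; the server remains the authority on what it accepts.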

Voice Settings

Send in the init frame ({"text": " "}):
| Field | Type | Default | Description |
| --- | --- | --- | --- |
| language_boost | string | | Target language hint: Auto, English, Chinese, French, German, Italian, Japanese, Korean, Portuguese, Russian, Spanish, or ISO codes. |
| force_xvector | boolean | false | Force x-vector voice embedding. |
{
  "text": " ",
  "voice_settings": {
    "language_boost": "English"
  }
}
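
The init frame shown above could be generated programmatically along these lines. This is a sketch only: `build_init_frame` is a hypothetical helper, and placing `force_xvector` next to `language_boost` inside `voice_settings` is an assumption based on both fields being listed under Voice Settings.

```python
import json


def build_init_frame(language_boost=None, force_xvector=False):
    """Hypothetical helper: serialize the init frame as a JSON string."""
    settings = {}
    if language_boost is not None:
        settings["language_boost"] = language_boost
    if force_xvector:
        # Assumption: force_xvector lives in voice_settings like language_boost.
        settings["force_xvector"] = True

    frame = {"text": " "}  # the init frame carries a single space as text
    if settings:
        frame["voice_settings"] = settings
    return json.dumps(frame)


if __name__ == "__main__":
    print(build_init_frame(language_boost="English"))
```

The resulting string can be sent as the first text message with whichever WebSocket client you use.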

REST API

Fields

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| language_boost | string | | Target language hint. |
| force_xvector | boolean | false | Force x-vector voice embedding. |
| output_type | string | binary_output | binary_output, base64_output, or audio_id. |

Response

Default (binary_output): chunked PCM audio bytes. Always 24 kHz signed 16-bit little-endian mono.

With output_type: "base64_output": JSON with base64-encoded PCM.

With output_type: "audio_id": JSON with an audio_url for deferred retrieval.
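
Because the PCM stream has no header, most players will not open it directly. The sketch below wraps the raw bytes in a WAV container using the format stated above (24 kHz, signed 16-bit little-endian, mono). The function names are illustrative, and `decode_base64_pcm` takes the base64 string itself because the JSON key that carries it is not specified on this page.

```python
import base64
import io
import wave

SAMPLE_RATE = 24000  # Hz
SAMPLE_WIDTH = 2     # bytes per sample (signed 16-bit)
CHANNELS = 1         # mono


def decode_base64_pcm(encoded):
    """Decode a base64 string (from a base64_output response) to raw PCM."""
    return base64.b64decode(encoded)


def pcm_to_wav(pcm):
    """Wrap raw PCM bytes in a WAV container and return the WAV bytes."""
    if len(pcm) % (SAMPLE_WIDTH * CHANNELS) != 0:
        raise ValueError("PCM length is not a whole number of samples")
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)
    return buffer.getvalue()


def pcm_duration_seconds(pcm):
    """Playback duration implied by the byte count."""
    return len(pcm) / (SAMPLE_RATE * SAMPLE_WIDTH * CHANNELS)


if __name__ == "__main__":
    silence = b"\x00\x00" * SAMPLE_RATE  # one second of silence
    with open("out.wav", "wb") as f:
        f.write(pcm_to_wav(silence))
    print(pcm_duration_seconds(silence))
```

For a chunked binary_output response, concatenate the chunks before calling `pcm_to_wav`, since the WAV header needs the total length.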