Qwen3TTS

Voice format: Telnyx.Qwen3TTS.<clone_name> Voice cloning model. 11 languages: en, zh, fr, de, it, ja, ko, pt, ru, es, ar. The voice_id is the name of a clone you created in the Voice Design. Clones are scoped to your organization.

Voice Samples

Voice	Gender	Sample
`Telnyx.Qwen3TTS.Delta`	Female
`Telnyx.Qwen3TTS.Whiskey`	Male

WebSocket

Query Parameters

wss://api.telnyx.com/v2/text-to-speech/speech?voice=Telnyx.Qwen3TTS.Delta

Parameter	Type	Default	Description
`audio_format`	string	`mp3`	`mp3`, `linear16`.
`sample_rate`	integer	`24000`	24000.

Voice Settings

Send in the init frame ({"text": " "}):

Field	Type	Default	Description
`language_boost`	string	—	Target language hint: `Auto`, `English`, `Chinese`, `French`, `German`, `Italian`, `Japanese`, `Korean`, `Portuguese`, `Russian`, `Spanish`, or ISO codes.
`force_xvector`	boolean	`false`	Force x-vector voice embedding.

{
  "text": " ",
  "voice_settings": {
    "language_boost": "English"
  }
}

REST API

Fields

Field	Type	Default	Description
`language_boost`	string	—	Target language hint.
`force_xvector`	boolean	`false`	Force x-vector voice embedding.
`output_type`	string	`binary_output`	`binary_output`, `base64_output`, or `audio_id`.

Response

Default (binary_output): chunked PCM audio bytes. Always 24kHz signed 16-bit LE mono. With output_type: "base64_output": JSON with base64-encoded PCM. With output_type: "audio_id": JSON with an audio_url for deferred retrieval.

KokoroTTS

Ultra

⌘I

WebSocket Streaming

REST API

Providers

Other

API Reference

For AI Agents

Voice Samples

WebSocket

Query Parameters

Voice Settings

REST API

Fields

Response

​Voice Samples

​WebSocket

​Query Parameters

​Voice Settings

​REST API

​Fields

​Response

Voice Samples

WebSocket

Query Parameters

Voice Settings

REST API

Fields

Response