Voice format: Telnyx.Sukhan.<voice_id>
Urdu-only voice model, 14 curated voices. No prosody controls (speed/pitch) or language selection — each voice speaks Urdu only. Native audio is 22050Hz.
Voice Samples
| Voice | Gender | Accent |
|---|
Telnyx.Sukhan.urdu-professor | Male | Standard |
Telnyx.Sukhan.news-reader | Female | Standard |
Telnyx.Sukhan.sindhi-networker | Male | Sindhi |
Telnyx.Sukhan.podcast-host | Male | Standard |
Browse the full 14-voice catalogue via the Voices API.
WebSocket
Query Parameters
wss://api.telnyx.com/v2/text-to-speech/speech?voice=Telnyx.Sukhan.urdu-professor
| Parameter | Type | Default | Description |
|---|
audio_format | string | mp3 | mp3, linear16. wav is not supported. |
sample_rate | integer | 22050 | Only 22050 (native) is supported. |
Voice Settings
None. No prosody controls are exposed — voice_speed and other settings have no effect if sent. The init frame only needs {"text": " "}.
REST API
Fields
| Field | Type | Default | Description |
|---|
response_format | string | native | pcm, mp3. wav is not supported. |
sampling_rate | integer | 22050 | Only 22050 (native) is supported. |
output_type | string | binary_output | binary_output, base64_output, or audio_id. |
Response
Default (binary_output): chunked audio bytes, 22050Hz signed 16-bit LE mono (native PCM, transcoded to MP3 on request).
With output_type: "base64_output": JSON with base64-encoded audio.
With output_type: "audio_id": JSON with an audio_url for deferred retrieval.
sampling_rate/sample_rate values other than 22050 are rejected with a 400 error. Unsupported format values are also rejected — REST response_format accepts only pcm/mp3; WebSocket audio_format accepts only mp3/linear16 (its PCM equivalent). Neither accepts wav.