Voice format: Telnyx.Sukhan.<voice_id>

Sukhan is an Urdu-only voice model with 14 curated voices. It has no prosody controls (speed or pitch) and no language selection: each voice speaks Urdu only. Native audio is 22050 Hz.

Voice Samples

| Voice | Gender | Accent |
| --- | --- | --- |
| Telnyx.Sukhan.urdu-professor | Male | Standard |
| Telnyx.Sukhan.news-reader | Female | Standard |
| Telnyx.Sukhan.sindhi-networker | Male | Sindhi |
| Telnyx.Sukhan.podcast-host | Male | Standard |

Browse the full 14-voice catalogue via the Voices API.

WebSocket

Query Parameters

wss://api.telnyx.com/v2/text-to-speech/speech?voice=Telnyx.Sukhan.urdu-professor

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| audio_format | string | mp3 | mp3, linear16. wav is not supported. |
| sample_rate | integer | 22050 | Only 22050 (native) is supported. |
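
As a minimal sketch of how the query parameters above combine into a connection URL, the helper below builds the string and rejects unsupported values locally before any connection is attempted. The function name `build_sukhan_ws_url` and the constants are hypothetical; only the base URL, parameter names, and accepted values come from this page.

```python
from urllib.parse import urlencode

# Base URL and accepted values are taken from the table above.
BASE_URL = "wss://api.telnyx.com/v2/text-to-speech/speech"
SUPPORTED_AUDIO_FORMATS = {"mp3", "linear16"}
NATIVE_SAMPLE_RATE = 22050


def build_sukhan_ws_url(voice_id: str,
                        audio_format: str = "mp3",
                        sample_rate: int = NATIVE_SAMPLE_RATE) -> str:
    """Hypothetical helper: return a WebSocket URL for a Sukhan voice."""
    if audio_format not in SUPPORTED_AUDIO_FORMATS:
        raise ValueError(f"audio_format must be one of {sorted(SUPPORTED_AUDIO_FORMATS)}")
    if sample_rate != NATIVE_SAMPLE_RATE:
        raise ValueError(f"sample_rate must be {NATIVE_SAMPLE_RATE}")
    query = urlencode({
        "voice": f"Telnyx.Sukhan.{voice_id}",
        "audio_format": audio_format,
        "sample_rate": sample_rate,
    })
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    print(build_sukhan_ws_url("urdu-professor", audio_format="linear16"))
```

Validating on the client side is a design choice, not a requirement: it surfaces an unsupported `wav` or resampling request as a local exception instead of a server-side rejection.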

Voice Settings

None. No prosody controls are exposed — voice_speed and other settings have no effect if sent. The init frame only needs {"text": " "}.
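
The init frame described above can be sketched as follows. The helper name `build_init_frame` is hypothetical, and accepting then discarding extra keyword arguments is only an illustration of the "no effect if sent" behaviour, not something the API requires.

```python
import json


def build_init_frame(**ignored_settings) -> str:
    """Hypothetical helper: return the minimal init frame as a JSON string.

    Any settings passed in (for example voice_speed) are dropped, since the
    Sukhan model exposes no prosody controls.
    """
    if ignored_settings:
        names = ", ".join(sorted(ignored_settings))
        print(f"note: settings have no effect for Sukhan voices and were dropped: {names}")
    return json.dumps({"text": " "})


if __name__ == "__main__":
    print(build_init_frame())
    print(build_init_frame(voice_speed=1.2))
```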

REST API

Fields

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| response_format | string | native | pcm, mp3. wav is not supported. |
| sampling_rate | integer | 22050 | Only 22050 (native) is supported. |
| output_type | string | binary_output | binary_output, base64_output, or audio_id. |
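
A minimal sketch of assembling a request body from the fields above, with local validation. The function name is hypothetical, and the `voice` and `text` keys are assumptions (this section does not list them); only `response_format`, `sampling_rate`, and `output_type` and their accepted values come from the table. The sketch builds the body only and does not send it, since the REST endpoint is not given here.

```python
from typing import Optional

# Accepted values are taken from the Fields table above.
NATIVE_SAMPLING_RATE = 22050
REST_FORMATS = {"pcm", "mp3"}
OUTPUT_TYPES = {"binary_output", "base64_output", "audio_id"}


def build_sukhan_rest_payload(text: str,
                              voice_id: str,
                              response_format: Optional[str] = None,
                              output_type: str = "binary_output",
                              sampling_rate: int = NATIVE_SAMPLING_RATE) -> dict:
    """Hypothetical helper: build a request body for a Sukhan voice."""
    if response_format is not None and response_format not in REST_FORMATS:
        raise ValueError(f"response_format must be one of {sorted(REST_FORMATS)}")
    if output_type not in OUTPUT_TYPES:
        raise ValueError(f"output_type must be one of {sorted(OUTPUT_TYPES)}")
    if sampling_rate != NATIVE_SAMPLING_RATE:
        raise ValueError(f"sampling_rate must be {NATIVE_SAMPLING_RATE}")
    payload = {
        "voice": f"Telnyx.Sukhan.{voice_id}",  # assumed key name
        "text": text,                          # assumed key name
        "sampling_rate": sampling_rate,
        "output_type": output_type,
    }
    # Omit response_format when unset so the server-side default applies.
    if response_format is not None:
        payload["response_format"] = response_format
    return payload


if __name__ == "__main__":
    print(build_sukhan_rest_payload("سلام", "news-reader", response_format="mp3"))
```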

Response

- Default (binary_output): chunked audio bytes, 22050 Hz signed 16-bit LE mono (native PCM, transcoded to MP3 on request).
- With output_type: "base64_output": JSON with base64-encoded audio.
- With output_type: "audio_id": JSON with an audio_url for deferred retrieval.

sampling_rate/sample_rate values other than 22050 are rejected with a 400 error. Unsupported format values are also rejected: REST response_format accepts only pcm/mp3, and WebSocket audio_format accepts only mp3/linear16 (its PCM equivalent). Neither accepts wav.
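
Since wav is not accepted by either interface, one option is to request PCM and add the WAV header locally. The sketch below assumes the PCM bytes have already been collected into a single `bytes` object and uses the layout stated above (22050 Hz, signed 16-bit little-endian, mono). The function names are hypothetical; the `wave` and `io` modules are from the Python standard library.

```python
import io
import wave

# Native PCM layout as described in the Response section.
SAMPLE_RATE_HZ = 22050
SAMPLE_WIDTH_BYTES = 2  # signed 16-bit
CHANNELS = 1            # mono


def pcm_to_wav_bytes(pcm: bytes) -> bytes:
    """Wrap raw native PCM in a WAV container and return the file bytes."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(CHANNELS)
        wav_file.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav_file.setframerate(SAMPLE_RATE_HZ)
        wav_file.writeframes(pcm)
    return buffer.getvalue()


def pcm_duration_seconds(pcm: bytes) -> float:
    """Return the playback duration implied by the PCM byte count."""
    return len(pcm) / (SAMPLE_RATE_HZ * SAMPLE_WIDTH_BYTES * CHANNELS)


if __name__ == "__main__":
    one_second_of_silence = b"\x00\x00" * SAMPLE_RATE_HZ
    wav_bytes = pcm_to_wav_bytes(one_second_of_silence)
    print(len(wav_bytes), pcm_duration_seconds(one_second_of_silence))
```

The same byte arithmetic gives a quick sanity check on a download: at this layout, one second of audio is 44100 bytes of PCM.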