Voice format: Telnyx.Bayan.<speaker>
Bayan is an Arabic voice model with 113 speakers across 13 dialects (Modern Standard Arabic plus Egyptian, Emirati, Saudi, Jordanian, Iraqi, Lebanese, Syrian, Palestinian, Kuwaiti, Bahraini, Qatari, and Omani), plus a set of English speakers. Native audio is 16 kHz.
Voice Samples
| Voice | Dialect | Gender |
|---|---|---|
| Telnyx.Bayan.Ahmed | Egyptian | Male |
| Telnyx.Bayan.Alia | Emirati | Female |
| Telnyx.Bayan.Lana | Jordanian | Female |
| Telnyx.Bayan.Hind | Qatari | Female |
Browse the full 113-speaker catalogue via the Voices API.
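As a small illustration of the `Telnyx.Bayan.<speaker>` format, the sketch below composes and splits voice IDs. The helper names `bayan_voice_id` and `speaker_of` are hypothetical and not part of any Telnyx SDK; the code only manipulates strings and does not check that a speaker actually exists in the catalogue.

```python
VOICE_PREFIX = "Telnyx.Bayan."  # prefix taken from the documented voice format


def bayan_voice_id(speaker: str) -> str:
    """Compose a voice ID in the Telnyx.Bayan.<speaker> format (hypothetical helper)."""
    speaker = speaker.strip()
    if not speaker:
        raise ValueError("speaker name must not be empty")
    return VOICE_PREFIX + speaker


def speaker_of(voice_id: str) -> str:
    """Return the <speaker> part of a Bayan voice ID (hypothetical helper)."""
    if not voice_id.startswith(VOICE_PREFIX):
        raise ValueError(f"not a Bayan voice ID: {voice_id!r}")
    return voice_id[len(VOICE_PREFIX):]


if __name__ == "__main__":
    print(bayan_voice_id("Ahmed"))
    print(speaker_of("Telnyx.Bayan.Hind"))
```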
WebSocket
Query Parameters
wss://api.telnyx.com/v2/text-to-speech/speech?voice=Telnyx.Bayan.Ahmed
| Parameter | Type | Default | Description |
|---|---|---|---|
| audio_format | string | mp3 | mp3, linear16, wav. |
| sample_rate | integer | 16000 | Only 16000 (native) is supported. |
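The query parameters above can be assembled into a connection URL with the standard library. This is a minimal sketch: the function name `bayan_ws_url` is hypothetical, the client-side checks simply mirror the values listed in the table, and authentication and the WebSocket connection itself are out of scope.

```python
from urllib.parse import urlencode

WS_ENDPOINT = "wss://api.telnyx.com/v2/text-to-speech/speech"
AUDIO_FORMATS = {"mp3", "linear16", "wav"}  # values listed for audio_format
NATIVE_SAMPLE_RATE = 16000                  # the only supported sample_rate


def bayan_ws_url(voice: str, audio_format: str = "mp3",
                 sample_rate: int = NATIVE_SAMPLE_RATE) -> str:
    """Build the WebSocket URL for a Bayan voice (hypothetical helper)."""
    if audio_format not in AUDIO_FORMATS:
        raise ValueError(f"unsupported audio_format: {audio_format!r}")
    if sample_rate != NATIVE_SAMPLE_RATE:
        raise ValueError("Bayan only supports sample_rate=16000")
    query = urlencode({
        "voice": voice,
        "audio_format": audio_format,
        "sample_rate": sample_rate,
    })
    return f"{WS_ENDPOINT}?{query}"


if __name__ == "__main__":
    print(bayan_ws_url("Telnyx.Bayan.Ahmed", audio_format="linear16"))
```

Checking `sample_rate` before connecting is a design convenience: it surfaces an unsupported rate locally instead of waiting for the server to refuse it.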
Voice Settings
None. The init frame only needs {"text": " "}.
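For illustration, the init frame can be serialized with the standard `json` module. The function name `init_frame` is hypothetical, and the sketch covers only the init frame documented here; sending it requires a WebSocket client of your choice, which is not shown.

```python
import json


def init_frame() -> str:
    """Serialize the Bayan init frame; no voice settings are needed (hypothetical helper)."""
    return json.dumps({"text": " "})


if __name__ == "__main__":
    print(init_frame())
```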
REST API
Fields
| Field | Type | Default | Description |
|---|---|---|---|
| language | string | Speaker’s dialect | Language/dialect hint. |
| response_format | string | native | pcm, wav, mp3. |
| sampling_rate | integer | 16000 | Only 16000 (native) is supported. |
| output_type | string | binary_output | binary_output, base64_output, or audio_id. |
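The sketch below validates the fields from the table and collects them into a dictionary. The function name `bayan_rest_fields` is hypothetical, and the result covers only the fields documented here; it would need to be merged into whatever request body the REST endpoint expects, which this section does not specify.

```python
from typing import Optional

RESPONSE_FORMATS = {"pcm", "wav", "mp3"}
OUTPUT_TYPES = {"binary_output", "base64_output", "audio_id"}
NATIVE_SAMPLING_RATE = 16000


def bayan_rest_fields(language: Optional[str] = None,
                      response_format: Optional[str] = None,
                      sampling_rate: int = NATIVE_SAMPLING_RATE,
                      output_type: str = "binary_output") -> dict:
    """Validate and collect the documented Bayan REST fields (hypothetical helper)."""
    if response_format is not None and response_format not in RESPONSE_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if sampling_rate != NATIVE_SAMPLING_RATE:
        raise ValueError("Bayan only supports sampling_rate=16000")
    if output_type not in OUTPUT_TYPES:
        raise ValueError(f"unsupported output_type: {output_type!r}")

    fields = {"sampling_rate": sampling_rate, "output_type": output_type}
    # Optional fields are left out so the server-side defaults apply.
    if language is not None:
        fields["language"] = language
    if response_format is not None:
        fields["response_format"] = response_format
    return fields


if __name__ == "__main__":
    print(bayan_rest_fields(response_format="wav", output_type="audio_id"))
```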
Response
Default (binary_output): chunked audio bytes, 16kHz signed 16-bit LE mono (native PCM, transcoded to MP3/WAV on request).
With output_type: "base64_output": JSON with base64-encoded audio.
With output_type: "audio_id": JSON with an audio_url for deferred retrieval.
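Because the native binary output is headerless PCM (16 kHz, signed 16-bit little-endian, mono), it may be convenient to wrap it in a WAV container locally for playback or inspection. The following is a minimal sketch using Python's standard `wave` module; the function names are hypothetical, and it assumes the chunks of the response have already been concatenated into a single `bytes` object.

```python
import io
import wave

SAMPLE_RATE = 16000   # native Bayan rate
SAMPLE_WIDTH = 2      # signed 16-bit samples are 2 bytes each
CHANNELS = 1          # mono


def pcm_to_wav(pcm: bytes) -> bytes:
    """Wrap raw native PCM in a WAV container (hypothetical helper)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)
    return buf.getvalue()


def pcm_duration_seconds(pcm: bytes) -> float:
    """Duration implied by the byte count of native PCM."""
    return len(pcm) / (SAMPLE_RATE * SAMPLE_WIDTH * CHANNELS)


if __name__ == "__main__":
    silence = bytes(SAMPLE_RATE * SAMPLE_WIDTH)  # one second of silence
    print(pcm_duration_seconds(silence))
    print(len(pcm_to_wav(silence)))
```

At the native format, one second of audio is 32,000 bytes (16,000 samples at 2 bytes each), which is a quick way to estimate duration from a response size.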
sampling_rate/sample_rate values other than 16000 are rejected with a 400 error, because Bayan only supports its native rate.