azure.<VoiceId>
Example: azure.en-US-AvaMultilingualNeural
No model ID segment — Azure voices are flat identifiers. Default voice: en-US-AvaMultilingualNeural.
Voice Samples
| Voice | Language | Gender | Sample |
|---|---|---|---|
azure.en-US-AvaMultilingualNeural | en-US | Female | |
azure.en-US-AndrewMultilingualNeural | en-US | Male |
WebSocket
Query Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
audio_format | string | mp3 | mp3, wav, linear16, mulaw, alaw. |
sample_rate | integer | 24000 | 8000, 16000, 24000, 48000. |
language | string | en-US | BCP-47 language code. |
text_type | string | text | text or ssml. Azure supports SSML for pronunciation and prosody control. |
Voice Settings
| Field | Type | Default | Description |
|---|---|---|---|
output_format | string | audio-24khz-160kbitrate-mono-mp3 | See Azure audio formats. |
language_code | string | en-US | BCP-47. Overrides language query param. |
text_type | string | text | text or ssml. Overrides query param. |
effect | string | — | eq_car, eq_telecomhp8k. Audio equalization. |
gender | string | — | Male, Female. Voice gender filter. |
REST API
Fields
| Field | Type | Default | Description |
|---|---|---|---|
output_format | string | audio-24khz-160kbitrate-mono-mp3 | Azure audio format string. |
language_code | string | en-US | BCP-47 language code. |
text_type | string | text | text or ssml. |
effect | string | — | eq_car, eq_telecomhp8k. |
gender | string | — | Male, Female. |
output_type | string | binary_output | binary_output, base64_output, or audio_id. |
Response
Default (binary_output): chunked audio bytes.
With output_type: "base64_output": JSON with base64-encoded audio.
With output_type: "audio_id": JSON with an audio_url for deferred retrieval.