inworld.<Model>.<VoiceId>
Models: inworld-tts-1.5-mini (alias Mini — faster) and inworld-tts-1.5-max (alias Max — higher quality). Defaults to mini.
Voice Samples
| Voice | Model | Gender | Sample |
|---|---|---|---|
Inworld.Max.Hank | Max | Male | |
Inworld.Mini.Loretta | Mini | Female |
WebSocket
Query Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
audio_format | string | mp3 | mp3, linear16. |
sample_rate | integer | 24000 | 8000, 16000, 22050, 24000, 44100, 48000. |
language | string | — | BCP-47 language code. |
Voice Settings
| Field | Type | Default | Description |
|---|---|---|---|
encoding | string | MP3 | MP3 or LINEAR16. |
sample_rate | integer | 24000 | Output sample rate in Hz. |
language_code | string | — | BCP-47. Overrides language query param. |
REST API
Fields
| Field | Type | Default | Description |
|---|---|---|---|
encoding | string | MP3 | MP3 or LINEAR16. |
sample_rate | integer | 24000 | Output sample rate in Hz. |
language_code | string | — | BCP-47 language code. |
output_type | string | binary_output | binary_output, base64_output, or audio_id. |
Response
Default (binary_output): chunked audio bytes.
With output_type: "base64_output": JSON with base64-encoded audio.
With output_type: "audio_id": JSON with an audio_url for deferred retrieval.