Voice format: Telnyx.KokoroTTS.<voice>

Lightweight, lowest-latency model. Supports 5 languages: en, es, fr, it, pt.

Voice Samples

| Voice | Language | Gender |
| --- | --- | --- |
| Telnyx.KokoroTTS.af_heart | en-US | Female |
| Telnyx.KokoroTTS.am_adam | en-US | Male |
| Telnyx.KokoroTTS.bf_emma | en-UK | Female |

WebSocket

Query Parameters

wss://api.telnyx.com/v2/text-to-speech/speech?voice=Telnyx.KokoroTTS.af_heart

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| audio_format | string | mp3 | Allowed values: mp3, linear16. |
| sample_rate | integer | 24000 | Allowed values: 24000. |

Voice Settings

None. All synthesis parameters are fixed. The init frame only needs {"text": " "}.
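As a rough illustration of the query parameters and init frame described above, the following Python sketch builds the connection URL and the first frame. The helper names `build_ws_url` and `build_init_frame` are hypothetical, and the sketch only constructs strings: opening the socket, authenticating, and sending the text frames that follow are not covered in this section and are left out.

```python
import json
from urllib.parse import urlencode

# Endpoint taken from the example URL in this section.
BASE_URL = "wss://api.telnyx.com/v2/text-to-speech/speech"

# Values listed in the query-parameter table.
ALLOWED_AUDIO_FORMATS = {"mp3", "linear16"}
ALLOWED_SAMPLE_RATES = {24000}


def build_ws_url(voice: str, audio_format: str = "mp3", sample_rate: int = 24000) -> str:
    """Return the WebSocket URL for a short voice name such as "af_heart"."""
    if audio_format not in ALLOWED_AUDIO_FORMATS:
        raise ValueError(f"unsupported audio_format: {audio_format!r}")
    if sample_rate not in ALLOWED_SAMPLE_RATES:
        raise ValueError(f"unsupported sample_rate: {sample_rate!r}")
    query = urlencode(
        {
            "voice": f"Telnyx.KokoroTTS.{voice}",
            "audio_format": audio_format,
            "sample_rate": sample_rate,
        }
    )
    return f"{BASE_URL}?{query}"


def build_init_frame() -> str:
    """Return the init frame; no voice settings exist for this model."""
    return json.dumps({"text": " "})


if __name__ == "__main__":
    print(build_ws_url("af_heart", audio_format="linear16"))
    print(build_init_frame())
```

Since both parameters have defaults, passing them explicitly is optional; the sketch includes them only to make the chosen format visible in the URL.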

REST API

Fields

No model-specific fields. Audio format is always MP3.
| Field | Type | Default | Description |
| --- | --- | --- | --- |
| output_type | string | binary_output | binary_output, base64_output, or audio_id. |

Response

By default (output_type: "binary_output"), the response is chunked audio bytes with Content-Type: audio/mpeg. With output_type: "base64_output", the response is JSON containing base64-encoded audio. With output_type: "audio_id", the response is JSON containing an audio_url for deferred retrieval.
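The three response shapes can be handled with a small dispatcher. The Python sketch below is one possible approach, not the official client: the function name `decode_tts_response` is hypothetical, the JSON key for the base64 payload is not named in this section (so `base64_field` and its default are placeholders to be replaced with the real key), and `audio_url` is assumed to sit at the top level of the JSON body.

```python
import base64
import json

VALID_OUTPUT_TYPES = {"binary_output", "base64_output", "audio_id"}


def decode_tts_response(output_type: str, body: bytes, base64_field: str = "base64_audio") -> dict:
    """Normalize a response body into {"audio": bytes | None, "audio_url": str | None}.

    base64_field is a placeholder: the actual JSON key is an assumption.
    """
    if output_type not in VALID_OUTPUT_TYPES:
        raise ValueError(f"unknown output_type: {output_type!r}")

    if output_type == "binary_output":
        # Body is already MP3 data (Content-Type: audio/mpeg).
        return {"audio": body, "audio_url": None}

    payload = json.loads(body)

    if output_type == "base64_output":
        # Assumed layout: the base64 string sits under base64_field.
        return {"audio": base64.b64decode(payload[base64_field]), "audio_url": None}

    # output_type == "audio_id": audio is fetched later from the returned URL.
    # Assumed layout: audio_url is a top-level key.
    return {"audio": None, "audio_url": payload["audio_url"]}
```

For binary_output the body is passed in after the chunks have been collected; a streaming caller would instead write each chunk to its sink as it arrives.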