Creates a new voice clone by uploading an audio file directly. Supported formats: WAV, MP3, FLAC, OGG, M4A. For best results, provide 5–10 seconds of clear speech. Maximum file size: 2MB.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Multipart form data for creating a voice clone from a direct audio upload.
Audio file to clone the voice from. Supported formats: WAV, MP3, FLAC, OGG, M4A. For best quality, provide 5–10 seconds of clear, uninterrupted speech. Maximum size: 2MB.
Name for the voice clone.
255ISO 639-1 language code (e.g. en, fr) or auto for automatic detection.
^([Aa]uto|[a-z]{2,3}(-[A-Za-z0-9]{2,8})*)$Gender of the voice clone.
male, female, neutral Optional transcript of the audio file. Providing this improves clone quality.
Optional custom label describing the voice style. If omitted, falls back to the source design's prompt text.
Voice clone created successfully from the uploaded audio.
Response envelope for a single voice clone.
A voice clone object.