Creates a new voice design (version 1) when voice_design_id is omitted. When voice_design_id is provided, adds a new version to the existing design instead. A design can have at most 50 versions.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Request body for creating a new voice design or adding a version to an existing one. Omit voice_design_id to create a new design; include it to add a new version.
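The create-versus-add-version rule above can be sketched as a small request-body builder. This is a minimal illustration, not the service's client: the field names `text`, `prompt`, and `name` are hypothetical (only `voice_design_id` is named in this reference), and the 1 to 255 length check assumes that limit applies to the design name.

```python
import uuid
from typing import Optional


def looks_like_uuid(value: str) -> bool:
    """Return True if the string parses as a UUID."""
    try:
        uuid.UUID(value)
    except ValueError:
        return False
    return True


def build_voice_design_body(
    text: str,
    prompt: str,
    name: Optional[str] = None,
    voice_design_id: Optional[str] = None,
) -> dict:
    """Build a request body; "text", "prompt", and "name" are assumed field names."""
    body = {"text": text, "prompt": prompt}

    if voice_design_id is not None:
        # Adding a version to an existing design: the name is ignored, so omit it.
        body["voice_design_id"] = voice_design_id
        return body

    # Creating a new design (version 1): a name is required.
    if name is None:
        raise ValueError("name is required when voice_design_id is omitted")
    if not 1 <= len(name) <= 255:  # assumed to be the name length limit
        raise ValueError("name must be 1 to 255 characters long")
    if looks_like_uuid(name):
        raise ValueError("name cannot be a UUID")

    body["name"] = name
    return body
```

Calling `build_voice_design_body("Hello there.", "Warm, friendly tone", name="Narrator")` yields a body for a new design, while passing `voice_design_id` yields a body that adds a version and leaves `name` out.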
Sample text to synthesize for this voice design.
Natural language description of the voice style, e.g. 'Speak in a warm, friendly tone with a slight British accent'.
Name for the voice design. Required when creating a new design (voice_design_id is not provided); ignored when adding a version. Cannot be a UUID. Length: 1 - 255 characters.
ID of an existing voice design to add a new version to. When provided, a new version is created instead of a new design.
Language for synthesis. Supported values: Auto, Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, Italian. Defaults to Auto.
Sampling temperature controlling randomness. Higher values produce more varied output. Default: 0.9. Range: 0 <= x <= 2.
Top-k sampling parameter: limits the token vocabulary considered at each step. Default: 50. Range: 1 <= x <= 1000.
Top-p (nucleus) sampling parameter: cumulative probability cutoff for token selection. Default: 1.0. Range: 0 <= x <= 1.
Repetition penalty to reduce repeated patterns in generated audio. Default: 1.05. Range: 1 <= x <= 2.
Maximum number of tokens to generate. Default: 2048. Range: 100 <= x <= 4096.
Voice design created or new version added successfully.
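The defaults and ranges given for the sampling parameters can be checked on the client before a request is sent. The sketch below is only an illustration: the keys `temperature`, `top_k`, `top_p`, `repetition_penalty`, and `max_new_tokens` are hypothetical names, since this reference does not show the actual field names, and the server remains the authority on validation.

```python
# (minimum, maximum, default) per parameter, taken from the field descriptions.
# The dictionary keys are assumed names, not confirmed API field names.
SAMPLING_LIMITS = {
    "temperature": (0, 2, 0.9),
    "top_k": (1, 1000, 50),
    "top_p": (0, 1, 1.0),
    "repetition_penalty": (1, 2, 1.05),
    "max_new_tokens": (100, 4096, 2048),
}


def resolve_sampling_params(overrides=None):
    """Merge overrides with the documented defaults and reject out-of-range values."""
    overrides = overrides or {}

    unknown = set(overrides) - set(SAMPLING_LIMITS)
    if unknown:
        raise ValueError(f"unknown sampling parameters: {sorted(unknown)}")

    resolved = {}
    for key, (low, high, default) in SAMPLING_LIMITS.items():
        value = overrides.get(key, default)
        if not low <= value <= high:
            raise ValueError(f"{key}={value} is outside the range {low} to {high}")
        resolved[key] = value
    return resolved
```

For example, `resolve_sampling_params({"temperature": 0.7})` keeps the other four defaults, while `resolve_sampling_params({"top_p": 1.5})` raises `ValueError`.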
Response envelope for a single voice design with full version detail.
A voice design object with full version detail.