Transcribe speech to text (BETA)
POST /ai/audio/transcriptions
Transcribe speech to text. This endpoint is consistent with the OpenAI Transcription API and may be used with the OpenAI JS or Python SDK.
Request
- multipart/form-data
Body
required
The audio file object to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm. File uploads are limited to 100 MB.
Possible values: [distil-whisper/distil-large-v2]
ID of the model to use. Only distil-whisper/distil-large-v2 is currently available.
Possible values: [json, verbose_json]
Default value: json
The format of the transcript output. Use verbose_json to take advantage of timestamps.
Possible values: [segment]
The timestamp granularities to populate for this transcription. response_format must be set to verbose_json to use timestamp granularities. Currently, only segment is supported.
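The request constraints above can be checked on the client before uploading. The following Python sketch is illustrative, not an official client. It assumes the form fields use the OpenAI Transcription API names (file, model, response_format, timestamp_granularities), which this page does not list, and that 100 MB means 100 * 1024 * 1024 bytes. The helper names, the extension-based format check, and the base_url argument are hypothetical. The transcribe function relies on the OpenAI Python SDK, which appends /audio/transcriptions to the base URL, so the base URL should end in /ai.

```python
import os
from typing import Optional

# Values copied from the parameter descriptions above.
ALLOWED_EXTENSIONS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}
MAX_UPLOAD_BYTES = 100 * 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes
MODEL = "distil-whisper/distil-large-v2"


def check_audio_file(path: str, size_bytes: Optional[int] = None) -> None:
    """Raise ValueError if the extension or size breaks the documented limits.

    The format is inferred from the file extension only, which is an
    approximation; the server may inspect the actual content.
    """
    ext = os.path.splitext(path)[1].lstrip(".").lower()
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {ext!r}")
    size = os.path.getsize(path) if size_bytes is None else size_bytes
    if size > MAX_UPLOAD_BYTES:
        raise ValueError(f"file is {size} bytes; uploads are limited to 100 MB")


def build_transcription_params(response_format: str = "json",
                               segment_timestamps: bool = False) -> dict:
    """Build the non-file fields (field names assumed to match the OpenAI API)."""
    if response_format not in ("json", "verbose_json"):
        raise ValueError("response_format must be 'json' or 'verbose_json'")
    params = {"model": MODEL, "response_format": response_format}
    if segment_timestamps:
        # Timestamp granularities are only populated for verbose_json output.
        if response_format != "verbose_json":
            raise ValueError("segment timestamps require response_format='verbose_json'")
        params["timestamp_granularities"] = ["segment"]
    return params


def transcribe(path: str, api_key: str, base_url: str, **options):
    """Send the file through the OpenAI Python SDK (base_url is a placeholder)."""
    from openai import OpenAI  # imported lazily so the helpers above work without the SDK

    check_audio_file(path)
    client = OpenAI(api_key=api_key, base_url=base_url)
    with open(path, "rb") as audio:
        return client.audio.transcriptions.create(
            file=audio, **build_transcription_params(**options)
        )
```

For example, transcribe("call.wav", api_key, base_url, response_format="verbose_json", segment_timestamps=True) would request segment timestamps, while omitting the options falls back to the json default.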
Responses
200: Successful Response
- application/json
422: Validation Error
- application/json
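Both documented responses carry a JSON body, so a caller that uses a plain HTTP client can branch on the status code. The sketch below shows one way to do that in Python. The exception class and function name are hypothetical, and because the response schemas are not described here, the code returns or attaches the decoded JSON without assuming any particular fields.

```python
import json


class TranscriptionValidationError(Exception):
    """Hypothetical exception raised for a 422 Validation Error response."""

    def __init__(self, detail):
        super().__init__(f"validation error: {detail}")
        self.detail = detail  # decoded JSON body, schema not assumed


def parse_transcription_response(status: int, body: bytes):
    """Return the decoded JSON for a 200, raise for a 422 or any other status."""
    if status == 200:
        return json.loads(body.decode("utf-8"))
    if status == 422:
        raise TranscriptionValidationError(json.loads(body.decode("utf-8")))
    # Other status codes are not documented in this section.
    raise RuntimeError(f"unexpected status {status}")
```

Keeping the 422 body on the exception lets the caller log or inspect the validation details without guessing at their structure.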