POST /v2/ai/audio/transcriptions
Synchronous file transcription. Upload audio or pass a URL, get text back.
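
As a rough illustration of the URL path, the sketch below builds (but does not send) a request to this endpoint using only the Python standard library. Several details are assumptions, not confirmed by this page: the host `https://api.example.com` is a placeholder, `file_url` and `model` are assumed to be sent as multipart form fields (the format the OpenAI SDK uses for transcriptions), and `build_url_transcription_request` is a hypothetical helper name. Bearer authentication follows from the OpenAI SDK compatibility noted in the table below.

```python
import uuid
import urllib.request

# Hypothetical host: replace with your provider's actual base URL.
BASE_URL = "https://api.example.com"
ENDPOINT = "/v2/ai/audio/transcriptions"


def build_url_transcription_request(api_key, file_url, model):
    """Build (without sending) a POST request asking for a remote file to be transcribed.

    Assumption: the service accepts `file_url` and `model` as multipart form fields.
    """
    boundary = uuid.uuid4().hex
    fields = {"file_url": file_url, "model": model}

    # Encode each text field as one multipart/form-data part.
    lines = []
    for name, value in fields.items():
        lines.append(f"--{boundary}")
        lines.append(f'Content-Disposition: form-data; name="{name}"')
        lines.append("")
        lines.append(value)
    lines.append(f"--{boundary}--")
    lines.append("")
    body = "\r\n".join(lines).encode("utf-8")

    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. Because the call is synchronous, the response body should contain the transcript once it returns.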
Feature Support
If you’re coming from alternative providers:

| Feature | Status |
|---|---|
| OpenAI SDK compatible | Yes — swap base_url and api_key, existing code works |
| Multi-engine selection | Yes — 3 models behind one endpoint |
| File upload | Yes |
| URL transcription | Yes (file_url) |
| Timestamps (segment) | Yes (verbose_json) |
| Timestamps (word-level) | Deepgram only (via model_config) |
| Diarization | Deepgram only (via model_config) |
| Smart formatting | Deepgram only (via model_config) |
| Multilingual | Model-dependent — distil-whisper: English only, whisper-turbo: 80+ languages, Deepgram: English only |
| Async / webhooks | No |
| Multichannel | No (forced mono) |
| Export formats (SRT/VTT) | No |
| Audio event tagging | No |
| YouTube/TikTok URL | No |
| Transcript retrieval | No |
| File size limit | 100 MB |
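
Some of the limits in the table can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: `check_request` is a hypothetical helper, "100 MB" is assumed to mean 100 × 1024 × 1024 bytes, and Deepgram models are assumed to be identifiable by the substring `deepgram` in the model name. The actual model identifiers and the keys accepted inside `model_config` are not specified on this page.

```python
import os

# Assumption: "100 MB" is read as 100 MiB; the page does not say which unit is meant.
MAX_FILE_BYTES = 100 * 1024 * 1024
UNSUPPORTED_FORMATS = {"srt", "vtt"}  # export formats the table lists as unsupported


def check_request(model, file_path=None, file_url=None,
                  response_format="json", model_config=None):
    """Return a list of problems found before sending; an empty list means none found."""
    problems = []

    # The endpoint takes either an uploaded file or a URL.
    if (file_path is None) == (file_url is None):
        problems.append("provide exactly one of file_path or file_url")

    if file_path is not None and os.path.getsize(file_path) > MAX_FILE_BYTES:
        problems.append("file exceeds the 100 MB limit")

    if response_format in UNSUPPORTED_FORMATS:
        problems.append("SRT/VTT export is not supported")

    # Word-level timestamps, diarization, and smart formatting are Deepgram only,
    # all via model_config. Assumption: the model name contains "deepgram".
    if model_config and "deepgram" not in model.lower():
        problems.append("model_config features require a Deepgram model")

    return problems


# Usage: segment timestamps via verbose_json work with any model.
print(check_request("whisper-turbo", file_url="https://example.com/a.mp3",
                    response_format="verbose_json"))
```

A check like this only mirrors the table; the server remains the authority on what it accepts.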