Audio
Speech to Text
Transcribe audio files to text.
POST
Speech to Text
Transcribe an audio file to text. Uses multipart form upload (not JSON). Maximum file size: 25 MB.
Authentication
Requires a Bearer token. See Authentication.Request Body (multipart/form-data)
Audio file to transcribe. Supported formats: MP3, MP4, MPEG, MPGA, M4A, WAV, WEBM.
Language hint (ISO 639-1 code, e.g.,
"en", "es", "fr"). If omitted, language is auto-detected.Request Examples
Response
Unique transcription ID (prefixed
stt_).ISO 8601 timestamp.
The transcribed text.
Detected or specified language.
Duration of the audio file in seconds.
Example Response
Errors
| Status | Code | Description |
|---|---|---|
| 400 | missing_file | file is required (multipart form upload) |
| 401 | invalid_api_key | Missing or invalid API key |
| 502 | stt_provider_error | Theo transcription engine returned an error — retry |
| 503 | stt_unavailable | STT is not configured on this instance |
Speech to Text
