Audio
Text to Speech
Convert text to spoken audio (returns MP3 binary).
POST
Text to Speech
Convert text to speech. Returns raw audio bytes (
audio/mpeg) — not JSON.
The response is a binary MP3 file, not a JSON object. The
Content-Type header is audio/mpeg.Authentication
Requires a Bearer token. See Authentication.Request Body
Text to convert to speech. Maximum 4,096 characters.
Optional Theo voice identifier. Omit to use the default voice. Unknown values fall back to
theo-voice-classic.| Voice | Character |
|---|---|
theo-voice-classic | Neutral, versatile — default |
theo-voice-bright | Bright, clear, upbeat |
theo-voice-storyteller | Narrative, expressive |
theo-voice-deep | Deep, authoritative |
theo-voice-warm | Warm, professional |
theo-voice-soft | Soft, thoughtful |
Playback speed multiplier (0.25–4.0).
Request Examples
Response
The response body is raw MP3 audio bytes.| Header | Value |
|---|---|
Content-Type | audio/mpeg |
Content-Length | File size in bytes |
X-Request-Id | Unique request ID (prefixed tts_) |
Errors
| Status | Code | Description |
|---|---|---|
| 400 | missing_text | text is required |
| 401 | invalid_api_key | Missing or invalid API key |
| 502 | tts_provider_error | Theo voice engine returned an error — retry |
| 503 | tts_unavailable | TTS is not configured on this instance |
Text to Speech
