Hume AI Octave TTS
"Text-to-speech with emotional intelligence. Generate expressive, natural-sounding speech that conveys the full range of human emotion." [1]
Hume AI Octave is a text-to-speech API focused on emotionally expressive, natural-sounding voice synthesis, targeting voice agents, audiobooks, podcasts, and conversational applications. Usage rates go as low as $50 per million characters (the Business plan overage rate), with a free tier of 10,000 characters per month, self-serve signup, and an enterprise plan for higher volume. SDKs are available for Python, TypeScript, C#/.NET, and Swift, and the API supports WebSocket streaming with first-audio latency of roughly 100 ms on Octave 2. The service reports a SOC 2 Type II attestation along with HIPAA and GDPR compliance.
Best for
- Prototypes and side projects: free to start, no sales call
- Regulated or enterprise workloads: compliance attestations and an enterprise plan
- AI agents and automation: an MCP server is available (no llms.txt is published)
Pricing & procurement
- Pricing model
- Hybrid (base + usage) [2]
- Published pricing
- ✓ Yes
- Free tier
- ✓ Yes [3]
- Free tier details
- Free plan at $0/month includes 10,000 characters (~10 minutes) per month; non-commercial use only; rate limited to 15 RPM. [4]
- Self-serve signup
- ✓ Yes [5]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [6]
| Plan (speech synthesis) | Monthly price | Included characters per month | Overage per 1,000 characters |
|---|---|---|---|
| Free | $0 | 10,000 | $0.15 |
| Starter | $3 | 30,000 | $0.15 |
| Creator | $7 | 140,000 | $0.15 |
| Pro | $70 | 1,000,000 | $0.12 |
| Scale | $200 | 3,300,000 | $0.10 |
| Business | $500 | 10,000,000 | $0.05 |
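
The plan figures above combine a flat monthly fee with a per-1,000-character overage rate, so the cheapest plan depends on volume. The sketch below works through that arithmetic using the numbers from the pricing table. It assumes overage is billed pro rata per character (the table does not say whether usage is rounded up to whole 1,000-character blocks), and the names `Plan`, `estimate_monthly_cost`, and `cheapest_plan` are illustrative, not part of any Hume SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    monthly_usd: float         # flat monthly fee
    included_chars: int        # characters included per month
    overage_per_1k_usd: float  # price per 1,000 characters beyond the included amount


# Figures copied from the pricing table in this profile.
PLANS = {
    "Free": Plan(0, 10_000, 0.15),
    "Starter": Plan(3, 30_000, 0.15),
    "Creator": Plan(7, 140_000, 0.15),
    "Pro": Plan(70, 1_000_000, 0.12),
    "Scale": Plan(200, 3_300_000, 0.10),
    "Business": Plan(500, 10_000_000, 0.05),
}


def estimate_monthly_cost(plan_name: str, chars_used: int) -> float:
    """Base fee plus overage for the characters beyond the included amount."""
    plan = PLANS[plan_name]
    overage_chars = max(0, chars_used - plan.included_chars)
    # Assumption: overage is charged pro rata, not in whole 1,000-character blocks.
    cost = plan.monthly_usd + overage_chars / 1000 * plan.overage_per_1k_usd
    return round(cost, 2)


def cheapest_plan(chars_used: int) -> str:
    """Name of the plan with the lowest estimated cost at this volume."""
    return min(PLANS, key=lambda name: estimate_monthly_cost(name, chars_used))


if __name__ == "__main__":
    for volume in (50_000, 500_000, 2_000_000):
        name = cheapest_plan(volume)
        print(volume, name, estimate_monthly_cost(name, volume))
```

The comparison ignores licensing terms: the profile lists Free and Starter as non-commercial only, so a commercial project would need to exclude those two before comparing.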
Capabilities
- Supported actions
- synthesize_speech, streaming_tts, websocket_streaming, instant_voice_cloning, voice_design, word_timestamps, phoneme_timestamps, acting_instructions, multilingual_synthesis, continuation_context, audio_normalization, speed_control, multi_generation_per_request [7]
- Regions
- US
- Languages
- English, Spanish, Japanese, Korean, French, Portuguese, Italian, German, Russian, Hindi, Arabic [8]
- Input types
- plain text, utterance objects with acting instructions/description [9]
- Output types
- mp3, wav, pcm [10]
- Webhooks
- ✗ No
- Sandbox / test mode
- ✗ No
- SDK languages
- Python, TypeScript/Node.js, C#/.NET, Swift, Node.js CLI [11]
- MCP server
- ✓ Yes [12]
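
To make the input and output types above concrete, here is a minimal sketch of how a request body with one utterance and an acting description might be assembled. The JSON field names (`utterances`, `text`, `description`, `format`, `num_generations`) are assumptions that this profile does not confirm, and `build_tts_request` is a hypothetical helper; check the API reference before relying on any of them. Nothing is sent over the network.

```python
import json

# Output formats listed in this profile.
SUPPORTED_FORMATS = {"mp3", "wav", "pcm"}


def build_tts_request(text, description=None, audio_format="mp3", num_generations=1):
    """Assemble a JSON-serializable body for one utterance.

    All field names here are assumed for illustration, not confirmed by this profile.
    """
    if audio_format not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported output format: {audio_format!r}")
    utterance = {"text": text}
    if description:
        # Free-text acting instructions, e.g. tone or pacing.
        utterance["description"] = description
    return {
        "utterances": [utterance],
        "format": {"type": audio_format},
        "num_generations": num_generations,
    }


if __name__ == "__main__":
    body = build_tts_request(
        "Welcome back. Shall we pick up where we left off?",
        description="warm, unhurried, slightly amused",
    )
    print(json.dumps(body, indent=2))
```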
Trust & compliance
- SOC 2
- SOC 2 Type II [13]
- HIPAA
- ✓ Yes [14]
- GDPR
- ✓ Yes [15]
- ISO 27001
- – Unknown
- PCI DSS
- – Unknown
- Published SLA
- ✗ No [16]
- Rate limits
- Rate limits are per subscription tier (RPM): Free/Starter: 15 RPM; Creator: 75 RPM; Pro: 75 RPM; Scale: 150 RPM; Business: 225 RPM; Enterprise: custom. First audio latency (TTFB): Octave 1 ~200ms; Octave 2 ~100ms; instant mode typically ~200ms. [17]
- Known restrictions
- Maximum 5,000 characters per Utterance; maximum 1,000 characters per description per Utterance; maximum 5 generations per request; speed control range: 0.75x–1.5x multiplier; Free and Starter plans: non-commercial use only; voice design (custom voice from description) is English only in Octave 2 (multilingual coming soon); voice cloning requires Octave 2; instant mode is unavailable for non-streaming endpoints; requests using Octave 2 without a voice will be rejected; users grant Hume a perpetual license to use voice recordings and voice models for service provision and product development; HIPAA-covered entities require an executed Business Associate Agreement (BAA) [18]
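
The numeric limits in the known restrictions can be checked on the client before a request is sent. The sketch below encodes the four limits stated above (5,000 characters per utterance, 1,000 per description, 5 generations per request, speed between 0.75 and 1.5) and adds a simple whitespace splitter for long text. The dictionary keys `text`, `description`, and `speed` are assumed names, and both functions are hypothetical helpers rather than SDK calls. The limits are treated as inclusive, which the profile does not state explicitly.

```python
MAX_TEXT_CHARS = 5_000
MAX_DESCRIPTION_CHARS = 1_000
MAX_GENERATIONS = 5
MIN_SPEED, MAX_SPEED = 0.75, 1.5


def check_request(utterances, num_generations=1):
    """Return a list of problems; an empty list means the request is within limits."""
    problems = []
    if not 1 <= num_generations <= MAX_GENERATIONS:
        problems.append(f"num_generations must be 1..{MAX_GENERATIONS}")
    for i, utterance in enumerate(utterances):
        text = utterance.get("text", "")
        description = utterance.get("description") or ""
        speed = utterance.get("speed", 1.0)
        if len(text) > MAX_TEXT_CHARS:
            problems.append(f"utterance {i}: text has {len(text)} characters")
        if len(description) > MAX_DESCRIPTION_CHARS:
            problems.append(f"utterance {i}: description has {len(description)} characters")
        if not MIN_SPEED <= speed <= MAX_SPEED:
            problems.append(f"utterance {i}: speed {speed} is outside {MIN_SPEED}..{MAX_SPEED}")
    return problems


def split_text(text, limit=MAX_TEXT_CHARS):
    """Greedily pack whitespace-separated words into chunks of at most `limit` characters.

    Runs of whitespace are collapsed to single spaces in the output.
    """
    chunks, current = [], ""
    for word in text.split():
        if len(word) > limit:
            raise ValueError("a single word exceeds the chunk limit")
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```

Splitting on sentence boundaries would usually sound better than splitting on arbitrary whitespace; the greedy version is only meant to show the length constraint.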
Developer surface
Integration
- API style
- rest
- Base URL
- https://api.hume.ai
- Version
- v0
- Versioning
- url
- Stability
- ga
- Auth methods
- api_key, oauth2
- Error format
- vendor-specific (application/json with HTTPValidationError schema; detail array for 422)
- Rate limit
- 15 / minute (Free and Starter tiers; higher tiers allow up to 225 / minute)
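
Because the limits are expressed in requests per minute, a client can throttle itself instead of waiting for rejected calls. The sketch below is a generic sliding-window limiter seeded with the per-tier RPM values from this profile. It assumes the server counts requests over a rolling 60-second window, which the profile does not state, and `SlidingWindowLimiter` is an illustrative class, not part of any Hume SDK.

```python
import time
from collections import deque

# Requests per minute by tier, as listed in this profile.
TIER_RPM = {
    "free": 15,
    "starter": 15,
    "creator": 75,
    "pro": 75,
    "scale": 150,
    "business": 225,
}


class SlidingWindowLimiter:
    """Allow at most `max_requests` calls in any rolling `window_seconds` span."""

    def __init__(self, max_requests, window_seconds=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window = window_seconds
        self.clock = clock
        self._sent = deque()  # timestamps of recent requests, oldest first

    def wait_time(self):
        """Seconds to wait before the next request is allowed (0.0 if none)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            return 0.0
        return self.window - (now - self._sent[0])

    def record(self):
        """Note that a request is being sent now."""
        self._sent.append(self.clock())

    def acquire(self, sleep=time.sleep):
        """Block until a request is allowed, then record it."""
        delay = self.wait_time()
        if delay > 0:
            sleep(delay)
        self.record()


if __name__ == "__main__":
    limiter = SlidingWindowLimiter(TIER_RPM["free"])
    for _ in range(3):
        limiter.acquire()  # call this immediately before each API request
    print("sent 3 requests; next wait:", limiter.wait_time())
```

The `clock` and `sleep` parameters are injectable so the limiter can be exercised in tests without real delays.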
Adoption & maturity
- Launched
- 2025-02-26
- GA
- 2025-02-26
- Notable customers
- Niantic Spatial, GAF, Coconote
Other Text-to-Speech APIs
ElevenLabs Text to Speech
"Text to Speech with high quality, human-like AI voices"
Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
References
- [1] Description: hume.ai
- [2] Pricing model: hume.ai
- [3] Free tier: hume.ai
- [4] Free tier details: hume.ai · dev.hume.ai
- [5] Self-serve signup: app.hume.ai
- [6] Enterprise plan: hume.ai
- [7] Supported actions: dev.hume.ai · dev.hume.ai
- [8] Languages: dev.hume.ai · dev.hume.ai
- [9] Input types: dev.hume.ai
- [10] Output types: dev.hume.ai · dev.hume.ai
- [11] SDK languages: dev.hume.ai · dev.hume.ai
- [12] MCP server: dev.hume.ai
- [13] SOC 2: hume.ai · hume.ai
- [14] HIPAA: dev.hume.ai · hume.ai
- [15] GDPR: dev.hume.ai · hume.ai
- [16] Published SLA: hume.ai
- [17] Rate limits: hume.ai · dev.hume.ai
- [18] Known restrictions: dev.hume.ai · dev.hume.ai · dev.hume.ai
Change history
- 2026-06-21 Capabilities: {} → {"streaming":true,"multilingual":true,"voice_design":true,"voice_cloning":true,…
- 2026-06-21 Summary Md: (none) → Hume AI Octave is a text-to-speech API focused on emotionally expressive, natur…
- 2026-06-21 Score Pricing Transparency: (none) → 100
- 2026-06-21 Score Agent Friendliness: (none) → 40
- 2026-06-21 Score Setup Speed: (none) → 85
- 2026-06-21 Score Docs Quality: (none) → 35
- 2026-06-21 Score Procurement Friction: (none) → 100
- 2026-06-21 Score Trust Readiness: (none) → 55
- 2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 API Reference URL: (none) → https://dev.hume.ai/reference
- 2026-06-21 Status Page URL: (none) → https://status.hume.ai
- 2026-06-21 Docs URL: (none) → https://dev.hume.ai/intro
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Llms Txt Present: (none) → No
- 2026-06-21 Has Structured Data: (none) → No
- 2026-06-21 Has Published Pricing: set to Yes
- 2026-06-21 Free Tier Available: set to Yes
- 2026-06-21 Free Tier Details: set to Free plan at $0/month includes 10,000 characters (~10 minutes) per month; non-c…
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_2
- 2026-06-21 HIPAA: set to Yes
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 SLA Published: set to No
- 2026-06-21 Data Retention Policy URL: set to https://www.hume.ai/api-data-usage-policy
- 2026-06-21 Documented Rate Limits: set to Rate limits are per subscription tier (RPM): Free/Starter: 15 RPM; Creator: 75 …
- 2026-06-21 Rate Limit Requests: set to 15
- 2026-06-21 Rate Limit Window: set to minute
- 2026-06-21 Known Restrictions: set to Maximum 5,000 characters per Utterance, Maximum 1,000 characters per descriptio…
- 2026-06-21 Auth Methods: set to api_key, oauth2
- 2026-06-21 Auth Docs URL: set to https://dev.hume.ai/docs/introduction/api-key
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://api.hume.ai
- 2026-06-21 API Version: set to v0
- 2026-06-21 Versioning Scheme: set to url
- 2026-06-21 Stability: set to ga
- 2026-06-21 MCP URL: set to https://github.com/HumeAI/mcp-server-hume
- 2026-06-21 Quickstart URL: set to https://dev.hume.ai/docs/text-to-speech-tts/quickstart/python
- 2026-06-21 Error Format: set to vendor-specific (application/json with HTTPValidationError schema; detail array…
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Slug: set to hume-octave
- 2026-06-21 Price Basis: set to 1M characters
- 2026-06-21 Free Tier Limit: set to 10,000 characters/month
- 2026-06-21 Launched At: set to 2025-02-26
- 2026-06-21 GA Date: set to 2025-02-26
- 2026-06-21 Notable Customers: set to Niantic Spatial, GAF, Coconote
- 2026-06-21 Fields Not Found: set to iso_27001, pci_dss, sla_published, supported_regions_explicit, webhooks_support…
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/hume-octave \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'
Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/hume-octave/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'