ElevenLabs Text to Speech

"Text to Speech with high quality, human-like AI voices" [1]

Text-to-Speech APIs

elevenlabs.io/text-to-speech · By ElevenLabs · Agent JSON · Suggest an edit · Last verified 2026-06-21 · Source confidence: high

ElevenLabs Text to Speech is a REST API delivering high-quality, human-like AI voices for use cases spanning voice agents, audiobook production, video narration, game character voiceovers, and real-time conversational AI, with support for over a dozen synthesis capabilities including streaming, voice cloning, and multilingual output. Pricing starts at $6/month for 30,000 characters on the Starter plan, with a free tier of 10,000 characters per month and self-serve signup requiring no sales call. The API holds SOC 2 Type 2, ISO 27001, HIPAA, GDPR, and PCI DSS certifications, and offers Python and Node.js SDKs plus an MCP server. Notable customers include the Washington Post, HarperCollins, ESPN, and NVIDIA.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt)

Pricing & procurement

Pricing model: Hybrid (base + usage) [2]
Published pricing: Yes [3]
Free tier: Yes [4]
Free tier details: Free plan at $0/month includes 10,000 credits per month (1 text character = 1 credit for standard models); no commercial rights on free tier.
Self-serve signup: Yes
Requires sales call: No
Enterprise plan: Yes [5]

Published prices
Plan	Item	Per	Amount	Source
Free	Speech synthesis (plan fee)	month	$0	source
Free	Speech synthesis - Flash/Turbo models (included quota)	20,000 characters/month	$0	source
Free	Speech synthesis - Multilingual v2/v3 models (included quota)	10,000 characters/month	$0	source
Starter	Speech synthesis (plan fee)	month	$6	source
Starter	Speech synthesis - Flash/Turbo models (included quota)	20,000 characters/month	$0	source
Starter	Speech synthesis - Multilingual v2/v3 models (included quota)	10,000 characters/month	$0	source
Creator	Speech synthesis (plan fee)	month	$22	source
Creator	Speech synthesis - Flash/Turbo models (included quota)	120,000 characters/month	$0	source
Creator	Speech synthesis - Multilingual v2/v3 models (included quota)	60,000 characters/month	$0	source
Pro	Speech synthesis (plan fee)	month	$99	source
Pro	Speech synthesis - Flash/Turbo models (included quota)	440,000 characters/month	$0	source
Pro	Speech synthesis - Multilingual v2/v3 models (included quota)	220,000 characters/month	$0	source
Scale	Speech synthesis (plan fee)	month	$299	source
Scale	Speech synthesis - Flash/Turbo models (included quota)	1,980,000 characters/month	$0	source
Scale	Speech synthesis - Multilingual v2/v3 models (included quota)	990,000 characters/month	$0	source
Business	Speech synthesis (plan fee)	month	$990	source
Business	Speech synthesis - Flash/Turbo models (included quota)	5,980,000 characters/month	$0	source
Business	Speech synthesis - Multilingual v2/v3 models (included quota)	2,990,000 characters/month	$0	source
Pay As You Go	Flash/Turbo model speech synthesis (overage or standalone)	1,000 characters	$0.05	source
Pay As You Go	Multilingual v2/v3 model speech synthesis (overage or standalone)	1,000 characters	$0.1	source

Capabilities

Real-time streaming
Voice cloning
Voice design
SSML control
Multilingual voices
Word timestamps

Supported actions: synthesize_speech, streaming_tts, instant_voice_cloning, professional_voice_cloning, voice_design, ssml_support, word_timestamps, speech_to_speech, multilingual_synthesis, pronunciation_dictionary, audio_tagging, voice_library_access, text_normalization, websocket_streaming, request_stitching [6]
Regions: United States, European Union, India, Singapore [7]
Languages: English (US), English (UK), English (AU), English (IN), Japanese, Chinese, German, Hindi, French, Korean, Portuguese, Italian, Spanish, Indonesian, Dutch, Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian, Arabic, Czech, Greek, Finnish, Croatian, Malay, Slovak, Danish, Tamil, Ukrainian, Russian, Hungarian, Norwegian, Vietnamese, 70+ languages total via eleven_v3 model [8]
Input types: plain text, SSML, audio tags (Eleven v3 model) [9]
Output types: mp3, pcm, wav, opus, ulaw, alaw [10]
Webhooks: Yes [11]
Sandbox / test mode: No [12]
SDK languages: Python, Node.js, Python (MCP server) [13]
MCP server: Yes [14]

Trust & compliance

SOC 2: SOC 2 Type II [15]
HIPAA: Yes [16]
GDPR: Yes [17]
ISO 27001: Yes [18]
PCI DSS: Yes [19]
Published SLA: No [20]
Rate limits: TTS concurrency limits by plan: Free=2, Starter=3, Creator=5, Pro=10, Scale=15, Business=15 concurrent requests. Response headers expose current-concurrent-requests and maximum-concurrent-requests. Burst pricing allows up to 3x normal concurrency limit at double the standard rate. Flash v2.5 model inference latency ~75ms for typical short inputs. [21]
Known restrictions: Commercial usage rights require paid (Starter+) plan, MP3 192kbps output requires Creator tier or higher, PCM/WAV 44.1kHz requires Pro tier or higher, Professional Voice Cloning requires paid plan, Free tier restricted from voice library API access, HIPAA BAA only available for Enterprise tier subscriptions, Data residency (EU/India/Singapore) only available to Enterprise customers, Maximum 3 pronunciation dictionary locators per request, Maximum 3 request IDs for audio stitching per request, Character limit per request: 5,000 (Eleven v3), 10,000 (Multilingual v2), 40,000 (Flash v2.5), Several models deprecated for removal July 9, 2026: eleven_monolingual_v1, eleven_multilingual_v1, scribe_v1, eleven_turbo_v2_5, eleven_turbo_v2, Voice cloning requires consent from voice owner; platform enforces this, No publicly published uptime SLA; custom SLAs available for Enterprise only [22]

Developer surface

Docs rendering: static · llms.txt present

Integration

API style: rest
Base URL: https://api.elevenlabs.io/v1
Version: v1
Versioning: url
Stability: ga
Auth methods: api_key
Idempotency keys: No
Error format: vendor-specific
Webhook signing: hmac
Rate limit: 2 / concurrent

SDKs

Python elevenlabs · repo
Node.js @elevenlabs/elevenlabs-js · repo
Python (MCP server) elevenlabs-mcp · repo

Adoption & maturity

Launched: 2022-01-01
GA: 2023-01-01
Notable customers: Washington Post, HarperCollins, TIME, The New Yorker, Bertelsmann, NVIDIA, ESPN, Paradox Interactive, Perplexity, Chess.com

Other Text-to-Speech APIs

Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Usage · free tier · public pricing · self-serve
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Usage · free tier · public pricing · self-serve
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Usage · free tier · public pricing · self-serve
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Hybrid · free tier · public pricing · self-serve
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
Usage · public pricing · self-serve
OpenAI Text to Speech (gpt-4o-mini-tts / tts-1)
"Transform text into lifelike spoken audio" - OpenAI's TTS service enabling blog narration, multilingual audio production, and realtime voice output via gpt-4o-mini-tts, tts-1, and tts-1-hd models.
Usage · public pricing · self-serve

ElevenLabs Text to Speech alternatives · ElevenLabs Text to Speech vs Azure AI Text to Speech · All Text-to-Speech APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

↑Description: elevenlabs.io
↑Pricing model: elevenlabs.io · elevenlabs.io
↑Published pricing: elevenlabs.io
↑Free tier: elevenlabs.io · elevenlabs.io
↑Enterprise plan: elevenlabs.io
↑Supported actions: elevenlabs.io
↑Regions: elevenlabs.io
↑Languages: elevenlabs.io
↑Input types: elevenlabs.io
↑Output types: elevenlabs.io
↑Webhooks: elevenlabs.io
↑Sandbox: elevenlabs.io
↑SDK languages: elevenlabs.io
↑MCP server: elevenlabs.io · github.com
↑SOC 2: compliance.elevenlabs.io · elevenlabs.io
↑HIPAA: elevenlabs.io · elevenlabs.io
↑GDPR: elevenlabs.io
↑ISO 27001: elevenlabs.io · compliance.elevenlabs.io
↑PCI DSS: elevenlabs.io
↑Published SLA: elevenlabs.io
↑Rate limits: help.elevenlabs.io
↑Known restrictions: elevenlabs.io · elevenlabs.io

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"ssml":true,"streaming":true,"multilingual":true,"voice_design":true,"voice_cl…
2026-06-21 Summary Md: (none) → ElevenLabs Text to Speech is a REST API delivering high-quality, human-like AI …
2026-06-21 Score Pricing Transparency: (none) → 100
2026-06-21 Score Setup Speed: (none) → 85
2026-06-21 Score Docs Quality: (none) → 55
2026-06-21 Score Procurement Friction: (none) → 100
2026-06-21 Score Trust Readiness: (none) → 80
2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Score Agent Friendliness: (none) → 65
2026-06-21 Llms Txt URL: (none) → https://elevenlabs.io/llms.txt
2026-06-21 Llms Txt Present: (none) → Yes
2026-06-21 Rendering: (none) → static
2026-06-21 Has Structured Data: (none) → Yes
2026-06-21 Robots Allows Agents: (none) → Yes
2026-06-21 API Reference URL: (none) → https://elevenlabs.io/api
2026-06-21 Status Page URL: (none) → https://status.elevenlabs.io
2026-06-21 Changelog URL: (none) → https://elevenlabs.io/changelog
2026-06-21 Docs URL: (none) → https://elevenlabs.io/docs/overview/intro
2026-06-21 Free Tier Details: set to Free plan at $0/month includes 10,000 credits per month (1 text character = 1 c…
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to No
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to type_2
2026-06-21 HIPAA: set to Yes
2026-06-21 GDPR: set to Yes
2026-06-21 ISO 27001: set to Yes
2026-06-21 PCI DSS: set to Yes
2026-06-21 SLA Published: set to No
2026-06-21 Data Retention Policy URL: set to https://elevenlabs.io/privacy-policy
2026-06-21 Documented Rate Limits: set to TTS concurrency limits by plan: Free=2, Starter=3, Creator=5, Pro=10, Scale=15,…
2026-06-21 Rate Limit Requests: set to 2
2026-06-21 Rate Limit Window: set to concurrent
2026-06-21 Known Restrictions: set to Commercial usage rights require paid (Starter+) plan, MP3 192kbps output requir…
2026-06-21 Auth Methods: set to api_key
2026-06-21 Auth Docs URL: set to https://elevenlabs.io/docs/api-reference/authentication
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to https://api.elevenlabs.io/v1
2026-06-21 API Version: set to v1
2026-06-21 Versioning Scheme: set to url
2026-06-21 Stability: set to ga
2026-06-21 MCP URL: set to https://github.com/elevenlabs/elevenlabs-mcp
2026-06-21 Quickstart URL: set to https://elevenlabs.io/docs/eleven-api/quickstart
2026-06-21 Idempotency Supported: set to No
2026-06-21 Error Format: set to vendor-specific
2026-06-21 Webhook Signing: set to hmac
2026-06-21 Webhook Events URL: set to https://elevenlabs.io/docs/eleven-api/resources/webhooks
2026-06-21 Requires Verification: set to No
2026-06-21 Primary Use Cases: set to voice agents and chatbots, audiobook production, video and TV narration, video …
2026-06-21 Price Basis: set to month (30,000 characters on Starter; ~$200/1M chars)

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/elevenlabs-tts \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/elevenlabs-tts/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →

Best for / Avoid if

Pricing & procurement

Capabilities

Trust & compliance

Developer surface

Integration

Adoption & maturity

Other Text-to-Speech APIs

Azure AI Text to Speech

Amazon Polly

Google Cloud Text-to-Speech

Cartesia (Sonic)

Murf AI

OpenAI Text to Speech (gpt-4o-mini-tts / tts-1)

References

Change history

Suggest an edit / leave a review