Deepgram Aura (Text to Speech)
"Sub-200ms streaming text-to-speech built for voice agents with domain-specific accuracy and secure, scalable deployment across cloud and on-prem environments" [1]
Deepgram Aura is a streaming text-to-speech API built for real-time voice agents, contact centers, and conversational AI, with sub-200ms latency delivered over WebSocket or REST. It is priced per 1,000 characters on a usage-based model with a one-time $200 sign-up credit, no sales call required, and enterprise plans available. The API carries SOC 2 Type 2, HIPAA, GDPR, and PCI DSS certifications, supports self-hosted deployment, and offers SDKs for JavaScript, Python, Go, and .NET.
Best for / Avoid if
Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box
Avoid if: You want to try it free before paying
Pricing & procurement
- Pricing model
- Usage-based [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✗ No [4]
- Self-serve signup
- ✓ Yes
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [5]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Pay As You Go | Aura-2 speech synthesis | 1,000 characters | $0.03 | source |
| Pay As You Go | Aura-1 speech synthesis | 1,000 characters | $0.015 | source |
| Growth (prepaid annual credits) | Aura-2 speech synthesis | 1,000 characters | $0.027 | source |
| Growth (prepaid annual credits) | Aura-1 speech synthesis | 1,000 characters | $0.0135 | source |
Capabilities
- Supported actions
- synthesize_speech, streaming_tts, websocket_streaming, rest_tts, tts_callback, audio_output_streaming, multilingual_synthesis, codeswitching_voices, self_hosted_deployment
- Regions
- United States, European Union (api.eu.deepgram.com), Australia (api.au.deepgram.com) [6]
- Languages
- English (US), English (UK/British), English (Australian), English (Irish), English (Filipino/Philippine), Spanish (Mexican), Spanish (Peninsular), Spanish (Colombian), Spanish (Latin American), German, French, Dutch, Italian, Japanese [7]
- Input types
- plain text, JSON text payload
- Output types
- mp3, wav/linear16, mulaw, alaw, opus, flac, aac, streaming audio chunks [8]
- Webhooks
- ✓ Yes [9]
- Sandbox / test mode
- ✗ No [10]
- SDK languages
- JavaScript, Python, Go, .NET [11]
- MCP server
- ✓ Yes [12]
Trust & compliance
- SOC 2
- SOC 2 Type II [13]
- HIPAA
- ✓ Yes [14]
- GDPR
- ✓ Yes [15]
- ISO 27001
- – Unknown [16]
- PCI DSS
- ✓ Yes [17]
- Published SLA
- ✓ Yes [18]
- Rate limits
- Max 2,000 characters per request (Aura-2 and Aura-1); requests exceeding this return HTTP 413. Concurrent request limit enforced per project: 15 concurrent REST requests on Pay As You Go; 45 concurrent WebSocket requests. Exceeding returns HTTP 429 Too Many Requests. [19]
- Known restrictions
- No SSML support; uses text-based prompting techniques (punctuation, filler words) for speech control instead, WebSocket/streaming mode only supports linear16, mulaw, and alaw encodings; compressed formats (mp3, opus, flac, aac) are REST-only, Maximum 2,000 characters per REST request, $200 sign-up credit is a one-time trial credit, not a recurring free tier, Voice cloning not offered as a self-serve feature, Spanish, Dutch, French, German, Italian, Japanese voices are Aura-2 only (Aura-1 has English voices only) [20]
Developer surface
Integration
- API style
- rest
- Base URL
- https://api.deepgram.com/v1
- Version
- v1
- Versioning
- url
- Stability
- ga
- Auth methods
- api_key, jwt
- Error format
- vendor-specific
- Webhook signing
- dg-token header
- Rate limit
- 15 / concurrent
Adoption & maturity
- Launched
- 2024-03-12
- GA
- 2024-03-12
- Notable customers
- Humach, Vapi, Daily, Twilio, Quiq
Other Text-to-Speech APIs
ElevenLabs Text to Speech
"Text to Speech with high quality, human-like AI voices"
Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
References
- ↑Description: deepgram.com
- ↑Pricing model: deepgram.com · deepgram.com
- ↑Published pricing: deepgram.com
- ↑Free tier: deepgram.com
- ↑Enterprise plan: deepgram.com
- ↑Regions: developers.deepgram.com
- ↑Languages: developers.deepgram.com
- ↑Output types: developers.deepgram.com
- ↑Webhooks: developers.deepgram.com
- ↑Sandbox: developers.deepgram.com
- ↑SDK languages: developers.deepgram.com
- ↑MCP server: developers.deepgram.com
- ↑SOC 2: developers.deepgram.com · deepgram.com
- ↑HIPAA: developers.deepgram.com · developers.deepgram.com
- ↑GDPR: developers.deepgram.com
- ↑ISO 27001: developers.deepgram.com
- ↑PCI DSS: developers.deepgram.com · developers.deepgram.com
- ↑Published SLA: deepgram.com
- ↑Rate limits: developers.deepgram.com · developers.deepgram.com
- ↑Known restrictions: developers.deepgram.com · developers.deepgram.com
Change history
- 2026-06-21 Capabilities: {} → {"streaming":true,"multilingual":true}
- 2026-06-21 Summary Md: (none) → Deepgram Aura is a streaming text-to-speech API built for real-time voice agent…
- 2026-06-21 Score Agent Friendliness: (none) → 55
- 2026-06-21 Score Pricing Transparency: (none) → 85
- 2026-06-21 Score Setup Speed: (none) → 60
- 2026-06-21 Score Docs Quality: (none) → 35
- 2026-06-21 Score Procurement Friction: (none) → 85
- 2026-06-21 Score Trust Readiness: (none) → 85
- 2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
- 2026-06-21 Avoid If: (none) → You want to try it free before paying
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Has Structured Data: (none) → No
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 Status Page URL: (none) → https://status.deepgram.com
- 2026-06-21 Changelog URL: (none) → https://deepgram.com/changelog
- 2026-06-21 Docs URL: (none) → https://developers.deepgram.com/home
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Llms Txt Present: (none) → Yes
- 2026-06-21 Llms Txt URL: (none) → https://deepgram.com/llms.txt
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_2
- 2026-06-21 HIPAA: set to Yes
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 PCI DSS: set to Yes
- 2026-06-21 SLA Published: set to Yes
- 2026-06-21 Data Retention Policy URL: set to https://developers.deepgram.com/trust-security/data-privacy-compliance
- 2026-06-21 Documented Rate Limits: set to Max 2,000 characters per request (Aura-2 and Aura-1); requests exceeding this r…
- 2026-06-21 Rate Limit Requests: set to 15
- 2026-06-21 Rate Limit Window: set to concurrent
- 2026-06-21 Known Restrictions: set to No SSML support; uses text-based prompting techniques (punctuation, filler word…
- 2026-06-21 Auth Methods: set to api_key, jwt
- 2026-06-21 Auth Docs URL: set to https://developers.deepgram.com/reference/authentication
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://api.deepgram.com/v1
- 2026-06-21 API Version: set to v1
- 2026-06-21 Versioning Scheme: set to url
- 2026-06-21 Stability: set to ga
- 2026-06-21 MCP URL: set to https://developers.deepgram.com/_mcp/server
- 2026-06-21 Quickstart URL: set to https://developers.deepgram.com/docs/text-to-speech
- 2026-06-21 Error Format: set to vendor-specific
- 2026-06-21 Webhook Signing: set to dg-token header
- 2026-06-21 Webhook Events URL: set to https://developers.deepgram.com/docs/tts-callback
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Starting Price Usd: set to 0
- 2026-06-21 Slug: set to deepgram-aura
- 2026-06-21 Free Tier Limit: set to $200 credit
- 2026-06-21 Launched At: set to 2024-03-12
- 2026-06-21 GA Date: set to 2024-03-12
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/deepgram-aura \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/deepgram-aura/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'