Speechmatics Text to Speech
"Natural, human-like synthetic voices" with low latency designed for real-time conversational applications and voice AI agents. [1]
Speechmatics Text to Speech is a REST API for low-latency speech synthesis aimed at voice AI agents and real-time conversational applications, offering streaming output with initial audio latency under 200ms. Pricing starts at $0.011 per 1,000 characters with a free tier of 1 million characters per month, and enterprise plans are available. The service is certified SOC 2 Type 2, HIPAA, GDPR, and ISO 27001, and supports both SaaS and on-premises deployment. Current coverage is English only (US and UK voices), with SDKs for Python, Node.js, .NET, and Rust.
Best for / Avoid if
Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; Cost-sensitive teams - low, transparent entry price
Pricing & procurement
- Pricing model
- Hybrid (base + usage) [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✓ Yes [4]
- Free tier details
- 1 million characters (~20 hours) per month included free on Free and Pro plans, English language only. [5]
- Self-serve signup
- ✓ Yes [6]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [7]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Free | Text-to-speech synthesis | 1,000,000 characters per month (included) | $0 | source |
| Pro | Text-to-speech synthesis (included allowance) | 1,000,000 characters per month (included) | $0 | source |
| Text-to-speech synthesis (overage / pay-as-you-go) | 1,000 characters | $0.011 | source |
Capabilities
- Supported actions
- synthesize_speech, streaming_tts, low_latency_synthesis [8]
- Regions
- global (SaaS), on-premises deployment
- Languages
- English (US), English (UK) [9]
- Input types
- plain text
- Output types
- wav, pcm [10]
- Webhooks
- ✗ No [11]
- Sandbox / test mode
- ✗ No [12]
- SDK languages
- Python, Node.js, .NET, Rust [13]
- MCP server
- ✗ No [14]
Trust & compliance
- SOC 2
- SOC 2 Type II [15]
- HIPAA
- ✓ Yes [16]
- GDPR
- ✓ Yes [17]
- ISO 27001
- ✓ Yes [18]
- PCI DSS
- ✗ No [19]
- Published SLA
- ✗ No [20]
- Rate limits
- No specific numeric rate/concurrency limits published for TTS. Documentation notes: "If you encounter rate limit errors, use retry with exponential backoff." Initial audio latency: less than 200ms, with subsequent chunks faster than real time. [21]
- Known restrictions
- English language only (US and UK voices); additional languages in development, No SSML support documented, No bidirectional streaming - output streaming only, No voice speed/pitch/emphasis controls, Python SDK only (TTS-specific); no official Node.js/other SDK for TTS yet, Input text and generated audio retained during preview to improve the service; production will offer non-retentive option, 4 voices only: Sarah (UK Female), Theo (UK Male), Megan (US Female), Jack (US Male), Audio output at 16 kHz, 16-bit, mono only, Product still in preview (billing announced for October 2025) [22]
Developer surface
Integration
- API style
- rest
- Base URL
- https://preview.tts.speechmatics.com
- Versioning
- none
- Stability
- beta
- Auth methods
- api_key, jwt
- Idempotency keys
- ✗ No
- Error format
- vendor-specific
Adoption & maturity
- Launched
- 2006-01-01
- Notable customers
- VAPI, LiveKit, AI Media, Content Guru, Echo360, Pipecat, Ubisoft (Blue Mammoth Games)
Other Text-to-Speech APIs
ElevenLabs Text to Speech
"Text to Speech with high quality, human-like AI voices"
Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
References
- ↑Description: speechmatics.com
- ↑Pricing model: speechmatics.com · speechmatics.com
- ↑Published pricing: speechmatics.com · speechmatics.com
- ↑Free tier: speechmatics.com
- ↑Free tier details: speechmatics.com
- ↑Self-serve signup: portal.speechmatics.com
- ↑Enterprise plan: speechmatics.com
- ↑Supported actions: docs.speechmatics.com
- ↑Languages: docs.speechmatics.com · speechmatics.com
- ↑Output types: docs.speechmatics.com
- ↑Webhooks: docs.speechmatics.com
- ↑Sandbox: docs.speechmatics.com
- ↑SDK languages: docs.speechmatics.com · docs.speechmatics.com
- ↑MCP server: speechmatics.com
- ↑SOC 2: speechmatics.com
- ↑HIPAA: speechmatics.com
- ↑GDPR: speechmatics.com
- ↑ISO 27001: speechmatics.com
- ↑PCI DSS: speechmatics.com
- ↑Published SLA: speechmatics.com
- ↑Rate limits: docs.speechmatics.com · docs.speechmatics.com
- ↑Known restrictions: docs.speechmatics.com · docs.speechmatics.com
Change history
- 2026-06-21 Capabilities: {} → {"streaming":true}
- 2026-06-21 Summary Md: (none) → Speechmatics Text to Speech is a REST API for low-latency speech synthesis aime…
- 2026-06-21 Score Setup Speed: (none) → 85
- 2026-06-21 Score Pricing Transparency: (none) → 100
- 2026-06-21 Score Docs Quality: (none) → 15
- 2026-06-21 Score Procurement Friction: (none) → 100
- 2026-06-21 Score Trust Readiness: (none) → 70
- 2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Score Agent Friendliness: (none) → 30
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 Status Page URL: (none) → https://status.speechmatics.com
- 2026-06-21 Docs URL: (none) → https://docs.speechmatics.com/
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Has Structured Data: (none) → Yes
- 2026-06-21 Llms Txt Present: (none) → No
- 2026-06-21 Pricing Model: set to hybrid
- 2026-06-21 Has Published Pricing: set to Yes
- 2026-06-21 Free Tier Available: set to Yes
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Free Tier Details: set to 1 million characters (~20 hours) per month included free on Free and Pro plans,…
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_2
- 2026-06-21 HIPAA: set to Yes
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 ISO 27001: set to Yes
- 2026-06-21 PCI DSS: set to No
- 2026-06-21 SLA Published: set to No
- 2026-06-21 Data Retention Policy URL: set to https://www.speechmatics.com/legal/terms-of-service
- 2026-06-21 Documented Rate Limits: set to No specific numeric rate/concurrency limits published for TTS. Documentation no…
- 2026-06-21 Known Restrictions: set to English language only (US and UK voices); additional languages in development, …
- 2026-06-21 Auth Methods: set to api_key, jwt
- 2026-06-21 Auth Docs URL: set to https://docs.speechmatics.com/get-started/authentication
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://preview.tts.speechmatics.com
- 2026-06-21 Versioning Scheme: set to none
- 2026-06-21 Stability: set to beta
- 2026-06-21 Quickstart URL: set to https://docs.speechmatics.com/text-to-speech/quickstart
- 2026-06-21 Idempotency Supported: set to No
- 2026-06-21 Error Format: set to vendor-specific
- 2026-06-21 Slug: set to speechmatics-tts
- 2026-06-21 Price Basis: set to 1,000 characters
- 2026-06-21 Free Tier Limit: set to 1,000,000 characters/month
- 2026-06-21 Launched At: set to 2006-01-01
- 2026-06-21 Notable Customers: set to VAPI, LiveKit, AI Media, Content Guru, Echo360, Pipecat, Ubisoft (Blue Mammoth …
- 2026-06-21 Fields Not Found: set to pci_dss (not mentioned on security page - set false), sla_published (security p…
- 2026-06-21 Source Confidence: set to high
- 2026-06-21 Extractor: set to claude-subagent:sonnet
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/speechmatics-tts \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/speechmatics-tts/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'