Speechmatics Text to Speech

"Natural, human-like synthetic voices" with low latency designed for real-time conversational applications and voice AI agents. [1]

Text-to-Speech APIs

www.speechmatics.com/text-to-speech · By Speechmatics · Last verified 2026-06-21 · Source confidence: high

Speechmatics Text to Speech is a REST API for low-latency speech synthesis aimed at voice AI agents and real-time conversational applications. It streams output, with initial audio latency under 200 ms. Pricing starts at $0.011 per 1,000 characters beyond a free allowance of 1 million characters per month, and enterprise plans are available. The profile lists SOC 2 Type II, HIPAA, GDPR, and ISO 27001 under trust and compliance, and the service supports both SaaS and on-premises deployment. Current coverage is English only (US and UK voices). Speechmatics publishes SDKs for Python, Node.js, .NET, and Rust, but only the Python SDK is listed as covering TTS, and the product is still in preview.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; Cost-sensitive teams - low, transparent entry price

Pricing & procurement

Pricing model: Hybrid (base + usage) [2]
Published pricing: Yes [3]
Free tier: Yes [4]
Free tier details: 1 million characters (~20 hours) per month included free on Free and Pro plans, English language only. [5]
Self-serve signup: Yes [6]
Requires sales call: No
Enterprise plan: Yes [7]

Published prices
Plan	Item	Per	Amount
Free	Text-to-speech synthesis	1,000,000 characters per month (included)	$0
Pro	Text-to-speech synthesis (included allowance)	1,000,000 characters per month (included)	$0
Pro	Text-to-speech synthesis (overage / pay-as-you-go)	1,000 characters	$0.011
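The published prices above translate into a simple monthly estimate. The following is a minimal sketch, not a billing formula from Speechmatics: the function name `estimate_monthly_cost` is illustrative, and it assumes overage is billed pro rata rather than rounded up to whole 1,000-character blocks, which the profile does not state.

```python
from decimal import Decimal

FREE_CHARS_PER_MONTH = 1_000_000       # included allowance listed for the Free and Pro plans
OVERAGE_USD_PER_1K = Decimal("0.011")  # published pay-as-you-go rate per 1,000 characters


def estimate_monthly_cost(chars_used: int) -> Decimal:
    """Estimate the monthly overage charge in USD for a given character count."""
    if chars_used < 0:
        raise ValueError("chars_used must be non-negative")
    billable = max(0, chars_used - FREE_CHARS_PER_MONTH)
    # Assumption: pro rata billing, no rounding to whole 1,000-character blocks.
    return (Decimal(billable) / 1000) * OVERAGE_USD_PER_1K


if __name__ == "__main__":
    for chars in (800_000, 3_500_000):
        print(chars, "characters ->", estimate_monthly_cost(chars), "USD")
```

`Decimal` is used instead of `float` so that fractions of a cent do not pick up binary rounding noise.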

Capabilities

Real-time streaming

Supported actions: synthesize_speech, streaming_tts, low_latency_synthesis [8]
Regions: global (SaaS), on-premises deployment
Languages: English (US), English (UK) [9]
Input types: plain text
Output types: wav, pcm [10]
Webhooks: No [11]
Sandbox / test mode: No [12]
SDK languages: Python, Node.js, .NET, Rust [13]
SDK source excerpt (docs.speechmatics.com/integrations-and-sdks/sdks): "TTS (Python) - Convert text to speech — available in the speechmatics-python-sdk repository, sdk/tts directory."
SDK source excerpt (docs.speechmatics.com/text-to-speech/quickstart): "Official SDKs for different programming languages are not available yet. The API can be accessed using standard HTTP requests as shown in the quickstart example. We will release official SDKs later."
MCP server: No [14]
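Because the profile lists raw `pcm` as an output type and gives the audio format as 16 kHz, 16-bit, mono, a client that receives raw samples may want to wrap them in a WAV container for playback. The sketch below uses only Python's standard `wave` module. It assumes the PCM data is signed little-endian, which the profile does not confirm, and the helper names are illustrative.

```python
import io
import wave

SAMPLE_RATE_HZ = 16_000  # 16 kHz, per the profile
SAMPLE_WIDTH_BYTES = 2   # 16-bit samples
CHANNELS = 1             # mono


def pcm_to_wav(pcm: bytes) -> bytes:
    """Wrap raw PCM samples in a WAV container (assumes signed little-endian data)."""
    if len(pcm) % (SAMPLE_WIDTH_BYTES * CHANNELS):
        raise ValueError("PCM data ends in the middle of a sample")
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav.setframerate(SAMPLE_RATE_HZ)
        wav.writeframes(pcm)
    return buf.getvalue()


def duration_seconds(pcm: bytes) -> float:
    """Playback length of a raw PCM buffer at the format above."""
    return len(pcm) / (SAMPLE_RATE_HZ * SAMPLE_WIDTH_BYTES * CHANNELS)


if __name__ == "__main__":
    one_second_of_silence = bytes(SAMPLE_RATE_HZ * SAMPLE_WIDTH_BYTES)
    print(len(pcm_to_wav(one_second_of_silence)), "bytes of WAV")
    print(duration_seconds(one_second_of_silence), "seconds")
```

At this format one second of audio is 32,000 bytes, which is a convenient figure for sizing playback buffers.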

Trust & compliance

SOC 2: SOC 2 Type II [15]
HIPAA: Yes [16]
GDPR: Yes [17]
ISO 27001: Yes [18]
PCI DSS: No [19]
Published SLA: No [20]
Rate limits: No specific numeric rate/concurrency limits published for TTS. Documentation notes: "If you encounter rate limit errors, use retry with exponential backoff." Initial audio latency: less than 200ms, with subsequent chunks faster than real time. [21]
Known restrictions: English language only (US and UK voices), with additional languages in development; no SSML support documented; no bidirectional streaming (output streaming only); no voice speed, pitch, or emphasis controls; Python SDK only for TTS, with no official Node.js or other SDK for TTS yet; input text and generated audio are retained during preview to improve the service, and production will offer a non-retentive option; 4 voices only: Sarah (UK Female), Theo (UK Male), Megan (US Female), Jack (US Male); audio output at 16 kHz, 16-bit, mono only; product still in preview (billing announced for October 2025) [22]
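The rate-limit guidance quoted above ("use retry with exponential backoff") can be sketched generically. This is an illustration under stated assumptions, not Speechmatics code: `RateLimitError` is a hypothetical exception that the caller would raise on a rate-limit response, and the delays and attempt count are arbitrary defaults because no numeric limits are published.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical error the caller raises when the API reports a rate limit."""


def retry_with_backoff(
    call: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 0.5,
    max_delay: float = 8.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run call(), retrying on RateLimitError with capped exponential backoff."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # The delay ceiling doubles each attempt: 0.5 s, 1 s, 2 s, ... up to max_delay.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: sleep a random amount below the ceiling to spread out retries.
            sleep(random.uniform(0, ceiling))
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the retry logic can be exercised in tests without waiting.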

Developer surface

Docs rendering: static

Integration

API style: rest
Base URL: https://preview.tts.speechmatics.com
Versioning: none
Stability: beta
Auth methods: api_key, jwt
Idempotency keys: No
Error format: vendor-specific
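The Integration fields above (REST style, the preview base URL, API-key auth) are enough to outline a request, though this profile does not give the endpoint path, the request body, or the auth header format. The sketch below only builds a request object and does not send it. The `path` argument, the `{"text": ...}` body, and the `Authorization: Bearer` header are all assumptions made for illustration; the vendor quickstart is the place to confirm the real values.

```python
import json
import urllib.request

BASE_URL = "https://preview.tts.speechmatics.com"  # Base URL from the Integration fields


def build_tts_request(path: str, api_key: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for speech synthesis.

    Assumptions, not confirmed by this profile: the caller supplies the
    endpoint path, the body is JSON with a "text" key, and the API key is
    sent as a bearer token.
    """
    if not path.startswith("/"):
        raise ValueError("path must start with '/'")
    body = json.dumps({"text": text}).encode("utf-8")  # hypothetical payload shape
    return urllib.request.Request(
        url=BASE_URL + path,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # hypothetical header format
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # "/PATH-FROM-QUICKSTART" is a placeholder, not a real endpoint.
    req = build_tts_request("/PATH-FROM-QUICKSTART", "YOUR_API_KEY", "Hello there")
    print(req.get_method(), req.full_url)
```

Once the real path is known, passing the request to `urllib.request.urlopen` and reading the response in chunks would let playback begin before synthesis finishes, which is the point of the streaming output described above.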

SDKs

Python speechmatics-tts · repo
Python speechmatics-python-sdk · repo
Node.js @speechmatics/real-time-client · repo
Node.js @speechmatics/batch-client · repo
.NET speechmatics-dotnet · repo
Rust speechmatics-rs · repo

Adoption & maturity

Launched: 2006-01-01
Notable customers: VAPI, LiveKit, AI Media, Content Guru, Echo360, Pipecat, Ubisoft (Blue Mammoth Games)

Other Text-to-Speech APIs

ElevenLabs Text to Speech
"Text to Speech with high quality, human-like AI voices"
Hybrid · free tier · public pricing · self-serve
Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Usage · free tier · public pricing · self-serve
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Usage · free tier · public pricing · self-serve
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Usage · free tier · public pricing · self-serve
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Hybrid · free tier · public pricing · self-serve
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
Usage · public pricing · self-serve


References

Each field above carries a numbered source.

[1] Description: speechmatics.com
[2] Pricing model: speechmatics.com · speechmatics.com
[3] Published pricing: speechmatics.com · speechmatics.com
[4] Free tier: speechmatics.com
[5] Free tier details: speechmatics.com
[6] Self-serve signup: portal.speechmatics.com
[7] Enterprise plan: speechmatics.com
[8] Supported actions: docs.speechmatics.com
[9] Languages: docs.speechmatics.com · speechmatics.com
[10] Output types: docs.speechmatics.com
[11] Webhooks: docs.speechmatics.com
[12] Sandbox: docs.speechmatics.com
[13] SDK languages: docs.speechmatics.com · docs.speechmatics.com
[14] MCP server: speechmatics.com
[15] SOC 2: speechmatics.com
[16] HIPAA: speechmatics.com
[17] GDPR: speechmatics.com
[18] ISO 27001: speechmatics.com
[19] PCI DSS: speechmatics.com
[20] Published SLA: speechmatics.com
[21] Rate limits: docs.speechmatics.com · docs.speechmatics.com
[22] Known restrictions: docs.speechmatics.com · docs.speechmatics.com

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"streaming":true}
2026-06-21 Summary Md: (none) → Speechmatics Text to Speech is a REST API for low-latency speech synthesis aime…
2026-06-21 Score Setup Speed: (none) → 85
2026-06-21 Score Pricing Transparency: (none) → 100
2026-06-21 Score Docs Quality: (none) → 15
2026-06-21 Score Procurement Friction: (none) → 100
2026-06-21 Score Trust Readiness: (none) → 70
2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Score Agent Friendliness: (none) → 30
2026-06-21 Robots Allows Agents: (none) → Yes
2026-06-21 Status Page URL: (none) → https://status.speechmatics.com
2026-06-21 Docs URL: (none) → https://docs.speechmatics.com/
2026-06-21 Rendering: (none) → static
2026-06-21 Has Structured Data: (none) → Yes
2026-06-21 Llms Txt Present: (none) → No
2026-06-21 Pricing Model: set to hybrid
2026-06-21 Has Published Pricing: set to Yes
2026-06-21 Free Tier Available: set to Yes
2026-06-21 Requires Verification: set to No
2026-06-21 Free Tier Details: set to 1 million characters (~20 hours) per month included free on Free and Pro plans,…
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to No
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to type_2
2026-06-21 HIPAA: set to Yes
2026-06-21 GDPR: set to Yes
2026-06-21 ISO 27001: set to Yes
2026-06-21 PCI DSS: set to No
2026-06-21 SLA Published: set to No
2026-06-21 Data Retention Policy URL: set to https://www.speechmatics.com/legal/terms-of-service
2026-06-21 Documented Rate Limits: set to No specific numeric rate/concurrency limits published for TTS. Documentation no…
2026-06-21 Known Restrictions: set to English language only (US and UK voices); additional languages in development, …
2026-06-21 Auth Methods: set to api_key, jwt
2026-06-21 Auth Docs URL: set to https://docs.speechmatics.com/get-started/authentication
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to https://preview.tts.speechmatics.com
2026-06-21 Versioning Scheme: set to none
2026-06-21 Stability: set to beta
2026-06-21 Quickstart URL: set to https://docs.speechmatics.com/text-to-speech/quickstart
2026-06-21 Idempotency Supported: set to No
2026-06-21 Error Format: set to vendor-specific
2026-06-21 Slug: set to speechmatics-tts
2026-06-21 Price Basis: set to 1,000 characters
2026-06-21 Free Tier Limit: set to 1,000,000 characters/month
2026-06-21 Launched At: set to 2006-01-01
2026-06-21 Notable Customers: set to VAPI, LiveKit, AI Media, Content Guru, Echo360, Pipecat, Ubisoft (Blue Mammoth …
2026-06-21 Fields Not Found: set to pci_dss (not mentioned on security page - set false), sla_published (security p…
2026-06-21 Source Confidence: set to high
2026-06-21 Extractor: set to claude-subagent:sonnet

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/speechmatics-tts \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/speechmatics-tts/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'
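For agents that prefer a script to curl, the field-correction call above can be assembled in Python with the standard library. This sketch mirrors the URL and JSON keys of the curl command and only builds the request; the function name `build_suggestion` is illustrative, and `FIELD` stays a placeholder for a key from the Agent JSON.

```python
import json
import urllib.request


def build_suggestion(
    slug: str, field: str, value: str, source_url: str, excerpt: str, note: str
) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    payload = {
        "value": value,
        "citations": [{"url": source_url, "excerpt": excerpt}],
        "note": note,
    }
    return urllib.request.Request(
        url=f"https://apio.sh/api/suggest/{slug}/{field}",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={"Content-Type": "application/json"},
    )


if __name__ == "__main__":
    req = build_suggestion(
        slug="speechmatics-tts",
        field="FIELD",  # placeholder: replace with a key from the Agent JSON
        value="corrected value",
        source_url="https://source.example/page",
        excerpt="supporting quote",
        note="what changed and why",
    )
    print(req.get_method(), req.full_url)
    # To submit: urllib.request.urlopen(req). Submissions are rate-limited and reviewed.
```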

