OpenAI Text to Speech (gpt-4o-mini-tts / tts-1)

"Transform text into lifelike spoken audio" - OpenAI's TTS service enabling blog narration, multilingual audio production, and realtime voice output via gpt-4o-mini-tts, tts-1, and tts-1-hd models. [1]

platform.openai.com/docs/guides/text-to-speech · By OpenAI · Last verified 2026-06-21 · Source confidence: high

OpenAI Text to Speech converts text into lifelike spoken audio via three models, gpt-4o-mini-tts, tts-1, and tts-1-hd, targeting use cases such as voice agents, audiobooks, video narration, accessibility tools, and IVR. Pricing is usage-based: $15.00 per million characters for tts-1 and $30.00 per million characters for tts-1-hd, while gpt-4o-mini-tts is billed per token ($0.60 per million text input tokens and $12.00 per million audio output tokens). No sales call is required to get started. The REST API ships with official SDKs for Python, Node.js, Java, Go, Ruby, and .NET, and the service is backed by SOC 2 Type II, ISO 27001, HIPAA, GDPR, and PCI DSS compliance alongside a published SLA.

Best for / Avoid if

Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; Cost-sensitive teams - low, transparent entry price

Avoid if: You want to try it free before paying

Pricing & procurement

Pricing model: Usage-based [2]
Published pricing: Yes [3]
Free tier: No [4]
Self-serve signup: Yes
Requires sales call: No
Enterprise plan: Yes

Published prices
Plan	Item	Unit	Price (USD)
Pay As You Go	Speech synthesis (tts-1)	1M characters	$15.00
Pay As You Go	Speech synthesis (tts-1-hd)	1M characters	$30.00
Pay As You Go	Speech synthesis text input (gpt-4o-mini-tts)	1M text tokens	$0.60
Pay As You Go	Speech synthesis audio output (gpt-4o-mini-tts)	1M audio tokens	$12.00
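The published prices translate into a simple per-request estimate. The sketch below is illustrative only: the function names are hypothetical, the prices are copied from the table above and may change, and token counts for gpt-4o-mini-tts have to come from actual usage, since this profile gives no character-to-token ratio.

```python
from decimal import Decimal

# Prices in USD per 1M units, copied from the "Published prices" table.
# Assumption: these values are still current when you run this.
PER_MILLION_CHARS = {"tts-1": Decimal("15"), "tts-1-hd": Decimal("30")}
TEXT_PER_MILLION_TOKENS = Decimal("0.60")   # gpt-4o-mini-tts text input
AUDIO_PER_MILLION_TOKENS = Decimal("12")    # gpt-4o-mini-tts audio output
MILLION = Decimal(1_000_000)


def char_cost(model: str, characters: int) -> Decimal:
    """Estimated cost for the character-billed models (tts-1, tts-1-hd)."""
    return PER_MILLION_CHARS[model] * characters / MILLION


def token_cost(text_tokens: int, audio_tokens: int) -> Decimal:
    """Estimated cost for gpt-4o-mini-tts, which is billed per token."""
    text_part = TEXT_PER_MILLION_TOKENS * text_tokens / MILLION
    audio_part = AUDIO_PER_MILLION_TOKENS * audio_tokens / MILLION
    return text_part + audio_part


if __name__ == "__main__":
    # A 50,000-character chapter narrated with each character-billed model.
    for model in PER_MILLION_CHARS:
        print(model, char_cost(model, 50_000))
```

Decimal is used instead of float so that small per-request amounts add up without rounding drift when summed across many requests.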

Capabilities

Real-time streaming
Voice design

Supported actions: synthesize_speech, streaming_tts, voice_design, speed_control, instruction_based_voice_control, realtime_tts, custom_voice_creation [5]
Languages: Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh. Language support generally follows the Whisper model. [6]
Input types: plain text
Output types: mp3, opus, aac, flac, wav, pcm [7]
Webhooks: No [8]
Sandbox / test mode: No
SDK languages: Python, Node.js, Java, Go, Ruby, .NET [9]
MCP server: No [10]

Trust & compliance

SOC 2: SOC 2 Type II [11]
HIPAA: Yes [12]
GDPR: Yes [13]
ISO 27001: Yes [14]
PCI DSS: Yes [15]
Published SLA: Yes [16]
Rate limits: tts-1/tts-1-hd: Free 3 RPM / 200 RPD; Tier 1: 500 RPM; Tier 2: 2,500 RPM; Tier 3: 5,000 RPM; Tier 4: 7,500 RPM; Tier 5: 10,000 RPM. gpt-4o-mini-tts: Tier 1 500 RPM / 50,000 TPM; Tier 5 10,000 RPM / 8,000,000 TPM. [17]
Known restrictions: Maximum input length is 4096 characters per request for tts-1/tts-1-hd and 2,000 tokens for gpt-4o-mini-tts; Custom voices are limited to 20 per organization and require eligibility and sales contact; Custom voice audio samples must be 30 seconds or less; Custom voice consent phrases are required in specific languages; The instructions parameter (voice control) is available only on gpt-4o-mini-tts, not on tts-1 or tts-1-hd; SSE stream_format is not available for tts-1/tts-1-hd; SSML is not supported; Speed parameter range is 0.25 to 4.0; Usage policies require clear disclosure to end users that the TTS voice is AI-generated [18]
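Several of the limits above can be enforced on the client before a request is sent. The following is a minimal sketch, assuming the 4096-character limit, the 0.25 to 4.0 speed range, and the Tier 1 figure of 500 RPM listed in this profile; the helper names are hypothetical and the sentence-splitting rule is a simplification, not a documented algorithm.

```python
import re

MAX_CHARS = 4096            # per-request input limit listed for tts-1 / tts-1-hd
SPEED_RANGE = (0.25, 4.0)   # speed parameter range listed above


def split_for_tts(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-split any single sentence that is longer than the limit on its own.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}".strip() if current else sentence
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def validate_speed(speed: float) -> float:
    """Return speed unchanged, or raise if it falls outside the listed range."""
    low, high = SPEED_RANGE
    if not low <= speed <= high:
        raise ValueError(f"speed must be between {low} and {high}, got {speed}")
    return speed


def min_interval_seconds(requests_per_minute: int) -> float:
    """Smallest average gap between requests that stays under an RPM limit."""
    return 60.0 / requests_per_minute
```

For example, `min_interval_seconds(500)` gives the pacing for the Tier 1 limit, and each chunk returned by `split_for_tts` can be sent as its own request. The character limit does not apply to gpt-4o-mini-tts, whose limit is expressed in tokens.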

Developer surface

Docs rendering: static · markdown variants served

Integration

API style: rest
Base URL: https://api.openai.com/v1
Version: v1
Versioning: url
Stability: ga
Auth methods: api_key
Idempotency keys: No
Error format: vendor-specific
Rate limit: 500 requests / minute (Tier 1)
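The integration fields above are enough to sketch a raw REST call without an SDK. This example builds the request but does not send it. The base URL, the API-key auth method, and the output formats come from this profile; the `/audio/speech` path, the Bearer header scheme, the JSON field names, and the `alloy` voice are assumptions taken from OpenAI's public API reference rather than from this page, so check them against the current docs before relying on them.

```python
import json
import urllib.request

BASE_URL = "https://api.openai.com/v1"  # from the Integration fields
OUTPUT_TYPES = {"mp3", "opus", "aac", "flac", "wav", "pcm"}  # from Capabilities


def build_speech_request(
    api_key: str,
    text: str,
    model: str = "gpt-4o-mini-tts",
    voice: str = "alloy",            # assumption: a built-in voice name
    response_format: str = "mp3",
) -> urllib.request.Request:
    """Build (but do not send) a POST request for speech synthesis."""
    if response_format not in OUTPUT_TYPES:
        raise ValueError(f"unsupported response_format: {response_format}")
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/audio/speech",  # assumption: endpoint path
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumption: Bearer scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_speech_request("sk-your-key-here", "Hello from the API.")
    print(req.get_method(), req.full_url)
```

To send it, pass the request to `urllib.request.urlopen` and write the response bytes to a file with the matching extension. Because the profile lists no idempotency keys, a retry after a timeout may be billed as a second synthesis.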

SDKs

Python: openai
Node.js: openai
Java: openai-java
Go: openai-go
Ruby: openai
.NET: OpenAI

Adoption & maturity

Launched: 2023-11-06
GA: 2023-11-06

Other Text-to-Speech APIs

ElevenLabs Text to Speech
"Text to Speech with high quality, human-like AI voices"
Hybrid · free tier · public pricing · self-serve
Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Usage · free tier · public pricing · self-serve
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Usage · free tier · public pricing · self-serve
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Usage · free tier · public pricing · self-serve
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Hybrid · free tier · public pricing · self-serve
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
Usage · public pricing · self-serve


References

Each field above carries a numbered source.

[1] Description: developers.openai.com
[2] Pricing model: developers.openai.com · developers.openai.com
[3] Published pricing: developers.openai.com · developers.openai.com
[4] Free tier: developers.openai.com
[5] Supported actions: developers.openai.com · developers.openai.com
[6] Languages: developers.openai.com
[7] Output types: developers.openai.com
[8] Webhooks: developers.openai.com
[9] SDK languages: developers.openai.com
[10] MCP server: developers.openai.com
[11] SOC 2: trust.openai.com
[12] HIPAA: openai.com · help.openai.com
[13] GDPR: trust.openai.com
[14] ISO 27001: trust.openai.com
[15] PCI DSS: trust.openai.com
[16] Published SLA: openai.com
[17] Rate limits: developers.openai.com · developers.openai.com
[18] Known restrictions: developers.openai.com · developers.openai.com

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"streaming":true,"voice_design":true}
2026-06-21 Summary Md: (none) → OpenAI Text to Speech converts text into lifelike spoken audio via three models…
2026-06-21 Score Agent Friendliness: (none) → 30
2026-06-21 Score Pricing Transparency: (none) → 85
2026-06-21 Score Setup Speed: (none) → 60
2026-06-21 Score Docs Quality: (none) → 50
2026-06-21 Score Procurement Friction: (none) → 85
2026-06-21 Score Trust Readiness: (none) → 100
2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
2026-06-21 Avoid If: (none) → You want to try it free before paying
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Robots Allows Agents: (none) → Yes
2026-06-21 API Reference URL: (none) → https://platform.openai.com/api/reference/overview
2026-06-21 Markdown Docs Served: (none) → Yes
2026-06-21 Markdown Docs URL: (none) → https://platform.openai.com/docs/guides/text-to-speech.md
2026-06-21 Docs URL: (none) → https://developers.openai.com/api/docs
2026-06-21 Has Structured Data: (none) → No
2026-06-21 Llms Txt Present: (none) → No
2026-06-21 Rendering: (none) → static
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to No
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to type_2
2026-06-21 HIPAA: set to Yes
2026-06-21 GDPR: set to Yes
2026-06-21 ISO 27001: set to Yes
2026-06-21 PCI DSS: set to Yes
2026-06-21 SLA Published: set to Yes
2026-06-21 SLA URL: set to https://openai.com/api-scale-tier/
2026-06-21 Data Retention Policy URL: set to https://developers.openai.com/api/docs/guides/your-data
2026-06-21 Documented Rate Limits: set to tts-1/tts-1-hd: Free 3 RPM / 200 RPD; Tier 1: 500 RPM; Tier 2: 2,500 RPM; Tier …
2026-06-21 Rate Limit Requests: set to 500
2026-06-21 Rate Limit Window: set to minute
2026-06-21 Known Restrictions: set to Maximum input length: 4096 characters per request (tts-1/tts-1-hd); 2,000 token…
2026-06-21 Auth Methods: set to api_key
2026-06-21 Auth Docs URL: set to https://developers.openai.com/api/reference/overview
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to https://api.openai.com/v1
2026-06-21 API Version: set to v1
2026-06-21 Versioning Scheme: set to url
2026-06-21 Stability: set to ga
2026-06-21 Deprecation Policy URL: set to https://developers.openai.com/api/docs/deprecations
2026-06-21 MCP URL: set to https://developers.openai.com/mcp
2026-06-21 Quickstart URL: set to https://developers.openai.com/api/docs/guides/text-to-speech
2026-06-21 Idempotency Supported: set to No
2026-06-21 Error Format: set to vendor-specific
2026-06-21 Requires Verification: set to No
2026-06-21 Slug: set to openai-tts
2026-06-21 Price Basis: set to 1M characters
2026-06-21 Launched At: set to 2023-11-06

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/openai-tts \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/openai-tts/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

