OpenAI Text to Speech (gpt-4o-mini-tts / tts-1)
"Transform text into lifelike spoken audio" - OpenAI's TTS service enabling blog narration, multilingual audio production, and realtime voice output via gpt-4o-mini-tts, tts-1, and tts-1-hd models. [1]
OpenAI Text to Speech converts text into lifelike spoken audio via three models (gpt-4o-mini-tts, tts-1, and tts-1-hd), targeting use cases such as voice agents, audiobooks, video narration, accessibility tools, and interactive voice response (IVR). Pricing is usage-based and self-serve: tts-1 costs $15.00 per million characters, tts-1-hd costs $30.00 per million characters, and gpt-4o-mini-tts is billed per token, with no sales call required to get started. The REST API ships with official SDKs for Python, Node.js, Java, Go, Ruby, and .NET, and the service is backed by SOC 2 Type II, ISO 27001, HIPAA, GDPR, and PCI DSS compliance alongside a published SLA.
Best for / Avoid if
Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; Cost-sensitive teams - low, transparent entry price
Avoid if: You want to try it free before paying
Pricing & procurement
- Pricing model: Usage-based [2]
- Published pricing: ✓ Yes [3]
- Free tier: ✗ No [4]
- Self-serve signup: ✓ Yes
- Requires sales call: ✗ No
- Enterprise plan: ✓ Yes
| Plan | Item | Per | Amount (USD) |
|---|---|---|---|
| Pay As You Go | Speech synthesis (tts-1) | 1M characters | $15.00 |
| Pay As You Go | Speech synthesis (tts-1-hd) | 1M characters | $30.00 |
| Pay As You Go | Speech synthesis text input (gpt-4o-mini-tts) | 1M text tokens | $0.60 |
| Pay As You Go | Speech synthesis audio output (gpt-4o-mini-tts) | 1M audio tokens | $12.00 |
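The rates in the table translate directly into a cost estimate. The sketch below is illustrative only: the function and constant names are hypothetical, the rates are copied from the table above, and the token counts for gpt-4o-mini-tts are assumed to be known already (this profile does not say how to derive them from text or audio length).

```python
from decimal import Decimal

# USD per 1M units, copied from the pricing table above.
CHARACTER_RATES = {
    "tts-1": Decimal("15.00"),
    "tts-1-hd": Decimal("30.00"),
}
TOKEN_RATES = {
    "gpt-4o-mini-tts": {"text_in": Decimal("0.60"), "audio_out": Decimal("12.00")},
}
MILLION = Decimal(1_000_000)


def character_cost(model: str, characters: int) -> Decimal:
    """Estimated USD cost for a character-billed model (tts-1, tts-1-hd)."""
    return CHARACTER_RATES[model] * characters / MILLION


def token_cost(model: str, text_tokens: int, audio_tokens: int) -> Decimal:
    """Estimated USD cost for a token-billed model (gpt-4o-mini-tts)."""
    rates = TOKEN_RATES[model]
    total = rates["text_in"] * text_tokens + rates["audio_out"] * audio_tokens
    return total / MILLION
```

For example, `character_cost("tts-1", 500_000)` comes to $7.50, and the same text on tts-1-hd comes to $15.00.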
Capabilities
- Supported actions: synthesize_speech, streaming_tts, voice_design, speed_control, instruction_based_voice_control, realtime_tts, custom_voice_creation [5]
- Languages: Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh [6]
- Input types: plain text
- Output types: mp3, opus, aac, flac, wav, pcm [7]
- Webhooks: ✗ No [8]
- Sandbox / test mode: ✗ No
- SDK languages: Python, Node.js, Java, Go, Ruby, .NET [9]
- MCP server: ✗ No [10]
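As a rough illustration of the capabilities listed above, the sketch below builds a plain REST request using only the Python standard library. Several details are assumptions not stated in this profile: the `/audio/speech` path, the `voice` and `response_format` field names, the `alloy` voice, bearer-token authentication, and the `OPENAI_API_KEY` environment variable. The helper names are hypothetical; check the official API reference before relying on any of this.

```python
import json
import os
import urllib.request

BASE_URL = "https://api.openai.com/v1"  # base URL as recorded in this profile
OUTPUT_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}  # from the list above


def build_speech_request(text, model="gpt-4o-mini-tts", voice="alloy",
                         response_format="mp3", api_key=None):
    """Build (but do not send) a speech-synthesis request.

    The endpoint path, field names, and default voice are assumptions.
    """
    if response_format not in OUTPUT_FORMATS:
        raise ValueError(f"unsupported output format: {response_format}")
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    key = api_key or os.environ.get("OPENAI_API_KEY", "")
    return urllib.request.Request(
        f"{BASE_URL}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize_to_file(text, path, **kwargs):
    """Send the request and write the returned audio bytes to `path`."""
    request = build_speech_request(text, **kwargs)
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())
```

Separating request construction from the network call keeps the payload easy to inspect and test without spending credits; `synthesize_to_file("Hello", "hello.mp3")` would perform the actual call.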
Trust & compliance
- SOC 2: SOC 2 Type II [11]
- HIPAA: ✓ Yes [12]
- GDPR: ✓ Yes [13]
- ISO 27001: ✓ Yes [14]
- PCI DSS: ✓ Yes [15]
- Published SLA: ✓ Yes [16]
- Rate limits, in requests per minute (RPM), requests per day (RPD), and tokens per minute (TPM) [17]:
  - tts-1 / tts-1-hd: Free 3 RPM / 200 RPD; Tier 1: 500 RPM; Tier 2: 2,500 RPM; Tier 3: 5,000 RPM; Tier 4: 7,500 RPM; Tier 5: 10,000 RPM
  - gpt-4o-mini-tts: Tier 1: 500 RPM / 50,000 TPM; Tier 5: 10,000 RPM / 8,000,000 TPM
- Known restrictions [18]:
  - Maximum input length: 4,096 characters per request (tts-1/tts-1-hd); 2,000 tokens for gpt-4o-mini-tts
  - Custom voices are limited to 20 per organization and require eligibility and sales contact
  - Custom voice audio samples must be 30 seconds or less
  - The instructions parameter (voice control) is available only on gpt-4o-mini-tts, not on tts-1 or tts-1-hd
  - SSE stream_format is not available for tts-1/tts-1-hd
  - Usage policies require clear disclosure to end users that the TTS voice is AI-generated
  - Custom voice consent phrases are required in specific languages
  - SSML is not supported
  - Speed parameter range: 0.25 to 4.0
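The input-length, speed, and rate limits above lend themselves to client-side checks before any request is sent. The following is a minimal sketch with hypothetical helper names. It handles only the character-based limit for tts-1/tts-1-hd; the token-based limit for gpt-4o-mini-tts would need a tokenizer and is not covered here.

```python
MAX_CHARS = 4096            # per-request limit listed for tts-1 / tts-1-hd
SPEED_RANGE = (0.25, 4.0)   # speed parameter range listed above


def chunk_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into pieces of at most `limit` characters.

    Breaks at the last space or newline within the limit when one exists,
    and falls back to a hard cut otherwise.
    """
    chunks = []
    remaining = text.strip()
    while len(remaining) > limit:
        cut = max(remaining.rfind(" ", 0, limit + 1),
                  remaining.rfind("\n", 0, limit + 1))
        if cut <= 0:
            cut = limit  # no whitespace found: hard cut at the limit
        chunks.append(remaining[:cut].rstrip())
        remaining = remaining[cut:].lstrip()
    if remaining:
        chunks.append(remaining)
    return chunks


def validate_speed(speed: float) -> float:
    """Return `speed` unchanged, or raise if it is outside the listed range."""
    low, high = SPEED_RANGE
    if not low <= speed <= high:
        raise ValueError(f"speed must be between {low} and {high}, got {speed}")
    return speed


def min_interval_seconds(requests_per_minute: int) -> float:
    """Even spacing between requests that stays within an RPM limit."""
    return 60.0 / requests_per_minute
```

At the Tier 1 limit of 500 RPM, `min_interval_seconds(500)` gives 0.12 seconds between requests. Even pacing is only one simple strategy; a production client would likely also retry on rate-limit errors.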
Developer surface
Integration
Adoption & maturity
- Launched: 2023-11-06
- GA: 2023-11-06
Other Text-to-Speech APIs
ElevenLabs Text to Speech
"Text to Speech with high quality, human-like AI voices"
Azure AI Text to Speech
"Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."
Amazon Polly
"Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."
Google Cloud Text-to-Speech
"Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."
Cartesia (Sonic)
"The fastest and most natural text to speech model"
Murf AI
"Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."
References
- [1] Description: developers.openai.com
- [2] Pricing model: developers.openai.com · developers.openai.com
- [3] Published pricing: developers.openai.com · developers.openai.com
- [4] Free tier: developers.openai.com
- [5] Supported actions: developers.openai.com · developers.openai.com
- [6] Languages: developers.openai.com
- [7] Output types: developers.openai.com
- [8] Webhooks: developers.openai.com
- [9] SDK languages: developers.openai.com
- [10] MCP server: developers.openai.com
- [11] SOC 2: trust.openai.com
- [12] HIPAA: openai.com · help.openai.com
- [13] GDPR: trust.openai.com
- [14] ISO 27001: trust.openai.com
- [15] PCI DSS: trust.openai.com
- [16] Published SLA: openai.com
- [17] Rate limits: developers.openai.com · developers.openai.com
- [18] Known restrictions: developers.openai.com · developers.openai.com
Change history
- 2026-06-21 Capabilities: {} → {"streaming":true,"voice_design":true}
- 2026-06-21 Summary Md: (none) → OpenAI Text to Speech converts text into lifelike spoken audio via three models…
- 2026-06-21 Score Agent Friendliness: (none) → 30
- 2026-06-21 Score Pricing Transparency: (none) → 85
- 2026-06-21 Score Setup Speed: (none) → 60
- 2026-06-21 Score Docs Quality: (none) → 50
- 2026-06-21 Score Procurement Friction: (none) → 85
- 2026-06-21 Score Trust Readiness: (none) → 100
- 2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
- 2026-06-21 Avoid If: (none) → You want to try it free before paying
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 API Reference URL: (none) → https://platform.openai.com/api/reference/overview
- 2026-06-21 Markdown Docs Served: (none) → Yes
- 2026-06-21 Markdown Docs URL: (none) → https://platform.openai.com/docs/guides/text-to-speech.md
- 2026-06-21 Docs URL: (none) → https://developers.openai.com/api/docs
- 2026-06-21 Has Structured Data: (none) → No
- 2026-06-21 Llms Txt Present: (none) → No
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_2
- 2026-06-21 HIPAA: set to Yes
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 ISO 27001: set to Yes
- 2026-06-21 PCI DSS: set to Yes
- 2026-06-21 SLA Published: set to Yes
- 2026-06-21 SLA URL: set to https://openai.com/api-scale-tier/
- 2026-06-21 Data Retention Policy URL: set to https://developers.openai.com/api/docs/guides/your-data
- 2026-06-21 Documented Rate Limits: set to tts-1/tts-1-hd: Free 3 RPM / 200 RPD; Tier 1: 500 RPM; Tier 2: 2,500 RPM; Tier …
- 2026-06-21 Rate Limit Requests: set to 500
- 2026-06-21 Rate Limit Window: set to minute
- 2026-06-21 Known Restrictions: set to Maximum input length: 4096 characters per request (tts-1/tts-1-hd); 2,000 token…
- 2026-06-21 Auth Methods: set to api_key
- 2026-06-21 Auth Docs URL: set to https://developers.openai.com/api/reference/overview
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://api.openai.com/v1
- 2026-06-21 API Version: set to v1
- 2026-06-21 Versioning Scheme: set to url
- 2026-06-21 Stability: set to ga
- 2026-06-21 Deprecation Policy URL: set to https://developers.openai.com/api/docs/deprecations
- 2026-06-21 MCP URL: set to https://developers.openai.com/mcp
- 2026-06-21 Quickstart URL: set to https://developers.openai.com/api/docs/guides/text-to-speech
- 2026-06-21 Idempotency Supported: set to No
- 2026-06-21 Error Format: set to vendor-specific
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Slug: set to openai-tts
- 2026-06-21 Price Basis: set to 1M characters
- 2026-06-21 Launched At: set to 2023-11-06