OpenAI Text to Speech (gpt-4o-mini-tts / tts-1)

"Transform text into lifelike spoken audio" - OpenAI's TTS service enabling blog narration, multilingual audio production, and realtime voice output via gpt-4o-mini-tts, tts-1, and tts-1-hd models. [1]

platform.openai.com/docs/guides/text-to-speech · By OpenAI · Last verified 2026-06-21 · Source confidence: high

OpenAI Text to Speech converts text into lifelike spoken audio via three models, gpt-4o-mini-tts, tts-1, and tts-1-hd, targeting use cases such as voice agents, audiobooks, video narration, accessibility tools, and IVR. Pricing is usage-based: $15.00 per million characters for tts-1, $30.00 per million characters for tts-1-hd, and token-based rates for gpt-4o-mini-tts ($0.60 per million text input tokens, $12.00 per million audio output tokens). No sales call is required to get started. The REST API ships with official SDKs for Python, Node.js, Java, Go, Ruby, and .NET, and the service is backed by SOC 2 Type II, ISO 27001, HIPAA, GDPR, and PCI DSS compliance alongside a published SLA.

Best for / Avoid if

Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; Cost-sensitive teams - low, transparent entry price

Avoid if: You want to try it free before paying

Pricing & procurement

Pricing model
Usage-based [2]
Published pricing
Yes [3]
Free tier
No [4]
Self-serve signup
Yes
Requires sales call
No
Enterprise plan
Yes
Published prices
Plan           Item                                             Per               Amount
Pay As You Go  Speech synthesis (tts-1)                         1M characters     $15.00
Pay As You Go  Speech synthesis (tts-1-hd)                      1M characters     $30.00
Pay As You Go  Speech synthesis text input (gpt-4o-mini-tts)    1M text tokens    $0.60
Pay As You Go  Speech synthesis audio output (gpt-4o-mini-tts)  1M audio tokens   $12.00
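
As a rough illustration of the per-character rates published above for tts-1 and tts-1-hd, the sketch below estimates the cost of a single piece of text. It assumes billing follows the character count of the input and that Python's len() is a close enough proxy for a billed character. Neither detail is stated in this profile. The function name and the 6,000-character workload are hypothetical. gpt-4o-mini-tts is left out because it is billed per token rather than per character.

```python
# Rates copied from the published price table (USD per 1M characters).
USD_PER_MILLION_CHARS = {"tts-1": 15.00, "tts-1-hd": 30.00}


def estimate_cost_usd(model: str, text: str) -> float:
    """Estimate the synthesis cost of `text` for a character-billed model.

    Assumption: one billed character equals one Python string character.
    """
    rate = USD_PER_MILLION_CHARS[model]
    return len(text) * rate / 1_000_000


# Hypothetical workload: a 6,000-character blog post.
post = "x" * 6_000
print(f"tts-1:    ${estimate_cost_usd('tts-1', post):.4f}")
print(f"tts-1-hd: ${estimate_cost_usd('tts-1-hd', post):.4f}")
```

At these rates, tts-1-hd always costs exactly twice what tts-1 does for the same input.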

Capabilities

  • Real-time streaming
  • Voice design
Supported actions
synthesize_speech, streaming_tts, voice_design, speed_control, instruction_based_voice_control, realtime_tts, custom_voice_creation [5]
Languages
Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh [6]
Input types
plain text
Output types
mp3, opus, aac, flac, wav, pcm [7]
Webhooks
No [8]
Sandbox / test mode
No
SDK languages
Python, Node.js, Java, Go, Ruby, .NET [9]
MCP server
No [10]

Trust & compliance

SOC 2
SOC 2 Type II [11]
HIPAA
Yes [12]
GDPR
Yes [13]
ISO 27001
Yes [14]
PCI DSS
Yes [15]
Published SLA
Yes [16]
Rate limits
tts-1/tts-1-hd: Free 3 RPM / 200 RPD; Tier 1: 500 RPM; Tier 2: 2,500 RPM; Tier 3: 5,000 RPM; Tier 4: 7,500 RPM; Tier 5: 10,000 RPM. gpt-4o-mini-tts: Tier 1 500 RPM / 50,000 TPM; Tier 5 10,000 RPM / 8,000,000 TPM. [17]
Known restrictions
  • Maximum input length: 4096 characters per request (tts-1/tts-1-hd); 2,000 tokens for gpt-4o-mini-tts
  • Custom voices limited to up to 20 per organization; requires eligibility and sales contact
  • Custom voice audio samples must be 30 seconds or less
  • Instructions parameter (voice control) not available on tts-1 or tts-1-hd; only gpt-4o-mini-tts
  • SSE stream_format not available for tts-1/tts-1-hd
  • Usage policies require clear disclosure to end users that TTS voice is AI-generated
  • Custom voice consent phrases required in specific languages
  • SSML not supported
  • Speed parameter range: 0.25 to 4.0 [18]
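
The 4096-character limit per request for tts-1 and tts-1-hd means longer texts have to be split before synthesis. The sketch below is one possible way to do that. It breaks at whitespace where it can and falls back to a hard split otherwise. The function name and the splitting strategy are illustrative assumptions, not something this profile or the vendor prescribes. It counts Python string characters, which may not match how the service counts.

```python
MAX_CHARS = 4096  # per-request input limit listed for tts-1 / tts-1-hd


def split_for_tts(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Prefers to break at a space or newline; if a run of `limit`
    characters contains neither, it is cut at exactly `limit`.
    """
    chunks = []
    remaining = text.strip()
    while len(remaining) > limit:
        # Last space or newline at an index <= limit.
        cut = max(
            remaining.rfind(" ", 0, limit + 1),
            remaining.rfind("\n", 0, limit + 1),
        )
        if cut <= 0:
            cut = limit  # no whitespace available: hard split
        chunks.append(remaining[:cut].rstrip())
        remaining = remaining[cut:].lstrip()
    if remaining:
        chunks.append(remaining)
    return chunks
```

Each returned chunk can be sent as its own request and the resulting audio segments joined afterwards. Splitting at sentence boundaries instead of arbitrary whitespace would probably give more natural prosody at the joins.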

Developer surface

Docs rendering: static · markdown variants served

Integration

API style
rest
Base URL
https://api.openai.com/v1
Version
v1
Versioning
url
Stability
ga
Auth methods
api_key
Idempotency keys
No
Error format
vendor-specific
Rate limit
500 / minute (Tier 1)
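
To show how the integration fields above fit together, here is a minimal sketch that builds, but does not send, a synthesis request using only the Python standard library. The base URL, the output formats, and the speed range come from this profile. The /audio/speech path, the Bearer authorization header, the JSON field names, and the "alloy" voice are not stated here and should be read as assumptions to check against the official API reference. The helper name is hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://api.openai.com/v1"   # from the Integration fields
SPEECH_PATH = "/audio/speech"            # assumption: not stated in this profile
OUTPUT_TYPES = {"mp3", "opus", "aac", "flac", "wav", "pcm"}  # listed output types


def build_speech_request(api_key, text, model="tts-1", voice="alloy",
                         response_format="mp3", speed=1.0):
    """Build a POST request for speech synthesis without sending it."""
    if response_format not in OUTPUT_TYPES:
        raise ValueError(f"unsupported output type: {response_format}")
    if not 0.25 <= speed <= 4.0:  # speed range listed under Known restrictions
        raise ValueError("speed must be between 0.25 and 4.0")
    payload = {
        "model": model,
        "input": text,
        "voice": voice,  # assumption: example voice name
        "response_format": response_format,
        "speed": speed,
    }
    return urllib.request.Request(
        BASE_URL + SPEECH_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumption: API key sent as Bearer token
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

If these assumptions hold, passing the returned object to urllib.request.urlopen would send the request and yield the audio bytes in the response body. Validating the format and speed locally avoids spending a rate-limited request on input the service would reject.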

SDKs

  • Python openai · repo
  • Node.js openai · repo
  • Java openai-java · repo
  • Go openai-go · repo
  • Ruby openai · repo
  • .NET OpenAI · repo

Adoption & maturity

Launched
2023-11-06
GA
2023-11-06

Other Text-to-Speech APIs

  • ElevenLabs Text to Speech

    "Text to Speech with high quality, human-like AI voices"

    Hybrid · free tier · public pricing · self-serve

  • Azure AI Text to Speech

    "Text to speech enables your applications, tools, or devices to convert text into human like synthesized speech. The text to speech capability is also known as speech synthesis. Use human like standard voices out of the box, or create a custom voice that's unique to your product or brand."

    Usage · free tier · public pricing · self-serve

  • Amazon Polly

    "Amazon Polly is a cloud service that converts text into lifelike speech. You can use Amazon Polly to develop applications that increase engagement and accessibility."

    Usage · free tier · public pricing · self-serve

  • Google Cloud Text-to-Speech

    "Cloud Text-to-Speech converts text or Speech Synthesis Markup Language (SSML) input into audio data of natural human speech."

    Usage · free tier · public pricing · self-serve

  • Cartesia (Sonic)

    "The fastest and most natural text to speech model"

    Hybrid · free tier · public pricing · self-serve

  • Murf AI

    "Enterprise-grade AI voice generation with 150+ natural-sounding voices across 35 languages and 20+ speaking styles."

    Usage · public pricing · self-serve


References

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-21 Capabilities: {} → {"streaming":true,"voice_design":true}
  2. 2026-06-21 Summary Md: (none) → OpenAI Text to Speech converts text into lifelike spoken audio via three models…
  3. 2026-06-21 Score Agent Friendliness: (none) → 30
  4. 2026-06-21 Score Pricing Transparency: (none) → 85
  5. 2026-06-21 Score Setup Speed: (none) → 60
  6. 2026-06-21 Score Docs Quality: (none) → 50
  7. 2026-06-21 Score Procurement Friction: (none) → 85
  8. 2026-06-21 Score Trust Readiness: (none) → 100
  9. 2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
  10. 2026-06-21 Avoid If: (none) → You want to try it free before paying
  11. 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
  12. 2026-06-21 Robots Allows Agents: (none) → Yes
  13. 2026-06-21 API Reference URL: (none) → https://platform.openai.com/api/reference/overview
  14. 2026-06-21 Markdown Docs Served: (none) → Yes
  15. 2026-06-21 Markdown Docs URL: (none) → https://platform.openai.com/docs/guides/text-to-speech.md
  16. 2026-06-21 Docs URL: (none) → https://developers.openai.com/api/docs
  17. 2026-06-21 Has Structured Data: (none) → No
  18. 2026-06-21 Llms Txt Present: (none) → No
  19. 2026-06-21 Rendering: (none) → static
  20. 2026-06-21 Self Serve Signup: set to Yes
  21. 2026-06-21 Requires Sales Call: set to No
  22. 2026-06-21 Enterprise Plan Available: set to Yes
  23. 2026-06-21 SOC 2: set to type_2
  24. 2026-06-21 HIPAA: set to Yes
  25. 2026-06-21 GDPR: set to Yes
  26. 2026-06-21 ISO 27001: set to Yes
  27. 2026-06-21 PCI DSS: set to Yes
  28. 2026-06-21 SLA Published: set to Yes
  29. 2026-06-21 SLA URL: set to https://openai.com/api-scale-tier/
  30. 2026-06-21 Data Retention Policy URL: set to https://developers.openai.com/api/docs/guides/your-data
  31. 2026-06-21 Documented Rate Limits: set to tts-1/tts-1-hd: Free 3 RPM / 200 RPD; Tier 1: 500 RPM; Tier 2: 2,500 RPM; Tier …
  32. 2026-06-21 Rate Limit Requests: set to 500
  33. 2026-06-21 Rate Limit Window: set to minute
  34. 2026-06-21 Known Restrictions: set to Maximum input length: 4096 characters per request (tts-1/tts-1-hd); 2,000 token…
  35. 2026-06-21 Auth Methods: set to api_key
  36. 2026-06-21 Auth Docs URL: set to https://developers.openai.com/api/reference/overview
  37. 2026-06-21 API Style: set to rest
  38. 2026-06-21 Base URL: set to https://api.openai.com/v1
  39. 2026-06-21 API Version: set to v1
  40. 2026-06-21 Versioning Scheme: set to url
  41. 2026-06-21 Stability: set to ga
  42. 2026-06-21 Deprecation Policy URL: set to https://developers.openai.com/api/docs/deprecations
  43. 2026-06-21 MCP URL: set to https://developers.openai.com/mcp
  44. 2026-06-21 Quickstart URL: set to https://developers.openai.com/api/docs/guides/text-to-speech
  45. 2026-06-21 Idempotency Supported: set to No
  46. 2026-06-21 Error Format: set to vendor-specific
  47. 2026-06-21 Requires Verification: set to No
  48. 2026-06-21 Slug: set to openai-tts
  49. 2026-06-21 Price Basis: set to 1M characters
  50. 2026-06-21 Launched At: set to 2023-11-06

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/openai-tts \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/openai-tts/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'
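
For agents that prefer to submit corrections from code, the curl call above could be mirrored roughly as follows. This sketch only builds the request. The endpoint and the JSON shape are taken from the curl example, while the helper name and its parameters are hypothetical, and the field key passed in must be a real key from the Agent JSON.

```python
import json
import urllib.request


def build_suggestion(field, value, source_url, excerpt, note):
    """Build (but do not send) a field-correction request for the openai-tts profile."""
    body = {
        "value": value,
        "citations": [{"url": source_url, "excerpt": excerpt}],
        "note": note,
    }
    return urllib.request.Request(
        f"https://apio.sh/api/suggest/openai-tts/{field}",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the result to urllib.request.urlopen would submit it. Since submissions are rate-limited and reviewed before going live, building and inspecting the payload first seems a reasonable precaution.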
