OpenAI
AI research and API company; audio, realtime voice, and language models.
API products
OpenAI Realtime API (gpt-realtime)
"The Realtime API enables low-latency, bidirectional audio communication for building voice agents and audio applications."
OpenAI Speech-to-Text
"The Audio API provides two speech-to-text endpoints, transcriptions and translations, based on our state-of-the-art open source large-v2 Whisper model."
OpenAI Text to Speech (gpt-4o-mini-tts / tts-1)
"Transform text into lifelike spoken audio" - OpenAI's TTS service enabling blog narration, multilingual audio production, and realtime voice output via gpt-4o-mini-tts, tts-1, and tts-1-hd models.