Soniox
"Turn live conversations into structured intelligence." [1]
Soniox is a speech-to-text API built for real-time and batch transcription workloads, targeting voice agents, call centers, medical teams, and media producers who need multilingual support, speaker diarization, and word-level timestamps. Pricing is usage-based at $0.0017 per minute with self-serve sign-up and no sales call required, though free credits were discontinued in October 2025. The platform holds SOC 2 Type 2, HIPAA, GDPR, and ISO 27001 certifications, with data residency options across the United States, European Union, and Japan. SDKs are available for Python, Node.js, and browser JavaScript, and an MCP server is also supported.
Best for / Avoid if
Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box
Avoid if: You want to try it free before paying
Pricing & procurement
- Pricing model
- Usage-based [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✗ No [4]
- Self-serve signup
- ✓ Yes [5]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [6]
| Item | Per | Amount | Source |
|---|---|---|---|
| Async (file) transcription - input audio tokens | 1M tokens | $1.5 | source |
| Async (file) transcription - input text tokens | 1M tokens | $3.5 | source |
| Async (file) transcription - output text tokens | 1M tokens | $3.5 | source |
| Real-time (streaming) transcription - input audio tokens | 1M tokens | $2 | source |
| Real-time (streaming) transcription - input text tokens | 1M tokens | $4 | source |
| Real-time (streaming) transcription - output text tokens | 1M tokens | $4 | source |
Capabilities
- Supported actions
- transcribe_streaming, transcribe_batch, speaker_diarization, language_detection, word_timestamps, translation, multilingual_transcription, context_customization, turn_detection, temporary_api_keys, webhook_notifications [7]
- Regions
- United States, European Union, Japan [8]
- Languages
- Afrikaans, Albanian, Arabic, Azerbaijani, Basque, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Malayalam, Marathi, Norwegian, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh [9]
- Input types
- audio/aac, audio/aiff, audio/amr, audio/asf, audio/flac, audio/mp3, audio/ogg, audio/wav, audio/webm, audio/m4a, video/mp4, PCM s8, PCM s16le, PCM s24le, PCM s32le, PCM f32le, mulaw, alaw, live WebSocket stream, file URL, local file upload [10]
- Output types
- JSON with word-level tokens, start_ms/end_ms word timestamps, speaker labels, language codes, confidence scores, is_final flags [11]
- Webhooks
- ✓ Yes [12]
- Sandbox / test mode
- ✗ No [13]
- SDK languages
- Python, Node.js, JavaScript (browser) [14]
- MCP server
- ✓ Yes [15]
Trust & compliance
- SOC 2
- SOC 2 Type II [16]
- HIPAA
- ✓ Yes [17]
- GDPR
- ✓ Yes [18]
- ISO 27001
- ✓ Yes [19]
- PCI DSS
- ✗ No [20]
- Published SLA
- ✓ Yes [21]
- Rate limits
- Async API: max 1,000 files stored simultaneously, 10 GB total storage, 100 pending transcriptions, 2,000 total transcriptions, 300-minute max file duration (non-negotiable). Real-time WebSocket: per-organization and per-project concurrent request caps and requests-per-minute caps (exact numbers not published; increase requests via Soniox Console). [22]
- Known restrictions
- Maximum audio file duration is 300 minutes (fixed, cannot be increased), Uploaded files are not deleted automatically - must be manually deleted, PCI DSS compliance not stated, Specific concurrent connection numbers not published - must request increase via Console, Free credits discontinued as of October 2025 - pay-as-you-go only [23]
Developer surface
Integration
Adoption & maturity
- Launched
- 2020-01-01
- GA
- 2022-01-01
- Notable customers
- Scribe
Other Speech-to-Text & Transcription APIs
ElevenLabs Scribe (Speech to Text)
"Scribe v2 is the most accurate Speech to Text model" offering "real-time Speech to Text in under 150 ms" across "90+ languages."
Azure AI Speech to Text
"Azure Speech in Foundry Tools provides speech to text, text to speech, and other capabilities through a Microsoft Foundry resource. You can transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and conduct live AI voice conversations."
Amazon Transcribe
"Amazon Transcribe is an automatic speech recognition service that uses machine learning models to convert audio to text. You can use Amazon Transcribe as a standalone transcription service or to add speech-to-text capabilities to any application."
Google Cloud Speech-to-Text
"Accurate voice typing and transcription powered by Gemini."
IBM watsonx Speech to Text
"IBM Watson® Speech to Text technology enables fast and accurate speech transcription in multiple languages for a variety of use cases, including but not limited to customer self-service, agent assistance and speech analytics."
AssemblyAI
"Voice AI infrastructure for developers building products that transcribe, understand, and act on speech."
References
- ↑Description: soniox.com
- ↑Pricing model: soniox.com · soniox.com
- ↑Published pricing: soniox.com
- ↑Free tier: soniox.com · soniox.com
- ↑Self-serve signup: soniox.com
- ↑Enterprise plan: soniox.com
- ↑Supported actions: soniox.com · soniox.com
- ↑Regions: soniox.com · soniox.com
- ↑Languages: soniox.com · soniox.com
- ↑Input types: soniox.com · soniox.com
- ↑Output types: soniox.com
- ↑Webhooks: soniox.com
- ↑Sandbox: soniox.com
- ↑SDK languages: github.com · soniox.com
- ↑MCP server: soniox.com · soniox.com
- ↑SOC 2: soniox.com · soniox.com
- ↑HIPAA: soniox.com · soniox.com
- ↑GDPR: soniox.com · soniox.com
- ↑ISO 27001: soniox.com · soniox.com
- ↑PCI DSS: soniox.com
- ↑Published SLA: soniox.com
- ↑Rate limits: soniox.com · soniox.com
- ↑Known restrictions: soniox.com · soniox.com
Change history
- 2026-06-21 Capabilities: {} → {"medical":true,"translation":true,"real_time_streaming":true,"speaker_diarizat…
- 2026-06-21 Summary Md: (none) → Soniox is a speech-to-text API built for real-time and batch transcription work…
- 2026-06-21 Score Docs Quality: (none) → 25
- 2026-06-21 Score Procurement Friction: (none) → 85
- 2026-06-21 Score Trust Readiness: (none) → 90
- 2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
- 2026-06-21 Avoid If: (none) → You want to try it free before paying
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Score Agent Friendliness: (none) → 65
- 2026-06-21 Score Pricing Transparency: (none) → 85
- 2026-06-21 Score Setup Speed: (none) → 60
- 2026-06-21 Llms Txt Present: (none) → Yes
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Has Structured Data: (none) → Yes
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 Status Page URL: (none) → https://status.soniox.com
- 2026-06-21 Docs URL: (none) → https://soniox.com/docs
- 2026-06-21 Llms Txt URL: (none) → https://soniox.com/llms.txt
- 2026-06-21 Free Tier Available: set to No
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_2
- 2026-06-21 HIPAA: set to Yes
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 ISO 27001: set to Yes
- 2026-06-21 PCI DSS: set to No
- 2026-06-21 SLA Published: set to Yes
- 2026-06-21 Data Retention Policy URL: set to https://soniox.com/docs/security-and-privacy
- 2026-06-21 Documented Rate Limits: set to Async API: max 1,000 files stored simultaneously, 10 GB total storage, 100 pend…
- 2026-06-21 Known Restrictions: set to Maximum audio file duration is 300 minutes (fixed, cannot be increased), Upload…
- 2026-06-21 Auth Methods: set to api_key
- 2026-06-21 Auth Docs URL: set to https://soniox.com/docs/api-reference/auth/create_temporary_api_key
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://api.soniox.com/v1
- 2026-06-21 API Version: set to v1
- 2026-06-21 Versioning Scheme: set to url
- 2026-06-21 Stability: set to ga
- 2026-06-21 MCP URL: set to https://soniox.com/docs/api/mcp/mcp
- 2026-06-21 Quickstart URL: set to https://soniox.com/docs/stt/get-started
- 2026-06-21 Error Format: set to vendor-specific
- 2026-06-21 Webhook Events URL: set to https://soniox.com/docs/stt/async/webhooks
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Slug: set to soniox
- 2026-06-21 Price Basis: set to minute
- 2026-06-21 Launched At: set to 2020-01-01
- 2026-06-21 GA Date: set to 2022-01-01
- 2026-06-21 Notable Customers: set to Scribe
- 2026-06-21 Fields Not Found: set to exact concurrent connection numbers for real-time streaming, maximum file uploa…
- 2026-06-21 Source Confidence: set to high
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/soniox \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/soniox/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'