Soniox

"Turn live conversations into structured intelligence." [1]

soniox.com · By Soniox · Last verified 2026-06-21 · Source confidence: high

Soniox is a speech-to-text API built for real-time and batch transcription workloads, targeting voice agents, call centers, medical teams, and media producers who need multilingual support, speaker diarization, and word-level timestamps. Pricing is usage-based at $0.0017 per minute, with self-serve sign-up and no sales call required; free credits were discontinued in October 2025. The platform lists SOC 2 Type II and ISO 27001 attestations plus HIPAA and GDPR compliance, with data residency options across the United States, European Union, and Japan. SDKs are available for Python, Node.js, and browser JavaScript, and an MCP server is also supported.

Best for / Avoid if

Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box

Avoid if: You want to try it free before paying

Pricing & procurement

Pricing model
Usage-based [2]
Published pricing
Yes [3]
Free tier
No [4]
Self-serve signup
Yes [5]
Requires sales call
No
Enterprise plan
Yes [6]
Published prices
Item                                                        Per        Amount
Async (file) transcription - input audio tokens             1M tokens  $1.5
Async (file) transcription - input text tokens              1M tokens  $3.5
Async (file) transcription - output text tokens             1M tokens  $3.5
Real-time (streaming) transcription - input audio tokens    1M tokens  $2
Real-time (streaming) transcription - input text tokens     1M tokens  $4
Real-time (streaming) transcription - output text tokens    1M tokens  $4
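The per-token prices above translate into a bill by simple multiplication. The sketch below is one way to estimate it, assuming you already know your token counts; the function name, the mode keys, and the example counts are illustrative and not part of the Soniox API.

```python
from decimal import Decimal

# Published prices in USD per 1M tokens, copied from the price list above.
PRICES_PER_MILLION = {
    "async": {
        "input_audio": Decimal("1.5"),
        "input_text": Decimal("3.5"),
        "output_text": Decimal("3.5"),
    },
    "realtime": {
        "input_audio": Decimal("2"),
        "input_text": Decimal("4"),
        "output_text": Decimal("4"),
    },
}


def estimate_cost(mode, input_audio=0, input_text=0, output_text=0):
    """Return an estimated USD cost for the given token counts.

    `mode` is "async" or "realtime" (hypothetical labels for the two
    price groups); the counts are plain integers supplied by the caller.
    """
    prices = PRICES_PER_MILLION[mode]
    counts = {
        "input_audio": input_audio,
        "input_text": input_text,
        "output_text": output_text,
    }
    total = sum(prices[kind] * count for kind, count in counts.items())
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Hypothetical workload: 2M audio tokens in, 100k text tokens out.
    print(estimate_cost("async", input_audio=2_000_000, output_text=100_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up rounding noise when summed over many jobs.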

Capabilities

  • Real-time streaming
  • Speaker diarization
  • Speech translation
  • Medical transcription
Supported actions
transcribe_streaming, transcribe_batch, speaker_diarization, language_detection, word_timestamps, translation, multilingual_transcription, context_customization, turn_detection, temporary_api_keys, webhook_notifications [7]
Regions
United States, European Union, Japan [8]
Languages
Afrikaans, Albanian, Arabic, Azerbaijani, Basque, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Malayalam, Marathi, Norwegian, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh [9]
Input types
audio/aac, audio/aiff, audio/amr, audio/asf, audio/flac, audio/mp3, audio/ogg, audio/wav, audio/webm, audio/m4a, video/mp4, PCM s8, PCM s16le, PCM s24le, PCM s32le, PCM f32le, mulaw, alaw, live WebSocket stream, file URL, local file upload [10]
Output types
JSON with word-level tokens, start_ms/end_ms word timestamps, speaker labels, language codes, confidence scores, is_final flags [11]
Webhooks
Yes [12]
Sandbox / test mode
No [13]
SDK languages
Python, Node.js, JavaScript (browser) [14]
MCP server
Yes [15]
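To make the listed output types concrete, here is a minimal sketch of turning word-level tokens into speaker turns. Only start_ms, end_ms, and is_final are named in this profile; the "tokens", "text", and "speaker" keys and the sample payload are assumptions for illustration, so check the real response shape in the vendor docs before relying on it.

```python
# Hypothetical response payload. Key names other than start_ms, end_ms and
# is_final are assumptions, not confirmed by this profile.
SAMPLE = {
    "tokens": [
        {"text": "Hello", "start_ms": 0, "end_ms": 400, "speaker": "1", "is_final": True},
        {"text": " there", "start_ms": 420, "end_ms": 800, "speaker": "1", "is_final": True},
        {"text": "Hi", "start_ms": 900, "end_ms": 1100, "speaker": "2", "is_final": True},
        {"text": " the", "start_ms": 1150, "end_ms": 1300, "speaker": "2", "is_final": False},
    ]
}


def speaker_turns(response):
    """Merge consecutive final tokens from the same speaker into turns."""
    turns = []
    for token in response["tokens"]:
        if not token.get("is_final"):
            continue  # skip provisional tokens that may still change
        speaker = token.get("speaker")
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker as the previous token: extend the open turn.
            turns[-1]["text"] += token["text"]
            turns[-1]["end_ms"] = token["end_ms"]
        else:
            turns.append({
                "speaker": speaker,
                "text": token["text"],
                "start_ms": token["start_ms"],
                "end_ms": token["end_ms"],
            })
    return turns


if __name__ == "__main__":
    for turn in speaker_turns(SAMPLE):
        print(turn["speaker"], turn["start_ms"], turn["end_ms"], turn["text"])
```

Filtering on is_final before merging keeps a streaming consumer from committing text that a later message might revise.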

Trust & compliance

SOC 2
SOC 2 Type II [16]
HIPAA
Yes [17]
GDPR
Yes [18]
ISO 27001
Yes [19]
PCI DSS
No [20]
Published SLA
Yes [21]
Rate limits
Async API: max 1,000 files stored simultaneously, 10 GB total storage, 100 pending transcriptions, 2,000 total transcriptions, 300-minute max file duration (non-negotiable). Real-time WebSocket: per-organization and per-project concurrent request caps and requests-per-minute caps (exact numbers not published; increase requests via Soniox Console). [22]
Known restrictions
Maximum audio file duration is 300 minutes (fixed, cannot be increased); uploaded files are not deleted automatically and must be manually deleted; PCI DSS compliance not stated; specific concurrent connection numbers not published, so increases must be requested via Console; free credits discontinued as of October 2025, pay-as-you-go only [23]
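The documented async limits lend themselves to a client-side check before submitting a file. The following is a rough sketch under stated assumptions: the AccountUsage fields and the preflight function are hypothetical names, the caller is assumed to track its own usage counts, and 10 GB is read as decimal gigabytes, which the profile does not specify.

```python
from dataclasses import dataclass

# Limits as listed under "Rate limits" in this profile.
MAX_FILE_MINUTES = 300
MAX_STORED_FILES = 1_000
MAX_STORAGE_BYTES = 10 * 10**9  # assumption: "10 GB" means decimal gigabytes
MAX_PENDING = 100
MAX_TOTAL = 2_000


@dataclass
class AccountUsage:
    """Hypothetical snapshot of current usage, maintained by the caller."""
    stored_files: int
    stored_bytes: int
    pending_transcriptions: int
    total_transcriptions: int


def preflight(usage, file_minutes, file_bytes):
    """Return a list of limit violations; an empty list means go ahead."""
    problems = []
    if file_minutes > MAX_FILE_MINUTES:
        problems.append("file exceeds the 300-minute duration cap")
    if usage.stored_files >= MAX_STORED_FILES:
        problems.append("stored file count is at the 1,000-file cap")
    if usage.stored_bytes + file_bytes > MAX_STORAGE_BYTES:
        problems.append("upload would exceed 10 GB of total storage")
    if usage.pending_transcriptions >= MAX_PENDING:
        problems.append("pending transcriptions are at the cap of 100")
    if usage.total_transcriptions >= MAX_TOTAL:
        problems.append("total transcriptions are at the cap of 2,000")
    return problems


if __name__ == "__main__":
    usage = AccountUsage(stored_files=10, stored_bytes=1_000_000,
                         pending_transcriptions=5, total_transcriptions=50)
    print(preflight(usage, file_minutes=120, file_bytes=50_000_000))
```

Because uploaded files are not deleted automatically, the storage counters only go down when the caller deletes files, so a check like this is most useful alongside a cleanup routine.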

Developer surface

Docs rendering: static · llms.txt present

Integration

API style
REST
Base URL
https://api.soniox.com/v1
Version
v1
Versioning
URL
Stability
GA
Auth methods
API key
Error format
vendor-specific
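As a rough illustration of how the integration fields fit together, the sketch below builds (but does not send) a request against the v1 base URL with an API key. The profile only says the auth method is an API key; passing it as a Bearer token in the Authorization header, the "transcriptions" path, and the payload keys are all assumptions here and should be confirmed against the vendor's API reference.

```python
import json
import urllib.request

BASE_URL = "https://api.soniox.com/v1"  # from this profile; version is in the URL


def build_request(api_key, path, payload=None):
    """Build a urllib Request for the API without sending it.

    Assumption: the key travels as "Authorization: Bearer <key>".
    """
    url = f"{BASE_URL}/{path.lstrip('/')}"
    data = None
    if payload is not None:
        data = json.dumps(payload).encode("utf-8")
    method = "POST" if data is not None else "GET"
    request = urllib.request.Request(url, data=data, method=method)
    request.add_header("Authorization", f"Bearer {api_key}")
    if data is not None:
        request.add_header("Content-Type", "application/json")
    return request


if __name__ == "__main__":
    # "transcriptions" and the payload key are hypothetical, for illustration.
    req = build_request("YOUR_API_KEY", "transcriptions",
                        {"audio_url": "https://example.com/a.wav"})
    print(req.get_method(), req.full_url)
    # To actually send: urllib.request.urlopen(req). Since the error format is
    # vendor-specific, read the error body rather than assuming its shape.
```

Keeping request construction separate from sending makes it easy to unit-test the URL and headers without network access or a live key.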

SDKs

  • Python: soniox
  • Node.js: @soniox/node
  • JavaScript (browser): @soniox/speech-to-text-web

Adoption & maturity

Launched
2020-01-01
GA
2022-01-01
Notable customers
Scribe

Other Speech-to-Text & Transcription APIs

  • ElevenLabs Scribe (Speech to Text)

    "Scribe v2 is the most accurate Speech to Text model" offering "real-time Speech to Text in under 150 ms" across "90+ languages."

    Hybrid · free tier · public pricing · self-serve

  • Azure AI Speech to Text

    "Azure Speech in Foundry Tools provides speech to text, text to speech, and other capabilities through a Microsoft Foundry resource. You can transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and conduct live AI voice conversations."

    Usage · free tier · public pricing · self-serve

  • Amazon Transcribe

    "Amazon Transcribe is an automatic speech recognition service that uses machine learning models to convert audio to text. You can use Amazon Transcribe as a standalone transcription service or to add speech-to-text capabilities to any application."

    Usage · free tier · public pricing · self-serve

  • Google Cloud Speech-to-Text

    "Accurate voice typing and transcription powered by Gemini."

    Usage · free tier · public pricing · self-serve

  • IBM watsonx Speech to Text

    "IBM Watson® Speech to Text technology enables fast and accurate speech transcription in multiple languages for a variety of use cases, including but not limited to customer self-service, agent assistance and speech analytics."

    Usage · free tier · public pricing · self-serve

  • AssemblyAI

    "Voice AI infrastructure for developers building products that transcribe, understand, and act on speech."

    Usage · public pricing · self-serve


References

Each field above carries a numbered source.

  1. Description: soniox.com
  2. Pricing model: soniox.com · soniox.com
  3. Published pricing: soniox.com
  4. Free tier: soniox.com · soniox.com
  5. Self-serve signup: soniox.com
  6. Enterprise plan: soniox.com
  7. Supported actions: soniox.com · soniox.com
  8. Regions: soniox.com · soniox.com
  9. Languages: soniox.com · soniox.com
  10. Input types: soniox.com · soniox.com
  11. Output types: soniox.com
  12. Webhooks: soniox.com
  13. Sandbox: soniox.com
  14. SDK languages: github.com · soniox.com
  15. MCP server: soniox.com · soniox.com
  16. SOC 2: soniox.com · soniox.com
  17. HIPAA: soniox.com · soniox.com
  18. GDPR: soniox.com · soniox.com
  19. ISO 27001: soniox.com · soniox.com
  20. PCI DSS: soniox.com
  21. Published SLA: soniox.com
  22. Rate limits: soniox.com · soniox.com
  23. Known restrictions: soniox.com · soniox.com

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-21 Capabilities: {} → {"medical":true,"translation":true,"real_time_streaming":true,"speaker_diarizat…
  2. 2026-06-21 Summary Md: (none) → Soniox is a speech-to-text API built for real-time and batch transcription work…
  3. 2026-06-21 Score Docs Quality: (none) → 25
  4. 2026-06-21 Score Procurement Friction: (none) → 85
  5. 2026-06-21 Score Trust Readiness: (none) → 90
  6. 2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
  7. 2026-06-21 Avoid If: (none) → You want to try it free before paying
  8. 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
  9. 2026-06-21 Score Agent Friendliness: (none) → 65
  10. 2026-06-21 Score Pricing Transparency: (none) → 85
  11. 2026-06-21 Score Setup Speed: (none) → 60
  12. 2026-06-21 Llms Txt Present: (none) → Yes
  13. 2026-06-21 Rendering: (none) → static
  14. 2026-06-21 Has Structured Data: (none) → Yes
  15. 2026-06-21 Robots Allows Agents: (none) → Yes
  16. 2026-06-21 Status Page URL: (none) → https://status.soniox.com
  17. 2026-06-21 Docs URL: (none) → https://soniox.com/docs
  18. 2026-06-21 Llms Txt URL: (none) → https://soniox.com/llms.txt
  19. 2026-06-21 Free Tier Available: set to No
  20. 2026-06-21 Self Serve Signup: set to Yes
  21. 2026-06-21 Requires Sales Call: set to No
  22. 2026-06-21 Enterprise Plan Available: set to Yes
  23. 2026-06-21 SOC 2: set to type_2
  24. 2026-06-21 HIPAA: set to Yes
  25. 2026-06-21 GDPR: set to Yes
  26. 2026-06-21 ISO 27001: set to Yes
  27. 2026-06-21 PCI DSS: set to No
  28. 2026-06-21 SLA Published: set to Yes
  29. 2026-06-21 Data Retention Policy URL: set to https://soniox.com/docs/security-and-privacy
  30. 2026-06-21 Documented Rate Limits: set to Async API: max 1,000 files stored simultaneously, 10 GB total storage, 100 pend…
  31. 2026-06-21 Known Restrictions: set to Maximum audio file duration is 300 minutes (fixed, cannot be increased), Upload…
  32. 2026-06-21 Auth Methods: set to api_key
  33. 2026-06-21 Auth Docs URL: set to https://soniox.com/docs/api-reference/auth/create_temporary_api_key
  34. 2026-06-21 API Style: set to rest
  35. 2026-06-21 Base URL: set to https://api.soniox.com/v1
  36. 2026-06-21 API Version: set to v1
  37. 2026-06-21 Versioning Scheme: set to url
  38. 2026-06-21 Stability: set to ga
  39. 2026-06-21 MCP URL: set to https://soniox.com/docs/api/mcp/mcp
  40. 2026-06-21 Quickstart URL: set to https://soniox.com/docs/stt/get-started
  41. 2026-06-21 Error Format: set to vendor-specific
  42. 2026-06-21 Webhook Events URL: set to https://soniox.com/docs/stt/async/webhooks
  43. 2026-06-21 Requires Verification: set to No
  44. 2026-06-21 Slug: set to soniox
  45. 2026-06-21 Price Basis: set to minute
  46. 2026-06-21 Launched At: set to 2020-01-01
  47. 2026-06-21 GA Date: set to 2022-01-01
  48. 2026-06-21 Notable Customers: set to Scribe
  49. 2026-06-21 Fields Not Found: set to exact concurrent connection numbers for real-time streaming, maximum file uploa…
  50. 2026-06-21 Source Confidence: set to high

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/soniox \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/soniox/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'
