Best Speech-to-Text APIs with PII Redaction
Transcription APIs that automatically detect and redact PII and PCI data from audio and transcripts for compliance-sensitive workloads.
Our pick: ElevenLabs Scribe (Speech to Text)
ElevenLabs Scribe is a REST speech-to-text API supporting batch and real-time transcription across 90+ languages, with sub-150ms latency for streaming use cases. It covers speaker diarization, word and character timestamps, entity detection and redaction, multichannel processing, and keyterm prompting, making it suitable for podcasts, video captioning, meeting documentation, and AI agent integrations. Pricing starts at $0.22 per hour of audio with a free tier of 4.5 hours per month, self-serve signup, and an enterprise plan available. The service holds SOC 2 Type 2, HIPAA, GDPR, ISO 27001, and PCI DSS certifications, and ships SDKs for Python, Node.js, Swift, Kotlin, and Flutter.
Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt).
Best for…
- Best overall
- ElevenLabs Scribe (Speech to Text)
- Best free pick
- ElevenLabs Scribe (Speech to Text)
- Best for enterprise
- ElevenLabs Scribe (Speech to Text)
- Cheapest to start
- Voicegain
- Best for agents
- ElevenLabs Scribe (Speech to Text)
- Broadest surface
- Deepgram
Ranked (6)
#1 ElevenLabs Scribe (Speech to Text)
81 / 100- Best overall
- Best free pick
- Best for enterprise
- Best for agents
ElevenLabs Scribe is a REST speech-to-text API supporting batch and real-time transcription across 90+ languages, with sub-150ms latency for streaming use cases. It covers speaker diarization, word and character timestamps, entity detection and redaction, multichannel processing, and keyterm prompting, making it suitable for podcasts, video captioning, meeting documentation, and AI agent integrations. Pricing starts at $0.22 per hour of audio with a free tier of 4.5 hours per month, self-serve signup, and an enterprise plan available. The service holds SOC 2 Type 2, HIPAA, GDPR, ISO 27001, and PCI DSS certifications, and ships SDKs for Python, Node.js, Swift, Kotlin, and Flutter.
PricingHybrid · from $0.22 hour of audio · free tier ✓TrustSOC 2 Type II · HIPAA · GDPR · ISO 27001 · PCI DSSDoesUsed byRevolut, Klarna, Washington Post, Deutsche Telekom#2 Amazon Transcribe
72 / 100Amazon Transcribe is an automatic speech recognition service from AWS that converts audio to text via batch or real-time streaming, with support for speaker diarization, custom vocabularies, custom language models, and multi-language identification. It targets a broad range of applications including contact center analytics, clinical documentation through a dedicated medical variant, accessibility captioning, and toxic content detection in gaming. Pricing starts at $0.006 per minute on a pay-as-you-go basis, with a free tier of 60 minutes per month for the first 12 months. The service is HIPAA-eligible, SOC 2 Type 2 certified, ISO 27001 and PCI DSS compliant, available across 25 AWS regions including GovCloud, and provides SDKs for Python, JavaScript, Java, Go, C++, Ruby, and PHP.
PricingUsage · from $0.006 minute · free tier ✓TrustSOC 2 Type II · HIPAA · GDPR · ISO 27001 · PCI DSSDoes#3 AssemblyAI
79 / 100AssemblyAI is a voice AI platform providing speech-to-text transcription, speaker diarization, and audio intelligence features via REST API, aimed at developers building products on top of speech data. Pricing is usage-based at $0.0025 per minute with a $50 one-time free credit requiring no credit card, and enterprise plans are available. The service holds SOC 2 Type II, HIPAA, GDPR, ISO 27001, and PCI DSS certifications, with data processed in the US and EU. Customers include Zoom, Spotify, and Dovetail, and SDKs are actively maintained for Python and Node.js.
PricingUsage · from $0.0025 minute · free tier ✗TrustSOC 2 Type II · HIPAA · GDPR · ISO 27001 · PCI DSSDoesUsed byZoom, Spotify, Veed, CallRailAvoid ifYou want to try it free before paying#4 Deepgram
59 / 100- Broadest surface
Deepgram provides real-time and batch APIs for speech-to-text, text-to-speech, and voice agents, plus audio intelligence features like summarization. Pricing is usage-based, published, and self-serve. It offers webhooks, four SDKs, and an official MCP server, with availability in North America and Europe. The platform carries SOC 2 Type 2, HIPAA, GDPR, and PCI DSS compliance with a published SLA.
PricingUsage · free tier ✗TrustSOC 2 Type II · HIPAA · GDPR · PCI DSSDoesAvoid ifYou want to try it free before paying#5 Gladia
71 / 100Gladia is an audio infrastructure API covering batch and real-time speech-to-text transcription, speaker diarization, translation, summarization, sentiment and emotion analysis, and named entity recognition, targeting voice agents, contact centers, meeting assistants, and media captioning workflows. Pricing is usage-based at $0.61 per hour with a free tier of 10 hours per month and no sales call required to start. The API is REST-based with TypeScript, JavaScript, and Python SDKs, webhooks, and an MCP server, and is hosted in EU (France, default) and US regions. Gladia holds SOC 2 Type II, HIPAA, and GDPR compliance, and counts Aircall, Citibank, Samsung, Oracle, and Microsoft among its customers.
PricingUsage · from $0.61 hour · free tier ✓TrustSOC 2 Type II · HIPAA · GDPRDoesUsed byAircall, Attention, Recall, VEED#6 Voicegain
60 / 100- Cheapest to start
Voicegain is a speech-to-text and voice AI platform aimed at contact centers, healthcare payers, and enterprises that need telephony transcription, PII/PCI redaction, real-time agent assist, and custom ASR model training. Pricing starts at $0.0015 per minute on a pay-as-you-go basis, with a $50 one-time signup credit and no credit card required; on-premise and private-cloud deployments are available but require an annual commitment. The platform holds SOC 2 Type 2, HIPAA, GDPR, and PCI DSS certifications, and customers include Aetna, Samsung, and Sutherland.
PricingUsage · from $0.0015 minute · free tier ✗TrustSOC 2 Type II · HIPAA · GDPR · PCI DSSDoesUsed bySutherland, Samsung, Aetna, LevelAIAvoid ifYou want to try it free before paying