Rev AI

"A developer-first API delivering industry-leading accuracy and fast performance at global scale." [1]

Speech-to-Text & Transcription APIs

www.rev.ai · By Rev · Last verified 2026-06-21 · Source confidence: high

Rev AI is a speech-to-text API from Rev that offers both asynchronous batch transcription and real-time streaming, with capabilities including speaker diarization, word timestamps, custom vocabulary, language detection, translation, sentiment analysis, and summarization. Pricing is usage-based, starting at $0.10 per hour (about $0.0017 per minute) for Reverb Turbo; new users get a one-time trial credit of 5 hours rather than a recurring free tier, and self-serve signup means no sales call is required. SDKs are available for Python, Node.js, Java, and Go. The service is SOC 2 Type II certified and GDPR compliant, supports HIPAA on enterprise accounts (requires BAA + MSA), and offers data residency in the US and EU.

Best for / Avoid if

Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - llms.txt is published (no MCP server); Teams needing broad API coverage out of the box

Avoid if: You need a recurring free tier (only a one-time 5-hour trial credit is offered)

Pricing & procurement

Pricing model: Usage-based [2]
Published pricing: Yes [3]
Free tier: No [4]
Free tier details: One-time trial credit of 5 hours of Reverb ASR for new users (not a recurring free allowance)
Self-serve signup: Yes
Requires sales call: No
Enterprise plan: Yes [5]

Published prices
Plan	Item	Per	Amount (USD)
Pay-As-You-Go	Reverb Transcription (English, Async)	hour	$0.20
Pay-As-You-Go	Reverb Turbo Transcription (English, Async)	hour	$0.10
Pay-As-You-Go	Reverb Foreign Language Transcription (55+ languages, Async)	hour	$0.30
Pay-As-You-Go	Whisper Fusion Transcription (English, Async)	minute	$0.005
Pay-As-You-Go	Whisper Large Transcription (English, Async)	minute	$0.005
Pay-As-You-Go	Human Transcription	minute	$1.99
Pay-As-You-Go	Forced Alignment	minute	$0.003
Pay-As-You-Go	Language Identification add-on	minute	$0.003
Pay-As-You-Go	Language Translation Standard add-on	minute	$0.002
Pay-As-You-Go	Language Translation Premium add-on	minute	$0.025
Pay-As-You-Go	Sentiment Analysis add-on	10 words	$0.0008
Pay-As-You-Go	Topic Extraction add-on	10 words	$0.0008
Pay-As-You-Go	Summarization Standard add-on	minute	$0.002
Pay-As-You-Go	Summarization Premium add-on	minute	$0.025
Enterprise	Transcription (all models, volume pricing)		-
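
Because the table mixes per-hour and per-minute prices, a small calculation helps when comparing models. The sketch below copies the Pay-As-You-Go transcription prices from the table and applies the 15-second minimum billing listed under Known restrictions. It assumes billing is otherwise proportional to audio duration, which this profile does not state; the function and dictionary names are illustrative only.

```python
from decimal import Decimal

# (price in USD, billing unit in seconds), copied from the Pay-As-You-Go rows.
PRICES = {
    "reverb": (Decimal("0.20"), 3600),          # per hour
    "reverb_turbo": (Decimal("0.10"), 3600),    # per hour
    "reverb_foreign": (Decimal("0.30"), 3600),  # per hour
    "whisper_fusion": (Decimal("0.005"), 60),   # per minute
    "whisper_large": (Decimal("0.005"), 60),    # per minute
    "human": (Decimal("1.99"), 60),             # per minute
}

# "15-second minimum billing per job" (Known restrictions).
MIN_BILLED_SECONDS = Decimal(15)


def estimate_cost(model: str, audio_seconds: float) -> Decimal:
    """Estimate the USD cost of one job, assuming linear per-second billing."""
    price, unit_seconds = PRICES[model]
    billed_seconds = max(Decimal(str(audio_seconds)), MIN_BILLED_SECONDS)
    cost = price * billed_seconds / unit_seconds
    return cost.quantize(Decimal("0.000001"))


if __name__ == "__main__":
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 3600)} per hour of audio")
```

Under these assumptions, one minute of Reverb Turbo audio rounds to about $0.0017, which matches the per-minute figure quoted in the summary.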

Capabilities

Real-time streaming
Speaker diarization
Speech translation

Supported actions: transcribe_batch, transcribe_streaming, speaker_diarization, word_timestamps, confidence_scores, custom_vocabulary, punctuation, disfluency_removal, profanity_filtering, language_detection, language_translation, sentiment_analysis, topic_extraction, summarization, forced_alignment, captions_srt, captions_vtt, human_transcription [6]
Source excerpts for supported actions:
docs.rev.ai/api/asynchronous/reference: "GET /jobs/{id}/transcript, GET /jobs/{id}/transcript/translation/{language}, GET /jobs/{id}/transcript/summary, GET /jobs/{id}/captions, GET /jobs/{id}/captions/translation/{language}"
docs.rev.ai/api/features: "Speaker diarization is the process of separating audio segments according to speaker identification. Diarization is performed by default on all audio processed through the Asynchronous Speech-to-Text API."
Regions: United States, European Union (Frankfurt, Germany) [7]
Languages: Multilingual English/Spanish, Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Farsi, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Mandarin, Marathi, Nepali, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh [8]
Source excerpts for languages:
docs.rev.ai/api/asynchronous/changelog/: "2023-11-16 added: Afrikaans, Armenian, Azerbaijani, Belarusian, Bosnian, Estonian, Galician, Icelandic, Kannada, Kazakh, Macedonian, Marathi, Nepali, Serbian, Swahili, Tagalog, Thai, Ukrainian, Urdu, Vietnamese, Welsh, and multilingual English/Spanish"
rev.ai/languages: "With 57+ languages supported, you can now take your products to markets worldwide."
docs.rev.ai/faq: "Rev AI supports 58+ languages in the Asynchronous Speech-to-Text API and 9+ languages in the Streaming Speech-to-Text API."
Input types: audio/mp3, audio/mp4, audio/ogg, audio/wav, audio/flac, audio/pcm, all FFmpeg-compatible formats, raw audio, file upload (multipart), source URL (source_config), WebSocket stream (wss://), RTMP stream
Output types: JSON transcript, plain text transcript, SRT captions, VTT captions, word-level timestamps, speaker-labeled segments, confidence scores, JSON summary, plain text summary, JSON translation, SRT translation captions
Webhooks: Yes [9]
Sandbox / test mode: No [10]
SDK languages: Python, Node.js, Java, Go [11]
MCP server: No
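
The result endpoints quoted in the supported-actions source excerpt can be combined with the base URL listed under Integration. The following is a minimal sketch of that composition; the helper name `result_urls` is hypothetical, and it only builds URL strings without calling the API.

```python
from typing import Dict, Optional
from urllib.parse import quote

# Base URL as listed in the Integration section of this profile.
BASE_URL = "https://api.rev.ai/speechtotext/v1"


def result_urls(job_id: str, language: Optional[str] = None) -> Dict[str, str]:
    """Build the GET URLs for a finished job's outputs (hypothetical helper)."""
    job = f"{BASE_URL}/jobs/{quote(job_id, safe='')}"
    urls = {
        "transcript": f"{job}/transcript",
        "summary": f"{job}/transcript/summary",
        "captions": f"{job}/captions",
    }
    if language is not None:
        # Translation outputs are addressed per target language.
        lang = quote(language, safe="")
        urls["transcript_translation"] = f"{job}/transcript/translation/{lang}"
        urls["captions_translation"] = f"{job}/captions/translation/{lang}"
    return urls


if __name__ == "__main__":
    for name, url in result_urls("example-job-id", language="es").items():
        print(name, url)
```

Which output format each endpoint returns (for example JSON versus plain text, or SRT versus VTT) is presumably selected by request headers, but this profile does not say how, so the sketch leaves that out.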

Trust & compliance

SOC 2: SOC 2 Type II [12]
HIPAA: Yes [13]
GDPR: Yes [14]
ISO 27001: No [15]
PCI DSS: No [16]
Published SLA: No [17]
Rate limits: Async API: 10,000 transcription requests per 10 minutes; 500 transcriptions processed per 10 minutes (excess queued); multipart uploads max 5 concurrent requests. Streaming API: concurrency limit of 10 (adjustable via support); 3-hour time limit per stream. [18]
Known restrictions: Multipart file uploads: 2 GB per request max, source_config uploads: 5 TB max, Maximum 17 hours audio per async transcription job, Streaming: 10 concurrent connections limit (default), Streaming: 3-hour time limit per stream, 15-second minimum billing per job, HIPAA available on enterprise accounts only (requires BAA + MSA), EU deployment does not support human transcription, EU deployment: custom vocabularies only accepted at job submission (not pre-existing IDs), Jobs/data retained for maximum 30 days then permanently deleted, RTMP streams not supported in HIPAA mode [19]
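
Several of these restrictions can be checked on the client before a job is submitted. The sketch below is one way to do that, using the limits listed above. It assumes "GB" and "TB" mean decimal units (the more conservative reading, since the source does not say), and the function name and upload-mode labels are illustrative rather than part of the Rev AI API.

```python
from typing import List

# Limits copied from the Known restrictions above.
# Decimal GB/TB is an assumption; the source does not specify the unit base.
MAX_MULTIPART_BYTES = 2 * 10**9        # multipart uploads: 2 GB per request
MAX_SOURCE_CONFIG_BYTES = 5 * 10**12   # source_config uploads: 5 TB
MAX_ASYNC_AUDIO_SECONDS = 17 * 3600    # 17 hours of audio per async job

UPLOAD_LIMITS = {
    "multipart": MAX_MULTIPART_BYTES,
    "source_config": MAX_SOURCE_CONFIG_BYTES,
}


def check_async_job(size_bytes: int, duration_seconds: float,
                    upload: str = "multipart") -> List[str]:
    """Return a list of limit violations; an empty list means none were found."""
    if upload not in UPLOAD_LIMITS:
        raise ValueError(f"unknown upload mode: {upload!r}")
    problems = []
    if size_bytes > UPLOAD_LIMITS[upload]:
        problems.append(
            f"file is {size_bytes} bytes; {upload} limit is {UPLOAD_LIMITS[upload]}"
        )
    if duration_seconds > MAX_ASYNC_AUDIO_SECONDS:
        problems.append(
            f"audio is {duration_seconds} s; async limit is {MAX_ASYNC_AUDIO_SECONDS} s"
        )
    return problems


if __name__ == "__main__":
    print(check_async_job(3 * 10**9, 18 * 3600))
```

A file that is too large for a multipart upload may still fit under the source_config limit, so switching upload modes is the natural fallback when only the size check fails.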

Developer surface

Docs rendering: static · llms.txt present

Integration

API style: rest
Base URL: https://api.rev.ai/speechtotext/v1
Version: v1
Versioning: url
Stability: ga
Auth methods: api_key
Error format: vendor-specific
Rate limit: 10,000 requests / 10 minutes
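
To show how these fields fit together, here is a sketch that builds (but does not send) an asynchronous job submission. The base URL and the `source_config` input come from this profile. The `POST /jobs` path, the `Bearer` authorization header, and the `{"source_config": {"url": ...}}` body shape are assumptions that this profile does not confirm and should be checked against the official quickstart. The function name is hypothetical.

```python
import json
import urllib.request

# Base URL as listed above.
BASE_URL = "https://api.rev.ai/speechtotext/v1"


def build_submit_request(access_token: str, media_url: str) -> urllib.request.Request:
    """Build a job-submission request without sending it.

    Assumptions (not confirmed by this profile): the endpoint is POST /jobs,
    the API key is passed as a Bearer token, and the media URL is nested
    under "source_config".
    """
    body = json.dumps({"source_config": {"url": media_url}}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/jobs",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    req = build_submit_request("YOUR_ACCESS_TOKEN", "https://example.com/audio.mp3")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would send it. Keeping construction separate from sending makes the request easy to inspect and to throttle against the documented rate limit.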

SDKs

Python rev_ai · repo
Node.js revai-node-sdk · repo
Java ai.rev:revai-java-sdk · repo
Go · repo

Adoption & maturity

Launched: 2010-01-01

Other Speech-to-Text & Transcription APIs

ElevenLabs Scribe (Speech to Text)
"Scribe v2 is the most accurate Speech to Text model" offering "real-time Speech to Text in under 150 ms" across "90+ languages."
Hybrid · free tier · public pricing · self-serve
Azure AI Speech to Text
"Azure Speech in Foundry Tools provides speech to text, text to speech, and other capabilities through a Microsoft Foundry resource. You can transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and conduct live AI voice conversations."
Usage · free tier · public pricing · self-serve
Amazon Transcribe
"Amazon Transcribe is an automatic speech recognition service that uses machine learning models to convert audio to text. You can use Amazon Transcribe as a standalone transcription service or to add speech-to-text capabilities to any application."
Usage · free tier · public pricing · self-serve
Google Cloud Speech-to-Text
"Accurate voice typing and transcription powered by Gemini."
Usage · free tier · public pricing · self-serve
IBM watsonx Speech to Text
"IBM Watson® Speech to Text technology enables fast and accurate speech transcription in multiple languages for a variety of use cases, including but not limited to customer self-service, agent assistance and speech analytics."
Usage · free tier · public pricing · self-serve
AssemblyAI
"Voice AI infrastructure for developers building products that transcribe, understand, and act on speech."
Usage · public pricing · self-serve


References

Each field above carries a numbered source, listed here by citation number.

[1] Description: rev.ai
[2] Pricing model: rev.ai
[3] Published pricing: rev.ai
[4] Free tier: rev.ai · rev.ai
[5] Enterprise plan: rev.ai
[6] Supported actions: docs.rev.ai · docs.rev.ai
[7] Regions: docs.rev.ai
[8] Languages: docs.rev.ai · rev.ai · docs.rev.ai
[9] Webhooks: docs.rev.ai
[10] Sandbox: docs.rev.ai
[11] SDK languages: docs.rev.ai
[12] SOC 2: docs.rev.ai · rev.ai
[13] HIPAA: rev.ai · docs.rev.ai
[14] GDPR: rev.ai · docs.rev.ai
[15] ISO 27001: rev.ai
[16] PCI DSS: rev.ai
[17] Published SLA: rev.ai
[18] Rate limits: docs.rev.ai · docs.rev.ai
[19] Known restrictions: docs.rev.ai · docs.rev.ai

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"translation":true,"real_time_streaming":true,"speaker_diarization":true}
2026-06-21 Summary Md: (none) → Rev AI is a speech-to-text API from Rev, offering both asynchronous batch trans…
2026-06-21 Score Setup Speed: (none) → 60
2026-06-21 Score Docs Quality: (none) → 25
2026-06-21 Score Procurement Friction: (none) → 85
2026-06-21 Score Trust Readiness: (none) → 55
2026-06-21 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Avoid If: (none) → You want to try it free before paying
2026-06-21 Score Agent Friendliness: (none) → 35
2026-06-21 Score Pricing Transparency: (none) → 85
2026-06-21 Llms Txt Present: (none) → Yes
2026-06-21 Rendering: (none) → static
2026-06-21 Has Structured Data: (none) → No
2026-06-21 Robots Allows Agents: (none) → Yes
2026-06-21 Status Page URL: (none) → https://status.rev.ai
2026-06-21 Docs URL: (none) → https://docs.rev.ai/
2026-06-21 Llms Txt URL: (none) → https://www.rev.ai/llms.txt
2026-06-21 Free Tier Available: set to No
2026-06-21 Free Tier Details: set to One-time trial credit of 5 hours of Reverb ASR for new users (not a recurring f…
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to No
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to type_2
2026-06-21 HIPAA: set to Yes
2026-06-21 GDPR: set to Yes
2026-06-21 ISO 27001: set to No
2026-06-21 PCI DSS: set to No
2026-06-21 SLA Published: set to No
2026-06-21 Data Retention Policy URL: set to https://docs.rev.ai/api/security
2026-06-21 Documented Rate Limits: set to Async API: 10,000 transcription requests per 10 minutes; 500 transcriptions pro…
2026-06-21 Rate Limit Requests: set to 10000
2026-06-21 Rate Limit Window: set to 10 minutes
2026-06-21 Known Restrictions: set to Multipart file uploads: 2 GB per request max, source_config uploads: 5 TB max, …
2026-06-21 Auth Methods: set to api_key
2026-06-21 Auth Docs URL: set to https://docs.rev.ai/api/asynchronous/get-started/
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to https://api.rev.ai/speechtotext/v1
2026-06-21 API Version: set to v1
2026-06-21 Versioning Scheme: set to url
2026-06-21 Stability: set to ga
2026-06-21 Quickstart URL: set to https://docs.rev.ai/api/asynchronous/get-started/
2026-06-21 Error Format: set to vendor-specific
2026-06-21 Webhook Events URL: set to https://docs.rev.ai/api/asynchronous/webhooks/
2026-06-21 Requires Verification: set to No
2026-06-21 Slug: set to rev-ai
2026-06-21 Price Basis: set to minute
2026-06-21 Free Tier Limit: set to 5 hours of Reverb ASR
2026-06-21 Launched At: set to 2010-01-01
2026-06-21 Notable Customers: set to (none)

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/rev-ai \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/rev-ai/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'
