Rev AI

"A developer-first API delivering industry-leading accuracy and fast performance at global scale." [1]

www.rev.ai · By Rev · Agent JSON · Suggest an edit · Last verified 2026-06-21 · Source confidence: high

Rev AI is a speech-to-text API from Rev, offering both asynchronous batch transcription and real-time streaming, with capabilities including speaker diarization, word timestamps, custom vocabulary, language detection, translation, sentiment analysis, and summarization. Pricing is usage-based at $0.0017 per minute with a 5-hour free tier and self-serve signup, making it accessible without a sales call. SDKs are available for Python, Node.js, Java, and Go, and the service is SOC 2 Type 2 certified, HIPAA compliant, and GDPR compliant, with data residency options in the US and EU.

Best for / Avoid if

Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box

Avoid if: You want to try it free before paying

Pricing & procurement

Pricing model
Usage-based [2]
Published pricing
Yes [3]
Free tier
No [4]
Free tier details
One-time trial credit of 5 hours of Reverb ASR for new users (not a recurring free allowance)
Self-serve signup
Yes
Requires sales call
No
Enterprise plan
Yes [5]
Published prices
PlanItemPerAmountSource
Pay-As-You-GoReverb Transcription (English, Async)hour$0.2source
Pay-As-You-GoReverb Turbo Transcription (English, Async)hour$0.1source
Pay-As-You-GoReverb Foreign Language Transcription (55+ languages, Async)hour$0.3source
Pay-As-You-GoWhisper Fusion Transcription (English, Async)minute$0.005source
Pay-As-You-GoWhisper Large Transcription (English, Async)minute$0.005source
Pay-As-You-GoHuman Transcriptionminute$1.99source
Pay-As-You-GoForced Alignmentminute$0.003source
Pay-As-You-GoLanguage Identification add-onminute$0.003source
Pay-As-You-GoLanguage Translation Standard add-onminute$0.002source
Pay-As-You-GoLanguage Translation Premium add-onminute$0.025source
Pay-As-You-GoSentiment Analysis add-on10 words$0.0008source
Pay-As-You-GoTopic Extraction add-on10 words$0.0008source
Pay-As-You-GoSummarization Standard add-onminute$0.002source
Pay-As-You-GoSummarization Premium add-onminute$0.025source
EnterpriseTranscription (all models, volume pricing) - source

Capabilities

  • Real-time streaming
  • Speaker diarization
  • Speech translation
Supported actions
transcribe_batch, transcribe_streaming, speaker_diarization, word_timestamps, confidence_scores, custom_vocabulary, punctuation, disfluency_removal, profanity_filtering, language_detection, language_translation, sentiment_analysis, topic_extraction, summarization, forced_alignment, captions_srt, captions_vtt, human_transcription [6]
Regions
United States, European Union (Frankfurt, Germany) [7]
Languages
Multilingual English/Spanish, Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Farsi, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Mandarin, Marathi, Nepali, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh [8]
Input types
audio/mp3, audio/mp4, audio/ogg, audio/wav, audio/flac, audio/pcm, all FFmpeg-compatible formats, raw audio, file upload (multipart), source URL (source_config), WebSocket stream (wss://), RTMP stream
Output types
JSON transcript, plain text transcript, SRT captions, VTT captions, word-level timestamps, speaker-labeled segments, confidence scores, JSON summary, plain text summary, JSON translation, SRT translation captions
Webhooks
Yes [9]
Sandbox / test mode
No [10]
SDK languages
Python, Node.js, Java, Go [11]
MCP server
No

Trust & compliance

SOC 2
SOC 2 Type II [12]
HIPAA
Yes [13]
GDPR
Yes [14]
ISO 27001
No [15]
PCI DSS
No [16]
Published SLA
No [17]
Rate limits
Async API: 10,000 transcription requests per 10 minutes; 500 transcriptions processed per 10 minutes (excess queued); multipart uploads max 5 concurrent requests. Streaming API: concurrency limit of 10 (adjustable via support); 3-hour time limit per stream. [18]
Known restrictions
Multipart file uploads: 2 GB per request max, source_config uploads: 5 TB max, Maximum 17 hours audio per async transcription job, Streaming: 10 concurrent connections limit (default), Streaming: 3-hour time limit per stream, 15-second minimum billing per job, HIPAA available on enterprise accounts only (requires BAA + MSA), EU deployment does not support human transcription, EU deployment: custom vocabularies only accepted at job submission (not pre-existing IDs), Jobs/data retained for maximum 30 days then permanently deleted, RTMP streams not supported in HIPAA mode [19]

Developer surface

Docs rendering: static · llms.txt present

Integration

API style
rest
Base URL
https://api.rev.ai/speechtotext/v1
Version
v1
Versioning
url
Stability
ga
Auth methods
api_key
Error format
vendor-specific
Rate limit
10000 / 10 minutes

SDKs

  • Python rev_ai · repo
  • Node.js revai-node-sdk · repo
  • Java ai.rev:revai-java-sdk · repo
  • Go · repo

Adoption & maturity

Launched
2010-01-01

Other Speech-to-Text & Transcription APIs

  • ElevenLabs Scribe (Speech to Text)

    "Scribe v2 is the most accurate Speech to Text model" offering "real-time Speech to Text in under 150 ms" across "90+ languages."

    Hybrid · free tier · public pricing · self-serve

  • Azure AI Speech to Text

    "Azure Speech in Foundry Tools provides speech to text, text to speech, and other capabilities through a Microsoft Foundry resource. You can transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and conduct live AI voice conversations."

    Usage · free tier · public pricing · self-serve

  • Amazon Transcribe

    "Amazon Transcribe is an automatic speech recognition service that uses machine learning models to convert audio to text. You can use Amazon Transcribe as a standalone transcription service or to add speech-to-text capabilities to any application."

    Usage · free tier · public pricing · self-serve

  • Google Cloud Speech-to-Text

    "Accurate voice typing and transcription powered by Gemini."

    Usage · free tier · public pricing · self-serve

  • IBM watsonx Speech to Text

    "IBM Watson® Speech to Text technology enables fast and accurate speech transcription in multiple languages for a variety of use cases, including but not limited to customer self-service, agent assistance and speech analytics."

    Usage · free tier · public pricing · self-serve

  • AssemblyAI

    "Voice AI infrastructure for developers building products that transcribe, understand, and act on speech."

    Usage · public pricing · self-serve

Rev AI alternatives · Rev AI vs ElevenLabs Scribe (Speech to Text) · All Speech-to-Text & Transcription APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

  1. Description: rev.ai
  2. Pricing model: rev.ai
  3. Published pricing: rev.ai
  4. Free tier: rev.ai · rev.ai
  5. Enterprise plan: rev.ai
  6. Supported actions: docs.rev.ai · docs.rev.ai
  7. Regions: docs.rev.ai
  8. Languages: docs.rev.ai · rev.ai · docs.rev.ai
  9. Webhooks: docs.rev.ai
  10. Sandbox: docs.rev.ai
  11. SDK languages: docs.rev.ai
  12. SOC 2: docs.rev.ai · rev.ai
  13. HIPAA: rev.ai · docs.rev.ai
  14. GDPR: rev.ai · docs.rev.ai
  15. ISO 27001: rev.ai
  16. PCI DSS: rev.ai
  17. Published SLA: rev.ai
  18. Rate limits: docs.rev.ai · docs.rev.ai
  19. Known restrictions: docs.rev.ai · docs.rev.ai

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-21 Capabilities: {}{"translation":true,"real_time_streaming":true,"speaker_diarization":true}
  2. 2026-06-21 Summary Md: (none)Rev AI is a speech-to-text API from Rev, offering both asynchronous batch trans…
  3. 2026-06-21 Score Setup Speed: (none)60
  4. 2026-06-21 Score Docs Quality: (none)25
  5. 2026-06-21 Score Procurement Friction: (none)85
  6. 2026-06-21 Score Trust Readiness: (none)55
  7. 2026-06-21 Best For: (none)Regulated or enterprise workloads - compliance attestations and an enterprise p…
  8. 2026-06-21 Scoring Methodology: (none)Scores are computed deterministically from this profile's published, sourced fi…
  9. 2026-06-21 Avoid If: (none)You want to try it free before paying
  10. 2026-06-21 Score Agent Friendliness: (none)35
  11. 2026-06-21 Score Pricing Transparency: (none)85
  12. 2026-06-21 Llms Txt Present: (none)Yes
  13. 2026-06-21 Rendering: (none)static
  14. 2026-06-21 Has Structured Data: (none)No
  15. 2026-06-21 Robots Allows Agents: (none)Yes
  16. 2026-06-21 Status Page URL: (none)https://status.rev.ai
  17. 2026-06-21 Docs URL: (none)https://docs.rev.ai/
  18. 2026-06-21 Llms Txt URL: (none)https://www.rev.ai/llms.txt
  19. 2026-06-21 Free Tier Available: set to No
  20. 2026-06-21 Free Tier Details: set to One-time trial credit of 5 hours of Reverb ASR for new users (not a recurring f…
  21. 2026-06-21 Self Serve Signup: set to Yes
  22. 2026-06-21 Requires Sales Call: set to No
  23. 2026-06-21 Enterprise Plan Available: set to Yes
  24. 2026-06-21 SOC 2: set to type_2
  25. 2026-06-21 HIPAA: set to Yes
  26. 2026-06-21 GDPR: set to Yes
  27. 2026-06-21 ISO 27001: set to No
  28. 2026-06-21 PCI DSS: set to No
  29. 2026-06-21 SLA Published: set to No
  30. 2026-06-21 Data Retention Policy URL: set to https://docs.rev.ai/api/security
  31. 2026-06-21 Documented Rate Limits: set to Async API: 10,000 transcription requests per 10 minutes; 500 transcriptions pro…
  32. 2026-06-21 Rate Limit Requests: set to 10000
  33. 2026-06-21 Rate Limit Window: set to 10 minutes
  34. 2026-06-21 Known Restrictions: set to Multipart file uploads: 2 GB per request max, source_config uploads: 5 TB max, …
  35. 2026-06-21 Auth Methods: set to api_key
  36. 2026-06-21 Auth Docs URL: set to https://docs.rev.ai/api/asynchronous/get-started/
  37. 2026-06-21 API Style: set to rest
  38. 2026-06-21 Base URL: set to https://api.rev.ai/speechtotext/v1
  39. 2026-06-21 API Version: set to v1
  40. 2026-06-21 Versioning Scheme: set to url
  41. 2026-06-21 Stability: set to ga
  42. 2026-06-21 Quickstart URL: set to https://docs.rev.ai/api/asynchronous/get-started/
  43. 2026-06-21 Error Format: set to vendor-specific
  44. 2026-06-21 Webhook Events URL: set to https://docs.rev.ai/api/asynchronous/webhooks/
  45. 2026-06-21 Requires Verification: set to No
  46. 2026-06-21 Slug: set to rev-ai
  47. 2026-06-21 Price Basis: set to minute
  48. 2026-06-21 Free Tier Limit: set to 5 hours of Reverb ASR
  49. 2026-06-21 Launched At: set to 2010-01-01
  50. 2026-06-21 Notable Customers: set to (none)

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/rev-ai \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/rev-ai/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →