Amazon Textract
"Automatically extract printed text, handwriting, layout elements, and data from any document" [1]
Amazon Textract is an AWS document intelligence service that extracts printed text, handwriting, form fields, tables, and structured data from PDFs and images, targeting industries such as healthcare, financial services, and lending. Pricing is usage-based starting at $0.0015 per page, with a free tier of 1,000 pages per month for the first three months and no sales call required to get started. The service is available across 16 AWS regions including GovCloud, holds SOC 2 Type II, HIPAA, GDPR, ISO 27001, and PCI DSS certifications, and offers SDKs for seven languages.
Best for / Avoid if
Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box
Avoid if: You want to try it free before paying
Scores
- 35 / 100Agent friendliness
- 85 / 100Pricing transparency
- 60 / 100Setup speed
- 25 / 100Docs quality
- 85 / 100Procurement ease
- 100 / 100Trust readiness
Pricing & procurement
- Pricing model
- Usage-based [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✗ No [4]
- Free tier details
- 3-month time-limited trial for new AWS customers only: Detect Document Text 1,000 pages/month; Analyze Document (Signatures) 1,000 pages/month; Analyze Document (Forms/Tables/Layout/Queries) 100 pages/month; Analyze Expense 100 pages/month; Analyze ID 100 pages/month; Analyze Lending 2,000 pages/month. Not a perpetual free tier.
- Self-serve signup
- ✓ Yes [5]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [6]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Trial (3-month, new AWS customers only) | Detect Document Text API — 1,000 pages/month | month | $0 | source |
| Trial (3-month, new AWS customers only) | Analyze Document API (Signatures only) — 1,000 pages/month | month | $0 | source |
| Trial (3-month, new AWS customers only) | Analyze Document API (Forms, Tables, Layout, Queries) — 100 pages/month each | month | $0 | source |
| Trial (3-month, new AWS customers only) | Analyze Expense API — 100 pages/month | month | $0 | source |
| Trial (3-month, new AWS customers only) | Analyze ID API — 100 pages/month | month | $0 | source |
| Trial (3-month, new AWS customers only) | Analyze Lending API — 2,000 pages/month | month | $0 | source |
| Pay As You Go | Detect Document Text API — first 1M pages | page | $0.0015 | source |
| Pay As You Go | Detect Document Text API — after 1M pages | page | $0.0006 | source |
| Pay As You Go | Analyze Document API (Forms) — first 1M pages | page | $0.05 | source |
| Pay As You Go | Analyze Document API (Forms) — after 1M pages | page | $0.04 | source |
| Pay As You Go | Analyze Document API (Tables) — first 1M pages | page | $0.015 | source |
| Pay As You Go | Analyze Document API (Tables) — after 1M pages | page | $0.01 | source |
| Pay As You Go | Analyze Document API (Queries) — first 1M pages | page | $0.015 | source |
| Pay As You Go | Analyze Document API (Tables + Queries) — first 1M pages | page | $0.02 | source |
| Pay As You Go | Analyze Document API (Tables + Queries) — after 1M pages | page | $0.015 | source |
| Pay As You Go | Analyze Document API (Forms + Tables + Queries) — first 1M pages | page | $0.07 | source |
| Pay As You Go | Analyze Document API (Forms + Tables + Queries) — after 1M pages | page | $0.055 | source |
| Pay As You Go | Analyze Document API (Signatures) — first 1M pages | page | $0.0035 | source |
| Pay As You Go | Analyze Document API (Signatures) — after 1M pages | page | $0.0014 | source |
| Pay As You Go | Analyze Document API (Custom Queries) — first 1M pages | page | $0.025 | source |
| Pay As You Go | Analyze Document API (Custom Queries) — after 1M pages | page | $0.015 | source |
| Pay As You Go | Analyze Document API (Forms + Custom Queries) — first 1M pages | page | $0.065 | source |
| Pay As You Go | Analyze Document API (Forms + Custom Queries) — after 1M pages | page | $0.05 | source |
| Pay As You Go | Analyze Expense API — first 1M pages | page | $0.01 | source |
| Pay As You Go | Analyze Expense API — after 1M pages | page | $0.008 | source |
| Pay As You Go | Analyze ID API — first 100K pages | page | $0.025 | source |
| Pay As You Go | Analyze ID API — after 100K pages | page | $0.01 | source |
| Pay As You Go | Analyze Lending API — first 1M pages (classified and extracted pages only) | page | $0.07 | source |
| Pay As You Go | Analyze Lending API — after 1M pages (classified and extracted pages only) | page | $0.055 | source |
Capabilities
- Supported actions
- DetectDocumentText, AnalyzeDocument (Forms, Tables, Queries, Signatures, Layout, Custom Queries), AnalyzeExpense, AnalyzeID, StartDocumentTextDetection, StartDocumentAnalysis, StartExpenseAnalysis, StartIDDocumentAnalysis, GetDocumentTextDetection, GetDocumentAnalysis, GetExpenseAnalysis, GetIDDocumentAnalysis, StartLendingAnalysis, GetLendingAnalysis, GetLendingAnalysisSummary, CreateAdapter, CreateAdapterVersion, DeleteAdapter, DeleteAdapterVersion, UpdateAdapter, GetAdapter, GetAdapterVersion, ListAdapters, ListAdapterVersions [7]
- Regions
- US East (N. Virginia) us-east-1, US East (Ohio) us-east-2, US West (N. California) us-west-1, US West (Oregon) us-west-2, Asia Pacific (Mumbai) ap-south-1, Asia Pacific (Seoul) ap-northeast-2, Asia Pacific (Singapore) ap-southeast-1, Asia Pacific (Sydney) ap-southeast-2, Canada (Central) ca-central-1, Europe (Frankfurt) eu-central-1, Europe (Ireland) eu-west-1, Europe (London) eu-west-2, Europe (Paris) eu-west-3, Europe (Spain) eu-south-2, AWS GovCloud (US-East) us-gov-east-1, AWS GovCloud (US-West) us-gov-west-1 [8]
- Languages
- English, French, German, Italian, Portuguese, Spanish [9]
- Input types
- JPEG, PNG, PDF, TIFF [10]
- Output types
- JSON [11]
- Webhooks
- ✓ Yes [12]
- Sandbox / test mode
- ✗ No
- SDK languages
- Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP [13]
- MCP server
- ✗ No
Trust & compliance
- SOC 2
- SOC 2 Type II [14]
- HIPAA
- ✓ Yes [15]
- GDPR
- ✓ Yes [16]
- ISO 27001
- ✓ Yes [17]
- PCI DSS
- ✓ Yes [18]
- Published SLA
- ✓ Yes [19]
- Rate limits
- Default TPS quotas vary by region and API. In US East (N. Virginia): DetectDocumentText sync=25 TPS, AnalyzeDocument sync=10 TPS, AnalyzeExpense sync=5 TPS, AnalyzeID sync=5 TPS; StartDocumentTextDetection async=15 TPS, StartDocumentAnalysis async=10 TPS, StartExpenseAnalysis async=5 TPS, StartLendingAnalysis async=5 TPS; Maximum concurrent async jobs=600 in us-east-1/us-west-2. Synchronous ops: max 15 queries per page; Asynchronous ops: max 30 queries per page. Limits are adjustable via AWS Service Quotas console. [20]
- Known restrictions
- Supported file formats: JPEG, PNG, PDF, TIFF only (no XFA-based PDFs), Sync operations: max 10 MB file size, PDF/TIFF limited to 1 page, Async operations: PDF/TIFF max 500 MB and 3,000 pages; JPEG/PNG max 10 MB, PDF max height/width: 40 inches and 9,000 points; cannot be password protected, Max image resolution: 10,000 pixels on all sides, Handwritten text recognition supported in English only, Query feature (natural language) available in English documents only, AnalyzeID supports US passports and US driver's licenses only, Vertical text (e.g., Japanese, Chinese vertical layout) not supported, Minimum text height: 15 pixels (equivalent to 8pt font at 150 DPI), Up to 15 queries per page (sync) and 30 queries per page (async), PCI DSS users must opt out of content use for service improvement via AWS Support, Max 10 adapters per account; max 10 adapter versions per month; max 3 in-progress adapter training jobs, Layout feature is free when used with the Tables feature [21]
Developer surface
Integration
- API style
- rest
- Base URL
- https://textract.{region}.amazonaws.com
- Version
- 2018-06-27
- Versioning
- header
- Stability
- ga
- Auth methods
- hmac_signature
- Idempotency keys
- ✓ Yes
- Error format
- vendor-specific
- Rate limit
- 25 / second
Adoption & maturity
- Launched
- 2018-11-28
- GA
- 2019-05-29
- Notable customers
- Change Healthcare, Roche, Elevance Health, Pennymac, Cox Automotive, The Washington Post, HelloSign, PitchBook
Other OCR & Document Parsing APIs
Veryfi
"Documents into Data - securely, in seconds"
Google Document AI
"A document processing and understanding platform that takes unstructured data from documents and transforms it into structured data, making it easier to understand, analyze, and consume."
Azure AI Document Intelligence
"Azure Document Intelligence in Foundry Tools is a machine-learning based OCR and intelligent document processing service to automate extraction of key data from forms and documents."
Nanonets
"AI Agents for Enterprise Data Processing."
Extend
"Turn documents into high quality data"
Mindee
"Turn your document data into structured JSON with high-reliability. Zero model training required."
References
- ↑Description: aws.amazon.com · docs.aws.amazon.com
- ↑Pricing model: docs.aws.amazon.com · aws.amazon.com
- ↑Published pricing: aws.amazon.com
- ↑Free tier: aws.amazon.com · aws.amazon.com
- ↑Self-serve signup: docs.aws.amazon.com · docs.aws.amazon.com
- ↑Enterprise plan: docs.aws.amazon.com · docs.aws.amazon.com
- ↑Supported actions: docs.aws.amazon.com · docs.aws.amazon.com
- ↑Regions: docs.aws.amazon.com · aws.amazon.com
- ↑Languages: docs.aws.amazon.com · aws.amazon.com
- ↑Input types: docs.aws.amazon.com · docs.aws.amazon.com
- ↑Output types: docs.aws.amazon.com
- ↑Webhooks: docs.aws.amazon.com · docs.aws.amazon.com
- ↑SDK languages: docs.aws.amazon.com · docs.aws.amazon.com
- ↑SOC 2: aws.amazon.com
- ↑HIPAA: aws.amazon.com · aws.amazon.com
- ↑GDPR: aws.amazon.com · docs.aws.amazon.com
- ↑ISO 27001: aws.amazon.com · aws.amazon.com
- ↑PCI DSS: aws.amazon.com
- ↑Published SLA: aws.amazon.com · aws.amazon.com
- ↑Rate limits: docs.aws.amazon.com · docs.aws.amazon.com
- ↑Known restrictions: docs.aws.amazon.com · docs.aws.amazon.com
Change history
- 2026-06-15 Best For: Regulated or enterprise workloads - compliance attestations and an enterprise p… → Regulated or enterprise workloads - compliance attestations and an enterprise p…
- 2026-06-15 Score Docs Quality: 15 → 25
- 2026-06-15 Score Agent Friendliness: 10 → 35
- 2026-06-14 Llms Txt URL: (none) → https://docs.aws.amazon.com/llms.txt
- 2026-06-14 Llms Txt Present: No → Yes
- 2026-06-14 Robots Allows Agents: (none) → Yes
- 2026-06-14 Has Structured Data: (none) → No
- 2026-06-14 Capabilities: {} → {"tables":true,"handwriting":true,"id_documents":true,"custom_models":true,"rec…
- 2026-06-14 Summary Md: (none) → Amazon Textract is an AWS document intelligence service that extracts printed t…
- 2026-06-14 Score Docs Quality: 0 → 15
- 2026-06-14 Llms Txt Present: (none) → No
- 2026-06-14 Rendering: (none) → static
- 2026-06-14 Status Page URL: (none) → https://status.aws.amazon.com
- 2026-06-14 Docs URL: (none) → https://docs.aws.amazon.com
- 2026-06-14 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-14 Score Setup Speed: (none) → 60
- 2026-06-14 Score Pricing Transparency: (none) → 85
- 2026-06-14 Score Agent Friendliness: (none) → 10
- 2026-06-14 Best For: (none) → Regulated or enterprise workloads - compliance attestations and an enterprise p…
- 2026-06-14 Avoid If: (none) → You want to try it free before paying
- 2026-06-14 Score Trust Readiness: (none) → 100
- 2026-06-14 Score Procurement Friction: (none) → 85
- 2026-06-14 Score Docs Quality: (none) → 0
- 2026-06-14 SDK Packages: Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP → Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP
- 2026-06-14 Starting Price Usd: 0.0015 → 0.0015
- 2026-06-14 Last Verified At: 2026-06-13T00:00:00.000Z → 2026-06-14T00:00:00.000Z
- 2026-06-14 Starting Price Usd: 0.0015 → 0.0015
- 2026-06-14 SDK Packages: Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP → Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP
- 2026-06-13 PCI DSS: set to Yes
- 2026-06-13 SLA Published: set to Yes
- 2026-06-13 SLA URL: set to https://aws.amazon.com/textract/sla/
- 2026-06-13 Data Retention Policy URL: set to https://docs.aws.amazon.com/textract/latest/dg/data-protection.html
- 2026-06-13 Documented Rate Limits: set to Default TPS quotas vary by region and API. In US East (N. Virginia): DetectDocu…
- 2026-06-13 Rate Limit Requests: set to 25
- 2026-06-13 Rate Limit Window: set to second
- 2026-06-13 Known Restrictions: set to Supported file formats: JPEG, PNG, PDF, TIFF only (no XFA-based PDFs), Sync ope…
- 2026-06-13 Auth Methods: set to hmac_signature
- 2026-06-13 Auth Docs URL: set to https://docs.aws.amazon.com/textract/latest/dg/security-iam.html
- 2026-06-13 API Style: set to rest
- 2026-06-13 Base URL: set to https://textract.{region}.amazonaws.com
- 2026-06-13 API Version: set to 2018-06-27
- 2026-06-13 Versioning Scheme: set to header
- 2026-06-13 Deprecation Policy URL: set to https://docs.aws.amazon.com/general/latest/gr/service-lifecycle.html
- 2026-06-13 Quickstart URL: set to https://docs.aws.amazon.com/textract/latest/dg/getting-started.html
- 2026-06-13 Idempotency Supported: set to Yes
- 2026-06-13 Error Format: set to vendor-specific
- 2026-06-13 Webhook Events URL: set to https://docs.aws.amazon.com/textract/latest/dg/async-notification-payload.html
- 2026-06-13 Requires Verification: set to No
- 2026-06-13 Starting Price Usd: set to 0.0015
- 2026-06-13 Price Basis: set to page
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/aws-textract \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/aws-textract/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'