Amazon Textract

"Automatically extract printed text, handwriting, layout elements, and data from any document" [1]

aws.amazon.com/textract/ · By Amazon Web Services · Agent JSON · Suggest an edit · Last verified 2026-06-14 · Source confidence: high

Amazon Textract is an AWS document intelligence service that extracts printed text, handwriting, form fields, tables, and structured data from PDFs and images, targeting industries such as healthcare, financial services, and lending. Pricing is usage-based starting at $0.0015 per page, with a free tier of 1,000 pages per month for the first three months and no sales call required to get started. The service is available across 16 AWS regions including GovCloud, holds SOC 2 Type II, HIPAA, GDPR, ISO 27001, and PCI DSS certifications, and offers SDKs for seven languages.

Best for / Avoid if

Best for: Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box

Avoid if: You want to try it free before paying

Scores

  • 35 / 100
    Agent friendliness
  • 85 / 100
    Pricing transparency
  • 60 / 100
    Setup speed
  • 25 / 100
    Docs quality
  • 85 / 100
    Procurement ease
  • 100 / 100
    Trust readiness

Scores are computed deterministically from this profile's published, sourced fields (pricing, compliance, capabilities, docs and developer-surface signals) - not from reviews or paid placement. Each axis is 0-100; an unknown signal scores 0 for that axis. Procurement ease is the inverse of buying friction (higher = easier to adopt).

Pricing & procurement

Pricing model
Usage-based [2]
Published pricing
Yes [3]
Free tier
No [4]
Free tier details
3-month time-limited trial for new AWS customers only: Detect Document Text 1,000 pages/month; Analyze Document (Signatures) 1,000 pages/month; Analyze Document (Forms/Tables/Layout/Queries) 100 pages/month; Analyze Expense 100 pages/month; Analyze ID 100 pages/month; Analyze Lending 2,000 pages/month. Not a perpetual free tier.
Self-serve signup
Yes [5]
Requires sales call
No
Enterprise plan
Yes [6]
Published prices
PlanItemPerAmountSource
Trial (3-month, new AWS customers only)Detect Document Text API — 1,000 pages/monthmonth$0source
Trial (3-month, new AWS customers only)Analyze Document API (Signatures only) — 1,000 pages/monthmonth$0source
Trial (3-month, new AWS customers only)Analyze Document API (Forms, Tables, Layout, Queries) — 100 pages/month eachmonth$0source
Trial (3-month, new AWS customers only)Analyze Expense API — 100 pages/monthmonth$0source
Trial (3-month, new AWS customers only)Analyze ID API — 100 pages/monthmonth$0source
Trial (3-month, new AWS customers only)Analyze Lending API — 2,000 pages/monthmonth$0source
Pay As You GoDetect Document Text API — first 1M pagespage$0.0015source
Pay As You GoDetect Document Text API — after 1M pagespage$0.0006source
Pay As You GoAnalyze Document API (Forms) — first 1M pagespage$0.05source
Pay As You GoAnalyze Document API (Forms) — after 1M pagespage$0.04source
Pay As You GoAnalyze Document API (Tables) — first 1M pagespage$0.015source
Pay As You GoAnalyze Document API (Tables) — after 1M pagespage$0.01source
Pay As You GoAnalyze Document API (Queries) — first 1M pagespage$0.015source
Pay As You GoAnalyze Document API (Tables + Queries) — first 1M pagespage$0.02source
Pay As You GoAnalyze Document API (Tables + Queries) — after 1M pagespage$0.015source
Pay As You GoAnalyze Document API (Forms + Tables + Queries) — first 1M pagespage$0.07source
Pay As You GoAnalyze Document API (Forms + Tables + Queries) — after 1M pagespage$0.055source
Pay As You GoAnalyze Document API (Signatures) — first 1M pagespage$0.0035source
Pay As You GoAnalyze Document API (Signatures) — after 1M pagespage$0.0014source
Pay As You GoAnalyze Document API (Custom Queries) — first 1M pagespage$0.025source
Pay As You GoAnalyze Document API (Custom Queries) — after 1M pagespage$0.015source
Pay As You GoAnalyze Document API (Forms + Custom Queries) — first 1M pagespage$0.065source
Pay As You GoAnalyze Document API (Forms + Custom Queries) — after 1M pagespage$0.05source
Pay As You GoAnalyze Expense API — first 1M pagespage$0.01source
Pay As You GoAnalyze Expense API — after 1M pagespage$0.008source
Pay As You GoAnalyze ID API — first 100K pagespage$0.025source
Pay As You GoAnalyze ID API — after 100K pagespage$0.01source
Pay As You GoAnalyze Lending API — first 1M pages (classified and extracted pages only)page$0.07source
Pay As You GoAnalyze Lending API — after 1M pages (classified and extracted pages only)page$0.055source

Capabilities

  • Receipts / invoices
  • ID documents
  • Table extraction
  • Handwriting
  • Custom models
Supported actions
DetectDocumentText, AnalyzeDocument (Forms, Tables, Queries, Signatures, Layout, Custom Queries), AnalyzeExpense, AnalyzeID, StartDocumentTextDetection, StartDocumentAnalysis, StartExpenseAnalysis, StartIDDocumentAnalysis, GetDocumentTextDetection, GetDocumentAnalysis, GetExpenseAnalysis, GetIDDocumentAnalysis, StartLendingAnalysis, GetLendingAnalysis, GetLendingAnalysisSummary, CreateAdapter, CreateAdapterVersion, DeleteAdapter, DeleteAdapterVersion, UpdateAdapter, GetAdapter, GetAdapterVersion, ListAdapters, ListAdapterVersions [7]
Regions
US East (N. Virginia) us-east-1, US East (Ohio) us-east-2, US West (N. California) us-west-1, US West (Oregon) us-west-2, Asia Pacific (Mumbai) ap-south-1, Asia Pacific (Seoul) ap-northeast-2, Asia Pacific (Singapore) ap-southeast-1, Asia Pacific (Sydney) ap-southeast-2, Canada (Central) ca-central-1, Europe (Frankfurt) eu-central-1, Europe (Ireland) eu-west-1, Europe (London) eu-west-2, Europe (Paris) eu-west-3, Europe (Spain) eu-south-2, AWS GovCloud (US-East) us-gov-east-1, AWS GovCloud (US-West) us-gov-west-1 [8]
Languages
English, French, German, Italian, Portuguese, Spanish [9]
Input types
JPEG, PNG, PDF, TIFF [10]
Output types
JSON [11]
Webhooks
Yes [12]
Sandbox / test mode
No
SDK languages
Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP [13]
MCP server
No

Trust & compliance

SOC 2
SOC 2 Type II [14]
HIPAA
Yes [15]
GDPR
Yes [16]
ISO 27001
Yes [17]
PCI DSS
Yes [18]
Published SLA
Yes [19]
Rate limits
Default TPS quotas vary by region and API. In US East (N. Virginia): DetectDocumentText sync=25 TPS, AnalyzeDocument sync=10 TPS, AnalyzeExpense sync=5 TPS, AnalyzeID sync=5 TPS; StartDocumentTextDetection async=15 TPS, StartDocumentAnalysis async=10 TPS, StartExpenseAnalysis async=5 TPS, StartLendingAnalysis async=5 TPS; Maximum concurrent async jobs=600 in us-east-1/us-west-2. Synchronous ops: max 15 queries per page; Asynchronous ops: max 30 queries per page. Limits are adjustable via AWS Service Quotas console. [20]
Known restrictions
Supported file formats: JPEG, PNG, PDF, TIFF only (no XFA-based PDFs), Sync operations: max 10 MB file size, PDF/TIFF limited to 1 page, Async operations: PDF/TIFF max 500 MB and 3,000 pages; JPEG/PNG max 10 MB, PDF max height/width: 40 inches and 9,000 points; cannot be password protected, Max image resolution: 10,000 pixels on all sides, Handwritten text recognition supported in English only, Query feature (natural language) available in English documents only, AnalyzeID supports US passports and US driver's licenses only, Vertical text (e.g., Japanese, Chinese vertical layout) not supported, Minimum text height: 15 pixels (equivalent to 8pt font at 150 DPI), Up to 15 queries per page (sync) and 30 queries per page (async), PCI DSS users must opt out of content use for service improvement via AWS Support, Max 10 adapters per account; max 10 adapter versions per month; max 3 in-progress adapter training jobs, Layout feature is free when used with the Tables feature [21]

Developer surface

Docs rendering: static · llms.txt present

Integration

API style
rest
Base URL
https://textract.{region}.amazonaws.com
Version
2018-06-27
Versioning
header
Stability
ga
Auth methods
hmac_signature
Idempotency keys
Yes
Error format
vendor-specific
Rate limit
25 / second

SDKs

  • Python boto3 · repo
  • JavaScript/Node.js @aws-sdk/client-textract · repo
  • Java software.amazon.awssdk:textract · repo
  • .NET AWSSDK.Textract · repo
  • Go github.com/aws/aws-sdk-go-v2/service/textract · repo
  • Ruby aws-sdk-textract · repo
  • PHP aws/aws-sdk-php · repo

Adoption & maturity

Launched
2018-11-28
GA
2019-05-29
Notable customers
Change Healthcare, Roche, Elevance Health, Pennymac, Cox Automotive, The Washington Post, HelloSign, PitchBook

Other OCR & Document Parsing APIs

  • Veryfi

    "Documents into Data - securely, in seconds"

    Hybrid · free tier · public pricing · self-serve

  • Google Document AI

    "A document processing and understanding platform that takes unstructured data from documents and transforms it into structured data, making it easier to understand, analyze, and consume."

    Usage · public pricing · self-serve

  • Azure AI Document Intelligence

    "Azure Document Intelligence in Foundry Tools is a machine-learning based OCR and intelligent document processing service to automate extraction of key data from forms and documents."

    Usage · free tier · public pricing · self-serve

  • Nanonets

    "AI Agents for Enterprise Data Processing."

    Usage · public pricing · self-serve

  • Extend

    "Turn documents into high quality data"

    Hybrid · public pricing · self-serve

  • Mindee

    "Turn your document data into structured JSON with high-reliability. Zero model training required."

    Hybrid · public pricing · self-serve

Amazon Textract alternatives · Amazon Textract vs Veryfi · All OCR & Document Parsing APIs APIs

References

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-15 Best For: Regulated or enterprise workloads - compliance attestations and an enterprise p…Regulated or enterprise workloads - compliance attestations and an enterprise p…
  2. 2026-06-15 Score Docs Quality: 1525
  3. 2026-06-15 Score Agent Friendliness: 1035
  4. 2026-06-14 Llms Txt URL: (none)https://docs.aws.amazon.com/llms.txt
  5. 2026-06-14 Llms Txt Present: NoYes
  6. 2026-06-14 Robots Allows Agents: (none)Yes
  7. 2026-06-14 Has Structured Data: (none)No
  8. 2026-06-14 Capabilities: {}{"tables":true,"handwriting":true,"id_documents":true,"custom_models":true,"rec…
  9. 2026-06-14 Summary Md: (none)Amazon Textract is an AWS document intelligence service that extracts printed t…
  10. 2026-06-14 Score Docs Quality: 015
  11. 2026-06-14 Llms Txt Present: (none)No
  12. 2026-06-14 Rendering: (none)static
  13. 2026-06-14 Status Page URL: (none)https://status.aws.amazon.com
  14. 2026-06-14 Docs URL: (none)https://docs.aws.amazon.com
  15. 2026-06-14 Scoring Methodology: (none)Scores are computed deterministically from this profile's published, sourced fi…
  16. 2026-06-14 Score Setup Speed: (none)60
  17. 2026-06-14 Score Pricing Transparency: (none)85
  18. 2026-06-14 Score Agent Friendliness: (none)10
  19. 2026-06-14 Best For: (none)Regulated or enterprise workloads - compliance attestations and an enterprise p…
  20. 2026-06-14 Avoid If: (none)You want to try it free before paying
  21. 2026-06-14 Score Trust Readiness: (none)100
  22. 2026-06-14 Score Procurement Friction: (none)85
  23. 2026-06-14 Score Docs Quality: (none)0
  24. 2026-06-14 SDK Packages: Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHPPython, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP
  25. 2026-06-14 Starting Price Usd: 0.00150.0015
  26. 2026-06-14 Last Verified At: 2026-06-13T00:00:00.000Z2026-06-14T00:00:00.000Z
  27. 2026-06-14 Starting Price Usd: 0.00150.0015
  28. 2026-06-14 SDK Packages: Python, JavaScript/Node.js, Java, .NET, Go, Ruby, PHPPython, JavaScript/Node.js, Java, .NET, Go, Ruby, PHP
  29. 2026-06-13 PCI DSS: set to Yes
  30. 2026-06-13 SLA Published: set to Yes
  31. 2026-06-13 SLA URL: set to https://aws.amazon.com/textract/sla/
  32. 2026-06-13 Data Retention Policy URL: set to https://docs.aws.amazon.com/textract/latest/dg/data-protection.html
  33. 2026-06-13 Documented Rate Limits: set to Default TPS quotas vary by region and API. In US East (N. Virginia): DetectDocu…
  34. 2026-06-13 Rate Limit Requests: set to 25
  35. 2026-06-13 Rate Limit Window: set to second
  36. 2026-06-13 Known Restrictions: set to Supported file formats: JPEG, PNG, PDF, TIFF only (no XFA-based PDFs), Sync ope…
  37. 2026-06-13 Auth Methods: set to hmac_signature
  38. 2026-06-13 Auth Docs URL: set to https://docs.aws.amazon.com/textract/latest/dg/security-iam.html
  39. 2026-06-13 API Style: set to rest
  40. 2026-06-13 Base URL: set to https://textract.{region}.amazonaws.com
  41. 2026-06-13 API Version: set to 2018-06-27
  42. 2026-06-13 Versioning Scheme: set to header
  43. 2026-06-13 Deprecation Policy URL: set to https://docs.aws.amazon.com/general/latest/gr/service-lifecycle.html
  44. 2026-06-13 Quickstart URL: set to https://docs.aws.amazon.com/textract/latest/dg/getting-started.html
  45. 2026-06-13 Idempotency Supported: set to Yes
  46. 2026-06-13 Error Format: set to vendor-specific
  47. 2026-06-13 Webhook Events URL: set to https://docs.aws.amazon.com/textract/latest/dg/async-notification-payload.html
  48. 2026-06-13 Requires Verification: set to No
  49. 2026-06-13 Starting Price Usd: set to 0.0015
  50. 2026-06-13 Price Basis: set to page

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/aws-textract \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/aws-textract/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →