Azure AI Document Intelligence

"Azure Document Intelligence in Foundry Tools is a machine-learning based OCR and intelligent document processing service to automate extraction of key data from forms and documents." [1]

azure.microsoft.com/en-us/products/ai-services/ai-document-intelligence · By Microsoft · Last verified 2026-06-14 · Source confidence: high

Azure AI Document Intelligence is a machine-learning OCR and document processing service from Microsoft. It extracts structured data from forms, invoices, receipts, identity documents, tax forms, bank statements, and dozens of other document types through a REST API, with SDKs for Python, JavaScript, Java, and C#/.NET. It suits teams automating accounts payable, mortgage processing, or RAG data preparation. Pay-as-you-go pricing starts at $1.50 per 1,000 pages, and a free tier covers 500 pages per month. The service lists SOC 2 Type II and ISO 27001 attestations along with HIPAA and GDPR coverage, and it is available in more than 25 regions.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-readable surface (llms.txt is present; no MCP server is published)

Scores

  • 45 / 100
    Agent friendliness
  • 100 / 100
    Pricing transparency
  • 85 / 100
    Setup speed
  • 25 / 100
    Docs quality
  • 100 / 100
    Procurement ease
  • 90 / 100
    Trust readiness

Scores are computed deterministically from this profile's published, sourced fields (pricing, compliance, capabilities, docs and developer-surface signals) - not from reviews or paid placement. Each axis is 0-100; an unknown signal scores 0 for that axis. Procurement ease is the inverse of buying friction (higher = easier to adopt).

Pricing & procurement

Pricing model
Usage-based [2]
Published pricing
Yes [3]
Free tier
Yes [4]
Free tier details
Document Intelligence offers a persistent free tier (F0) for testing all features. F0 includes 500 pages per month with no time limit. On F0, only the first two pages of each request are analyzed, the maximum document size is 4 MB, and throughput is capped at 1 TPS. No SLA is provided on the free tier.
Self-serve signup
Yes [5]
Requires sales call
No
Enterprise plan
Yes [6]
Published prices
Plan | Item | Per | Amount
Free (F0) | All models: 500 pages per month included | month | $0
Pay As You Go (S0) | Read (OCR): first 1M pages/month | 1,000 pages | $1.50
Pay As You Go (S0) | Read (OCR): over 1M pages/month | 1,000 pages | $0.60
Pay As You Go (S0) | Layout model | 1,000 pages | $10
Pay As You Go (S0) | Prebuilt models (Invoice, Receipt, ID Document, W-2, 1098, Health Insurance Card, Contract, Business Card, Credit Card, Pay Stub, Bank Statement, Mortgage, etc.) | 1,000 pages | $10
Pay As You Go (S0) | Custom classification model: analysis | 1,000 pages | $3
Pay As You Go (S0) | Custom extraction model (template/neural): analysis | 1,000 pages | $30
Pay As You Go (S0) | Custom generative extraction model: analysis | 1,000 pages | $30
Pay As You Go (S0) | Query fields add-on | 1,000 pages | $10
Pay As You Go (S0) | High resolution / font / formula add-on | 1,000 pages | $6
Pay As You Go (S0) | Custom neural model training: first 10 hours/month | month | $0
Pay As You Go (S0) | Custom neural model training: beyond 10 free hours/month | hour | $3
Commitment Tier - Prebuilt (500K pages/month) | Monthly commitment fee: 500,000 pages included | month | $375
Commitment Tier - Prebuilt (8M pages/month) | Monthly commitment fee: 8,000,000 pages included | month | $4,200
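
The two Read (OCR) rates in the published prices can be turned into a quick monthly estimate. The sketch below is illustrative only: the function name is hypothetical, and it assumes the rates apply marginally (the first 1M pages at $1.50 per 1,000, only the pages beyond 1M at $0.60 per 1,000), which this profile does not state explicitly. It ignores add-ons and commitment tiers.

```python
from decimal import Decimal

# Rates copied from the published price list (USD per 1,000 pages).
READ_RATE_FIRST_MILLION = Decimal("1.50")
READ_RATE_OVER_MILLION = Decimal("0.60")
TIER_BOUNDARY_PAGES = 1_000_000


def estimate_read_cost(pages_per_month: int) -> Decimal:
    """Estimate the monthly pay-as-you-go (S0) cost of the Read model in USD.

    Assumption: tiering is marginal, so the lower rate applies only to the
    pages beyond the first 1M in a month.
    """
    if pages_per_month < 0:
        raise ValueError("pages_per_month must be non-negative")
    first = min(pages_per_month, TIER_BOUNDARY_PAGES)
    rest = max(pages_per_month - TIER_BOUNDARY_PAGES, 0)
    return (first * READ_RATE_FIRST_MILLION + rest * READ_RATE_OVER_MILLION) / 1000
```

Under that assumption, 1,500,000 Read pages in a month would come to 1,000 x $1.50 + 500 x $0.60 = $1,800.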

Capabilities

  • Receipts / invoices
  • ID documents
  • Bank statements
  • Table extraction
  • Custom models
  • LLM / RAG-ready output
Supported actions
analyze_document (prebuilt-read), analyze_document (prebuilt-layout), analyze_document (prebuilt-invoice), analyze_document (prebuilt-receipt), analyze_document (prebuilt-idDocument), analyze_document (prebuilt-contract), analyze_document (prebuilt-creditCard), analyze_document (prebuilt-check.us), analyze_document (prebuilt-payStub.us), analyze_document (prebuilt-bankStatement), analyze_document (prebuilt-healthInsuranceCard.us), analyze_document (prebuilt-marriageCertificate.us), analyze_document (prebuilt-mortgage.us.1003), analyze_document (prebuilt-mortgage.us.1004), analyze_document (prebuilt-mortgage.us.1005), analyze_document (prebuilt-mortgage.us.1008), analyze_document (prebuilt-mortgage.us.closingDisclosure), analyze_document (prebuilt-tax.us), analyze_document (prebuilt-tax.us.w2), analyze_document (prebuilt-tax.us.1040), analyze_document (prebuilt-tax.us.1098), analyze_document (prebuilt-tax.us.1099), build_custom_model (template), build_custom_model (neural), build_custom_classifier, compose_models, copy_model, delete_analyze_result, get_analyze_result, list_models, get_model, delete_model, ocr_high_resolution (add-on), formula_extraction (add-on), font_property_extraction (add-on), barcode_extraction (add-on), searchable_pdf_generation (add-on), query_fields (add-on), key_value_pairs (add-on) [7]
Regions
East US, East US 2, West US, West US 2, West US 3, Central US, North Central US, South Central US, Canada Central, Canada East, North Europe, West Europe, UK South, France Central, Germany West Central, Switzerland North, East Asia, Southeast Asia, Australia East, Japan East, Korea Central, Brazil South, South Africa North, UAE North, Azure Government (US), Azure China (21Vianet), Azure US Government (Fairfax) [8]
Languages
Abaza, Abkhazian, Achinese, Acoli, Adangme, Adyghe, Afar, Afrikaans, Akan, Albanian, Algonquin, Arabic, Asturian, Azerbaijani, Bambara, Bashkir, Basque, Belarusian, Bislama, Bosnian, Breton, Bulgarian, Catalan, Cebuano, Chamorro, Chinese Simplified, Chinese Traditional, Cornish, Corsican, Croatian, Czech, Danish, Dari, Dutch, English, Estonian, Faroese, Fijian, Filipino, Finnish, French, Galician, German, Greek, Gujarati (detection only), Haitian Creole, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Interlingua, Irish, Italian, Japanese, Javanese, Kazakh, Korean, Kurdish, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malay, Maltese, Maori, Marathi, Mongolian, Nepali, Norwegian, Occitan, Persian, Polish, Portuguese, Punjabi, Romanian, Romansh, Russian, Samoan, Sanskrit, Scottish Gaelic, Serbian, Slovak, Slovenian, Somali, Spanish, Swahili, Swedish, Tajik, Tamil, Tatar, Thai, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Zulu [9]
Input types
PDF, JPEG, JPG, PNG, BMP, TIFF, HEIF, DOCX, PPTX, XLSX [10]
Output types
JSON, Searchable PDF [11]
Webhooks
No [12]
Sandbox / test mode
No
SDK languages
C# / .NET, Java, JavaScript / TypeScript, Python [13]
MCP server
No [14]

Trust & compliance

SOC 2
SOC 2 Type II [15]
HIPAA
Yes [16]
GDPR
Yes [17]
ISO 27001
Yes [18]
PCI DSS
No [19]
Published SLA
Yes [20]
Rate limits
Free (F0): 1 TPS Analyze, 1 TPS GET operations, 1 TPS model management, 1 TPS list operations; max doc size 4 MB; max pages per analysis 2. Standard (S0): 15 TPS Analyze (default, adjustable), 50 TPS GET operations (default, adjustable), 5 TPS model management (default, adjustable), 10 TPS list operations (default, adjustable); max doc size 500 MB; max pages per analysis 2,000. (Source: https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/service-limits) [21]
Known restrictions
  • Free (F0) tier limits document analysis to only the first two pages per request
  • Free (F0) tier max document size is 4 MB vs 500 MB on Standard
  • PDF and TIFF max 2,000 pages (Standard tier)
  • Image dimensions must be between 50x50 pixels and 10,000x10,000 pixels
  • Minimum text height for extraction is 12 pixels for a 1024x768 image
  • Password-locked PDFs must have password removed before submission
  • Microsoft Office files (DOCX, PPTX, XLSX) are only supported by Read and Layout models, not prebuilt or custom extraction models
  • No webhook/callback support; async operations require polling
  • Data and results are temporarily stored in Azure Storage in the same region for 24 hours, then deleted
  • Custom neural model free training capped at 10 hours/month (v4.0); additional training billed per hour
  • Disconnected container support limited to custom and invoice models
  • Copying models across cloud environments (e.g., public cloud to Azure Government) is not supported [22]
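
Several of the limits above can be checked locally before a document is submitted. The following is a minimal sketch, not part of any Azure SDK: `TierLimits`, `LIMITS`, and `preflight` are hypothetical names, the numbers are copied from the rate limits and restrictions listed in this profile, and "MB" is assumed to mean 2**20 bytes because the profile does not specify the unit.

```python
from dataclasses import dataclass

# Assumption: "MB" is read as 2**20 bytes; the profile does not say which unit is meant.
MB = 1024 * 1024


@dataclass(frozen=True)
class TierLimits:
    max_bytes: int  # largest accepted document
    max_pages: int  # pages analyzed per request


LIMITS = {
    "F0": TierLimits(max_bytes=4 * MB, max_pages=2),
    "S0": TierLimits(max_bytes=500 * MB, max_pages=2000),
}

OFFICE_EXTENSIONS = {"docx", "pptx", "xlsx"}
SUPPORTED_EXTENSIONS = {"pdf", "jpeg", "jpg", "png", "bmp", "tiff", "heif"} | OFFICE_EXTENSIONS
OFFICE_CAPABLE_MODELS = {"prebuilt-read", "prebuilt-layout"}


def preflight(filename: str, size_bytes: int, page_count: int,
              tier: str, model_id: str) -> list:
    """Return human-readable problems found before upload (empty list = none)."""
    limits = LIMITS[tier]
    extension = filename.rsplit(".", 1)[-1].lower() if "." in filename else ""
    problems = []
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported input type: {extension or 'none'}")
    elif extension in OFFICE_EXTENSIONS and model_id not in OFFICE_CAPABLE_MODELS:
        problems.append(f"{extension.upper()} is only supported by the Read and Layout models")
    if size_bytes > limits.max_bytes:
        problems.append(f"file exceeds the {tier} size limit of {limits.max_bytes // MB} MB")
    if page_count > limits.max_pages:
        if tier == "F0":
            # F0 does not reject long documents; it analyzes only the first two pages.
            problems.append("F0 analyzes only the first 2 pages of each request")
        else:
            problems.append(f"document exceeds {limits.max_pages} pages")
    return problems
```

A check like this only catches the cheap cases (type, size, page count). Image dimensions, text height, and password protection would need the file to be opened and are left out of the sketch.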

Developer surface

Docs rendering: static · llms.txt present

Integration

API style
REST
Base URL
https://{endpoint}/documentintelligence
Version
2024-11-30
Versioning
URL
Stability
GA
Auth methods
API key, OAuth 2.0
Idempotency keys
No
Error format
vendor-specific (JSON: {error: {code, message, details, innererror}})
Rate limit
15 / second
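
Because the service has no webhooks, a client starts an analysis and then polls for the result. The sketch below shows that flow with the standard library only. It rests on several assumptions that are not stated in this profile and should be checked against Microsoft's REST reference: the `documentModels/{modelId}:analyze` route, the `api-version` query parameter, the `Ocp-Apim-Subscription-Key` header, the `urlSource` body field, and the `status` values `succeeded` and `failed`. The function names and the `get_json` callable are hypothetical, and `endpoint` is taken to be the full resource URL including the scheme.

```python
import json
import time
import urllib.request

API_VERSION = "2024-11-30"


def build_analyze_request(endpoint, model_id, api_key, document_url):
    """Build (but do not send) the POST that starts an analysis."""
    url = (
        f"{endpoint.rstrip('/')}/documentintelligence/documentModels/"
        f"{model_id}:analyze?api-version={API_VERSION}"
    )
    body = json.dumps({"urlSource": document_url}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        "Ocp-Apim-Subscription-Key": api_key,  # assumed API-key header name
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def poll_result(get_json, operation_url, interval_s=1.0, max_attempts=60,
                sleep=time.sleep):
    """Poll the operation URL until the analysis finishes.

    get_json is a caller-supplied callable (hypothetical) that performs an
    authenticated GET and returns the decoded JSON body as a dict.
    """
    for _ in range(max_attempts):
        payload = get_json(operation_url)
        status = payload.get("status")
        if status == "succeeded":
            return payload.get("analyzeResult", {})
        if status == "failed":
            # Error envelope as listed above: {error: {code, message, ...}}
            error = payload.get("error", {})
            raise RuntimeError(f"{error.get('code')}: {error.get('message')}")
        sleep(interval_s)  # still in progress; wait before the next GET
    raise TimeoutError(f"analysis still running after {max_attempts} polls")
```

The one-second default interval is a conservative choice that stays within the 1 TPS GET limit listed for the free tier. Passing `get_json` and `sleep` as parameters keeps the loop testable without network access.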

SDKs

  • C# / .NET: Azure.AI.DocumentIntelligence
  • Java: com.azure:azure-ai-documentintelligence
  • JavaScript / TypeScript: @azure-rest/ai-document-intelligence
  • Python: azure-ai-documentintelligence

Adoption & maturity

Launched
2019-01-01
GA
2020-08-01

Other OCR & Document Parsing APIs

  • Amazon Textract

    "Automatically extract printed text, handwriting, layout elements, and data from any document"

    Usage · public pricing · self-serve

  • Veryfi

    "Documents into Data - securely, in seconds"

    Hybrid · free tier · public pricing · self-serve

  • Google Document AI

    "A document processing and understanding platform that takes unstructured data from documents and transforms it into structured data, making it easier to understand, analyze, and consume."

    Usage · public pricing · self-serve

  • Nanonets

    "AI Agents for Enterprise Data Processing."

    Usage · public pricing · self-serve

  • Extend

    "Turn documents into high quality data"

    Hybrid · public pricing · self-serve

  • Mindee

    "Turn your document data into structured JSON with high-reliability. Zero model training required."

    Hybrid · public pricing · self-serve


References

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-15 Score Agent Friendliness: 25 -> 45
  2. 2026-06-14 Robots Allows Agents: (none) -> Yes
  3. 2026-06-14 Has Structured Data: (none) -> Yes
  4. 2026-06-14 Capabilities: {} -> {"tables":true,"id_documents":true,"custom_models":true,"agentic_output":true,"…
  5. 2026-06-14 Summary Md: (none) -> Azure AI Document Intelligence is a machine-learning OCR and document processin…
  6. 2026-06-14 Score Docs Quality: 0 -> 25
  7. 2026-06-14 Best For: Prototypes and side projects - free to start, no sales call, Regulated or enter… -> Prototypes and side projects - free to start, no sales call, Regulated or enter…
  8. 2026-06-14 Score Agent Friendliness: 10 -> 25
  9. 2026-06-14 Llms Txt Present: (none) -> Yes
  10. 2026-06-14 Llms Txt URL: (none) -> https://azure.microsoft.com/llms.txt
  11. 2026-06-14 Rendering: (none) -> static
  12. 2026-06-14 Docs URL: (none) -> https://azure.microsoft.com/developer
  13. 2026-06-14 Score Agent Friendliness: (none) -> 10
  14. 2026-06-14 Score Pricing Transparency: (none) -> 100
  15. 2026-06-14 Best For: (none) -> Prototypes and side projects - free to start, no sales call, Regulated or enter…
  16. 2026-06-14 Scoring Methodology: (none) -> Scores are computed deterministically from this profile's published, sourced fi…
  17. 2026-06-14 Score Trust Readiness: (none) -> 90
  18. 2026-06-14 Score Procurement Friction: (none) -> 100
  19. 2026-06-14 Score Docs Quality: (none) -> 0
  20. 2026-06-14 Score Setup Speed: (none) -> 85
  21. 2026-06-14 Last Verified At: 2026-06-13T00:00:00.000Z -> 2026-06-14T00:00:00.000Z
  22. 2026-06-14 Starting Price Usd: 1.5 -> 1.5
  23. 2026-06-14 SDK Packages: C# / .NET, Java, JavaScript / TypeScript, Python -> C# / .NET, Java, JavaScript / TypeScript, Python
  24. 2026-06-13 Enterprise Plan Available: set to Yes
  25. 2026-06-13 SOC 2: set to type_2
  26. 2026-06-13 GDPR: set to Yes
  27. 2026-06-13 ISO 27001: set to Yes
  28. 2026-06-13 PCI DSS: set to No
  29. 2026-06-13 SLA Published: set to Yes
  30. 2026-06-13 SLA URL: set to https://azure.microsoft.com/en-us/support/legal/sla/cognitive-services/
  31. 2026-06-13 Data Retention Policy URL: set to https://learn.microsoft.com/en-us/legal/cognitive-services/document-intelligenc…
  32. 2026-06-13 Documented Rate Limits: set to Free (F0): 1 TPS Analyze, 1 TPS GET operations, 1 TPS model management, 1 TPS l…
  33. 2026-06-13 Rate Limit Requests: set to 15
  34. 2026-06-13 Rate Limit Window: set to second
  35. 2026-06-13 Known Restrictions: set to Free (F0) tier limits document analysis to only the first two pages per request…
  36. 2026-06-13 Auth Methods: set to api_key, oauth2
  37. 2026-06-13 Auth Docs URL: set to https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/versi…
  38. 2026-06-13 API Style: set to rest
  39. 2026-06-13 Base URL: set to https://{endpoint}/documentintelligence
  40. 2026-06-13 API Version: set to 2024-11-30
  41. 2026-06-13 Versioning Scheme: set to url
  42. 2026-06-13 Stability: set to ga
  43. 2026-06-13 Deprecation Policy URL: set to https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/train…
  44. 2026-06-13 Quickstart URL: set to https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/quick…
  45. 2026-06-13 Idempotency Supported: set to No
  46. 2026-06-13 Error Format: set to vendor-specific (JSON: {error: {code, message, details, innererror}})
  47. 2026-06-13 Requires Verification: set to No
  48. 2026-06-13 Starting Price Usd: set to 1.5
  49. 2026-06-13 Price Basis: set to 1,000 pages
  50. 2026-06-13 Free Tier Limit: set to 500 pages/month

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/azure-document-intelligence \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/azure-document-intelligence/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'
