Azure AI Document Intelligence
"Azure Document Intelligence in Foundry Tools is a machine-learning based OCR and intelligent document processing service to automate extraction of key data from forms and documents." [1]
Azure AI Document Intelligence is Microsoft's machine-learning OCR and document processing service. It extracts structured data from forms, invoices, receipts, identity documents, tax forms, bank statements, and dozens of other document types through a REST API, with SDKs for Python, JavaScript, Java, and C#/.NET. It suits teams automating accounts payable, mortgage processing, or RAG data preparation. Pay-as-you-go pricing starts at $1.50 per 1,000 pages (Read model), and a free tier covers 500 pages per month. The profile lists SOC 2 Type II and ISO 27001 attestations, HIPAA and GDPR coverage, and availability in more than 25 regions.
Best for / Avoid if
Best for:
- Prototypes and side projects - free to start, no sales call
- Regulated or enterprise workloads - compliance attestations and an enterprise plan
- AI agents and automation - an llms.txt file is published, though no MCP server is listed
Scores
- Agent friendliness: 45 / 100
- Pricing transparency: 100 / 100
- Setup speed: 85 / 100
- Docs quality: 25 / 100
- Procurement ease: 100 / 100
- Trust readiness: 90 / 100
Pricing & procurement
- Pricing model
- Usage-based [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✓ Yes [4]
- Free tier details
- Document Intelligence offers a persistent free tier (F0) for testing all features. F0 includes 500 pages/month; no time limit. F0 limits document analysis to only the first two pages per request, max document size 4 MB, and 1 TPS. No SLA is provided on the free tier.
- Self-serve signup
- ✓ Yes [5]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [6]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Free (F0) | All models: 500 pages per month included | month | $0 | source |
| Pay As You Go (S0) | Read (OCR): first 1M pages/month | 1,000 pages | $1.50 | source |
| Pay As You Go (S0) | Read (OCR): over 1M pages/month | 1,000 pages | $0.60 | source |
| Pay As You Go (S0) | Layout model | 1,000 pages | $10 | source |
| Pay As You Go (S0) | Prebuilt models (Invoice, Receipt, ID Document, W-2, 1098, Health Insurance Card, Contract, Business Card, Credit Card, Pay Stub, Bank Statement, Mortgage, etc.) | 1,000 pages | $10 | source |
| Pay As You Go (S0) | Custom classification model: analysis | 1,000 pages | $3 | source |
| Pay As You Go (S0) | Custom extraction model (template/neural): analysis | 1,000 pages | $30 | source |
| Pay As You Go (S0) | Custom generative extraction model: analysis | 1,000 pages | $30 | source |
| Pay As You Go (S0) | Query fields add-on | 1,000 pages | $10 | source |
| Pay As You Go (S0) | High resolution / font / formula add-on | 1,000 pages | $6 | source |
| Pay As You Go (S0) | Custom neural model training: first 10 hours/month | month | $0 | source |
| Pay As You Go (S0) | Custom neural model training: beyond 10 free hours/month | hour | $3 | source |
| Commitment Tier: Prebuilt (500K pages/month) | Monthly commitment fee: 500,000 pages included | month | $375 | source |
| Commitment Tier: Prebuilt (8M pages/month) | Monthly commitment fee: 8,000,000 pages included | month | $4,200 | source |
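The pay-as-you-go rates in the table can be turned into a rough monthly estimate. The sketch below is illustrative only: it assumes graduated billing for Read (the lower rate applies only to pages beyond the first 1M), ignores add-ons, commitment tiers, and the F0 allowance, and uses function and constant names that are hypothetical rather than part of any Azure SDK.

```python
# Rates copied from the pricing table (USD per 1,000 pages, S0 tier).
READ_RATE_FIRST_TIER = 1.50    # first 1M pages/month
READ_RATE_OVER_TIER = 0.60     # pages beyond 1M/month
READ_TIER_LIMIT = 1_000_000

# Flat per-1,000-page rates for other model kinds (keys are hypothetical labels).
FLAT_RATES = {
    "layout": 10.0,
    "prebuilt": 10.0,
    "custom_classification": 3.0,
    "custom_extraction": 30.0,
}


def estimate_read_cost(pages: int) -> float:
    """Estimate monthly Read (OCR) cost, ASSUMING graduated tiers."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    first = min(pages, READ_TIER_LIMIT)
    over = max(pages - READ_TIER_LIMIT, 0)
    cost = first / 1000 * READ_RATE_FIRST_TIER + over / 1000 * READ_RATE_OVER_TIER
    return round(cost, 2)


def estimate_flat_cost(pages: int, model_kind: str) -> float:
    """Estimate monthly cost for a flat-rate model kind listed in FLAT_RATES."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return round(pages / 1000 * FLAT_RATES[model_kind], 2)


if __name__ == "__main__":
    print(estimate_read_cost(250_000))
    print(estimate_flat_cost(250_000, "prebuilt"))
```

Actual invoices depend on Azure's billing rules and region, so treat the result as a planning figure and confirm against the published pricing page.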
Capabilities
- Supported actions
- analyze_document (prebuilt-read), analyze_document (prebuilt-layout), analyze_document (prebuilt-invoice), analyze_document (prebuilt-receipt), analyze_document (prebuilt-idDocument), analyze_document (prebuilt-contract), analyze_document (prebuilt-creditCard), analyze_document (prebuilt-check.us), analyze_document (prebuilt-payStub.us), analyze_document (prebuilt-bankStatement), analyze_document (prebuilt-healthInsuranceCard.us), analyze_document (prebuilt-marriageCertificate.us), analyze_document (prebuilt-mortgage.us.1003), analyze_document (prebuilt-mortgage.us.1004), analyze_document (prebuilt-mortgage.us.1005), analyze_document (prebuilt-mortgage.us.1008), analyze_document (prebuilt-mortgage.us.closingDisclosure), analyze_document (prebuilt-tax.us), analyze_document (prebuilt-tax.us.w2), analyze_document (prebuilt-tax.us.1040), analyze_document (prebuilt-tax.us.1098), analyze_document (prebuilt-tax.us.1099), build_custom_model (template), build_custom_model (neural), build_custom_classifier, compose_models, copy_model, delete_analyze_result, get_analyze_result, list_models, get_model, delete_model, ocr_high_resolution (add-on), formula_extraction (add-on), font_property_extraction (add-on), barcode_extraction (add-on), searchable_pdf_generation (add-on), query_fields (add-on), key_value_pairs (add-on) [7]
- Regions
- East US, East US 2, West US, West US 2, West US 3, Central US, North Central US, South Central US, Canada Central, Canada East, North Europe, West Europe, UK South, France Central, Germany West Central, Switzerland North, East Asia, Southeast Asia, Australia East, Japan East, Korea Central, Brazil South, South Africa North, UAE North, Azure Government (US), Azure China (21Vianet), Azure US Government (Fairfax) [8]
- Languages
- Abaza, Abkhazian, Achinese, Acoli, Adangme, Adyghe, Afar, Afrikaans, Akan, Albanian, Algonquin, Arabic, Asturian, Azerbaijani, Bambara, Bashkir, Basque, Belarusian, Bislama, Bosnian, Breton, Bulgarian, Catalan, Cebuano, Chamorro, Chinese Simplified, Chinese Traditional, Cornish, Corsican, Croatian, Czech, Danish, Dari, Dutch, English, Estonian, Faroese, Fijian, Filipino, Finnish, French, Galician, German, Greek, Gujarati (detection only), Haitian Creole, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Interlingua, Irish, Italian, Japanese, Javanese, Kazakh, Korean, Kurdish, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malay, Maltese, Maori, Marathi, Mongolian, Nepali, Norwegian, Occitan, Persian, Polish, Portuguese, Punjabi, Romanian, Romansh, Russian, Samoan, Sanskrit, Scottish Gaelic, Serbian, Slovak, Slovenian, Somali, Spanish, Swahili, Swedish, Tajik, Tamil, Tatar, Thai, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Zulu [9]
- Input types
- PDF, JPEG, JPG, PNG, BMP, TIFF, HEIF, DOCX, PPTX, XLSX [10]
- Output types
- JSON, Searchable PDF [11]
- Webhooks
- ✗ No [12]
- Sandbox / test mode
- ✗ No
- SDK languages
- C# / .NET, Java, JavaScript / TypeScript, Python [13]
- MCP server
- ✗ No [14]
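Because the profile lists no webhook support, a client has to poll for the result of an asynchronous analyze call. The following is a minimal sketch of such a loop. The status strings (`notStarted`, `running`, `succeeded`, `failed`) are assumptions based on the service's REST conventions, not values taken from this profile, and `fetch_result` is a hypothetical callable that the caller supplies (for example, a function that performs the GET against the operation URL and returns the parsed JSON).

```python
import time
from typing import Any, Callable, Dict

# Assumed terminal status values; check the REST reference before relying on them.
TERMINAL_STATUSES = {"succeeded", "failed"}


def poll_until_done(
    fetch_result: Callable[[], Dict[str, Any]],
    interval_s: float = 1.0,
    timeout_s: float = 120.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, Any]:
    """Call fetch_result repeatedly until it reports a terminal status.

    Raises TimeoutError if no terminal status is seen within timeout_s.
    """
    deadline = clock() + timeout_s
    while True:
        result = fetch_result()
        status = str(result.get("status", "")).lower()
        if status in TERMINAL_STATUSES:
            return result
        if clock() >= deadline:
            raise TimeoutError(f"operation still '{status}' after {timeout_s}s")
        # Wait before the next GET so polling stays under the GET rate limit.
        sleep(interval_s)
```

Injecting `sleep` and `clock` keeps the loop testable without real waiting; in production the defaults are usually enough, with `interval_s` chosen to respect the tier's GET rate limit.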
Trust & compliance
- SOC 2
- SOC 2 Type II [15]
- HIPAA
- ✓ Yes [16]
- GDPR
- ✓ Yes [17]
- ISO 27001
- ✓ Yes [18]
- PCI DSS
- ✗ No [19]
- Published SLA
- ✓ Yes [20]
- Rate limits
- Free (F0): 1 TPS each for Analyze, GET, model management, and list operations; max document size 4 MB; max 2 pages per analysis. Standard (S0): 15 TPS Analyze, 50 TPS GET, 5 TPS model management, and 10 TPS list operations (all defaults, adjustable); max document size 500 MB; max 2,000 pages per analysis. (Source: https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/service-limits) [21]
- Known restrictions
- The Free (F0) tier analyzes only the first two pages per request; F0 max document size is 4 MB vs 500 MB on Standard; PDF and TIFF files are limited to 2,000 pages (Standard tier); image dimensions must be between 50x50 and 10,000x10,000 pixels; minimum text height for extraction is 12 pixels for a 1024x768 image; password-locked PDFs must have the password removed before submission; Microsoft Office files (DOCX, PPTX, XLSX) are supported only by the Read and Layout models, not by prebuilt or custom extraction models; there is no webhook/callback support, so async operations require polling; data and results are stored temporarily in Azure Storage in the same region for 24 hours, then deleted; free custom neural model training is capped at 10 hours/month (v4.0), with additional training billed per hour; disconnected container support is limited to custom and invoice models; copying models across cloud environments (e.g., public cloud to Azure Government) is not supported [22]
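Several of the limits above can be checked on the client before a document is uploaded. The sketch below encodes only the size, page-count, and Office-file rules stated in this profile. The names (`TierLimits`, `preflight_warnings`) are hypothetical, and treating 1 MB as 2^20 bytes is an assumption, since the profile does not say which definition the service uses.

```python
import os
from dataclasses import dataclass
from typing import List

MB = 1024 * 1024  # ASSUMPTION: the profile does not define MB precisely


@dataclass(frozen=True)
class TierLimits:
    max_bytes: int
    max_pages: int  # pages analyzed per request


# Values taken from the rate-limit and restriction entries in this profile.
TIER_LIMITS = {
    "F0": TierLimits(max_bytes=4 * MB, max_pages=2),
    "S0": TierLimits(max_bytes=500 * MB, max_pages=2000),
}
OFFICE_EXTENSIONS = {".docx", ".pptx", ".xlsx"}
OFFICE_CAPABLE_MODELS = {"prebuilt-read", "prebuilt-layout"}


def preflight_warnings(
    tier: str, model_id: str, filename: str, size_bytes: int, page_count: int
) -> List[str]:
    """Return human-readable warnings; an empty list means no known limit is hit."""
    limits = TIER_LIMITS[tier]
    warnings: List[str] = []
    if size_bytes > limits.max_bytes:
        warnings.append(
            f"file is {size_bytes} bytes; {tier} allows at most {limits.max_bytes}"
        )
    if page_count > limits.max_pages:
        warnings.append(
            f"document has {page_count} pages; {tier} analyzes at most "
            f"{limits.max_pages} per request"
        )
    extension = os.path.splitext(filename)[1].lower()
    if extension in OFFICE_EXTENSIONS and model_id not in OFFICE_CAPABLE_MODELS:
        warnings.append(
            f"{extension} input is only listed for Read and Layout, not {model_id}"
        )
    return warnings
```

For example, `preflight_warnings("F0", "prebuilt-invoice", "scan.pdf", 5 * MB, 3)` would flag both the size and the page count, which is cheaper to learn locally than from a rejected or truncated request.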
Developer surface
Integration
- API style
- rest
- Base URL
- https://{endpoint}/documentintelligence
- Version
- 2024-11-30
- Versioning
- url
- Stability
- ga
- Auth methods
- api_key, oauth2
- Idempotency keys
- ✗ No
- Error format
- vendor-specific (JSON: {error: {code, message, details, innererror}})
- Rate limit
- 15 / second
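The integration fields above (base URL, API version, API-key auth, and the JSON error envelope) can be combined into a small request builder and error formatter. This is a sketch under stated assumptions: the `documentModels/{modelId}:analyze` path, the `api-version` query parameter, the `Ocp-Apim-Subscription-Key` header, and the `urlSource` body field come from general knowledge of the Azure REST API, not from this profile, so verify them against the official reference. The function names are hypothetical, and no network call is made.

```python
import json
from typing import Dict, Tuple

API_VERSION = "2024-11-30"  # version listed in this profile


def build_analyze_request(
    endpoint: str, model_id: str, api_key: str, document_url: str
) -> Tuple[str, Dict[str, str], str]:
    """Build (url, headers, body) for an analyze call without sending it.

    `endpoint` is the resource URL, e.g. "https://<resource>.cognitiveservices.azure.com".
    """
    base = endpoint.rstrip("/")
    url = (
        f"{base}/documentintelligence/documentModels/{model_id}:analyze"
        f"?api-version={API_VERSION}"
    )
    headers = {
        "Ocp-Apim-Subscription-Key": api_key,  # assumed API-key header name
        "Content-Type": "application/json",
    }
    body = json.dumps({"urlSource": document_url})  # assumed body field
    return url, headers, body


def describe_error(payload: str) -> str:
    """Flatten {error: {code, message, innererror}} into one readable line."""
    error = json.loads(payload).get("error", {})
    parts = [f"{error.get('code', 'Unknown')}: {error.get('message', '')}"]
    inner = error.get("innererror")
    # innererror may nest; walk it until it runs out.
    while isinstance(inner, dict):
        parts.append(f"{inner.get('code', 'Unknown')}: {inner.get('message', '')}")
        inner = inner.get("innererror")
    return " -> ".join(parts)
```

Keeping request construction separate from transport makes it easy to log or unit-test the exact URL and headers, and the same tuple can be passed to any HTTP client.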
Adoption & maturity
- Launched
- 2019-01-01
- GA
- 2020-08-01
Other OCR & Document Parsing APIs
Amazon Textract
"Automatically extract printed text, handwriting, layout elements, and data from any document"
Veryfi
"Documents into Data - securely, in seconds"
Google Document AI
"A document processing and understanding platform that takes unstructured data from documents and transforms it into structured data, making it easier to understand, analyze, and consume."
Nanonets
"AI Agents for Enterprise Data Processing."
Extend
"Turn documents into high quality data"
Mindee
"Turn your document data into structured JSON with high-reliability. Zero model training required."
References
- [1] Description: learn.microsoft.com · learn.microsoft.com
- [2] Pricing model: learn.microsoft.com · azure.microsoft.com
- [3] Published pricing: learn.microsoft.com · azure.microsoft.com
- [4] Free tier: learn.microsoft.com · azure.microsoft.com · learn.microsoft.com
- [5] Self-serve signup: learn.microsoft.com
- [6] Enterprise plan: learn.microsoft.com · azure.microsoft.com
- [7] Supported actions: learn.microsoft.com · learn.microsoft.com · learn.microsoft.com
- [8] Regions: learn.microsoft.com · learn.microsoft.com
- [9] Languages: learn.microsoft.com · learn.microsoft.com
- [10] Input types: learn.microsoft.com
- [11] Output types: learn.microsoft.com · learn.microsoft.com
- [12] Webhooks: learn.microsoft.com
- [13] SDK languages: learn.microsoft.com · learn.microsoft.com
- [14] MCP server: learn.microsoft.com
- [15] SOC 2: learn.microsoft.com · azure.microsoft.com
- [16] HIPAA: learn.microsoft.com · learn.microsoft.com
- [17] GDPR: learn.microsoft.com · learn.microsoft.com
- [18] ISO 27001: learn.microsoft.com · azure.microsoft.com
- [19] PCI DSS: azure.microsoft.com
- [20] Published SLA: learn.microsoft.com · azure.microsoft.com
- [21] Rate limits: learn.microsoft.com
- [22] Known restrictions: learn.microsoft.com · learn.microsoft.com
Change history
- 2026-06-15 Score Agent Friendliness: 25 → 45
- 2026-06-14 Robots Allows Agents: (none) → Yes
- 2026-06-14 Has Structured Data: (none) → Yes
- 2026-06-14 Capabilities: {} → {"tables":true,"id_documents":true,"custom_models":true,"agentic_output":true,"…
- 2026-06-14 Summary Md: (none) → Azure AI Document Intelligence is a machine-learning OCR and document processin…
- 2026-06-14 Score Docs Quality: 0 → 25
- 2026-06-14 Best For: Prototypes and side projects - free to start, no sales call, Regulated or enter… → Prototypes and side projects - free to start, no sales call, Regulated or enter…
- 2026-06-14 Score Agent Friendliness: 10 → 25
- 2026-06-14 Llms Txt Present: (none) → Yes
- 2026-06-14 Llms Txt URL: (none) → https://azure.microsoft.com/llms.txt
- 2026-06-14 Rendering: (none) → static
- 2026-06-14 Docs URL: (none) → https://azure.microsoft.com/developer
- 2026-06-14 Score Agent Friendliness: (none) → 10
- 2026-06-14 Score Pricing Transparency: (none) → 100
- 2026-06-14 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
- 2026-06-14 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-14 Score Trust Readiness: (none) → 90
- 2026-06-14 Score Procurement Friction: (none) → 100
- 2026-06-14 Score Docs Quality: (none) → 0
- 2026-06-14 Score Setup Speed: (none) → 85
- 2026-06-14 Last Verified At: 2026-06-13T00:00:00.000Z → 2026-06-14T00:00:00.000Z
- 2026-06-14 Starting Price Usd: 1.5 → 1.5
- 2026-06-14 SDK Packages: C# / .NET, Java, JavaScript / TypeScript, Python → C# / .NET, Java, JavaScript / TypeScript, Python
- 2026-06-13 Enterprise Plan Available: set to Yes
- 2026-06-13 SOC 2: set to type_2
- 2026-06-13 GDPR: set to Yes
- 2026-06-13 ISO 27001: set to Yes
- 2026-06-13 PCI DSS: set to No
- 2026-06-13 SLA Published: set to Yes
- 2026-06-13 SLA URL: set to https://azure.microsoft.com/en-us/support/legal/sla/cognitive-services/
- 2026-06-13 Data Retention Policy URL: set to https://learn.microsoft.com/en-us/legal/cognitive-services/document-intelligenc…
- 2026-06-13 Documented Rate Limits: set to Free (F0): 1 TPS Analyze, 1 TPS GET operations, 1 TPS model management, 1 TPS l…
- 2026-06-13 Rate Limit Requests: set to 15
- 2026-06-13 Rate Limit Window: set to second
- 2026-06-13 Known Restrictions: set to Free (F0) tier limits document analysis to only the first two pages per request…
- 2026-06-13 Auth Methods: set to api_key, oauth2
- 2026-06-13 Auth Docs URL: set to https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/versi…
- 2026-06-13 API Style: set to rest
- 2026-06-13 Base URL: set to https://{endpoint}/documentintelligence
- 2026-06-13 API Version: set to 2024-11-30
- 2026-06-13 Versioning Scheme: set to url
- 2026-06-13 Stability: set to ga
- 2026-06-13 Deprecation Policy URL: set to https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/train…
- 2026-06-13 Quickstart URL: set to https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/quick…
- 2026-06-13 Idempotency Supported: set to No
- 2026-06-13 Error Format: set to vendor-specific (JSON: {error: {code, message, details, innererror}})
- 2026-06-13 Requires Verification: set to No
- 2026-06-13 Starting Price Usd: set to 1.5
- 2026-06-13 Price Basis: set to 1,000 pages
- 2026-06-13 Free Tier Limit: set to 500 pages/month