Diffbot
Web Data for your AI [1]
Diffbot turns the web into structured data for AI, with products for market intelligence, news monitoring, machine learning, and e-commerce, including a knowledge graph. The REST API offers API-key auth, webhooks, eleven SDKs, and an official MCP server. Pricing is published and self-serve on a hybrid model from $299/month, with 10,000 credits free each month. It is GDPR compliant. Used by Snapchat, AstraZeneca, Klarna, and Indeed.
Scores
Pricing & procurement
- Pricing model
- Hybrid (base + usage) [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✓ Yes [4]
- Free tier details
- Free forever. 10,000 credits/month, 5 calls/min. No credit card required. [5]
- Self-serve signup
- ✓ Yes [6]
- Requires sales call
- ✗ No [7]
- Enterprise plan
- ✓ Yes [8]
- Minimum commitment
- No contracts required [9]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Free | 10,000 credits | month | $0 | source |
| Free | credit | credit | $0 | source |
| Startup | 250,000 credits | month | $299 | source |
| Startup | credit | credit | $0.001 | source |
| Plus | 1,000,000 credits | month | $899 | source |
| Plus | credit | credit | $0.0009 | source |
Capabilities
- Supported actions
- extract_analyze, extract_article, extract_discussion, extract_event, extract_image, extract_job, extract_list, extract_product, extract_video, enhance_get, enhance_post, bulk_enhance, knowledge_graph_search, knowledge_graph_combine, create_crawl, manage_crawl_job, retrieve_crawl_job_data, search_crawl_job_data, create_bulkjob, download_bulkjob_results, poll_bulkjob_status, list_bulkjobs_for_token, download_bulkjob_coverage_report, delete_bulkjob, download_single_bulkjob_result, stop_bulkjob, create_or_update_custom_api, custom_api_rulesets, extract_with_custom_api, delete_custom_api, retrieve_custom_apis, extract_content_not_available_online, extract_custom_headers, extract_custom_javascript [10]
- Input types
- URL, HTML markup, plain text [11]
- Output types
- JSON [12]
- Webhooks
- ✓ Yes [13]
- Sandbox / test mode
- ✗ No [14]
- SDK languages
- Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust [15]
- MCP server
- ✓ Yes [16]
Trust & compliance
- SOC 2
- – Unknown [17]
- HIPAA
- – Unknown [18]
- GDPR
- ✓ Yes [19]
- ISO 27001
- – Unknown [20]
- PCI DSS
- – Unknown [21]
- Published SLA
- ✗ No [22]
- Rate limits
- Free: 5 Calls Per Minute; Startup: 5 Calls Per Second; Plus: 25 Calls Per Second; Enterprise: 25+ Calls Per Second [23]
- Known restrictions
- sublicense, resell, rent, lease, transfer, assign, time share, or otherwise commercially exploit or make the Service available to any third party, reverse engineer, decompile or disassemble any portion of the Service, bypass any robot exclusion headers or other measures we take to restrict access to the Site or Service, use the API or the Data in any manner that violates the rights of any person, including but not limited to intellectual property rights, rights of privacy or rights of publicity, IN NO EVENT WILL COMPANY'S LIABILITY TO YOU EXCEED $10 [24]
Developer surface
Integration
- API style
- rest
- Base URL
- https://api.diffbot.com/v3
- Version
- v3
- Versioning
- url
- Stability
- ga
- Auth methods
- api_key
- Idempotency keys
- ✗ No
- Error format
- vendor-specific
- Rate limit
- 5 / minute
Adoption & maturity
- Notable customers
- Snapchat, AstraZeneca, Klarna, Indeed, NBC, BuzzFeed, Notion, Quora, SemRush, Sequoia Capital, Andreessen Horowitz, Opera, Doximity, FINRA, Factset, Meltwater, SmartNews, Vice, InMoment, Instapaper
Other Scraping & Crawling APIs
Oxylabs
The best proxy service platform with 175M+ Residential and 2M Datacenter IP proxies. Extract public data from any website with ease!
Firecrawl
The API to search, scrape, and interact with the web at scale.
Bright Data Web Unlocker
Automate your CAPTCHA solving while scraping websites. Our advanced technology rotates IPs, tackles user agents, and solves CAPTCHAs with ease.
ScraperAPI
Collect data from any public website with our web scraping API, without worrying about proxies, browsers, or CAPTCHA handling.
ScrapingBee
ScrapingBee is the best web scraping API that handles proxies and headless browsers for you — so you can focus on extracting the data you need.
Zyte API
Effortlessly scrape data with our all-in-one web scraping API. Unblocking, browser rendering and web data extraction in one full-stack web scraper.
References
- ↑Description: diffbot.com
- ↑Pricing model: diffbot.com · devtune.ai
- ↑Published pricing: diffbot.com · xpay.sh
- ↑Free tier: diffbot.com
- ↑Free tier details: diffbot.com · devtune.ai
- ↑Self-serve signup: diffbot.com · docs.diffbot.com
- ↑Requires sales call: diffbot.com · docs.diffbot.com
- ↑Enterprise plan: diffbot.com · devtune.ai
- ↑Minimum commitment: diffbot.com
- ↑Supported actions: docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com
- ↑Input types: docs.diffbot.com · docs.diffbot.com · docs.diffbot.com
- ↑Output types: github.com · docs.diffbot.com
- ↑Webhooks: blog.diffbot.com · diffbot.com
- ↑Sandbox: docs.diffbot.com · docs.diffbot.com
- ↑SDK languages: github.com · github.com · github.com · github.com · github.com · github.com · github.com · github.com · github.com
- ↑MCP server: github.com
- ↑SOC 2: diffbot.com
- ↑HIPAA: diffbot.com
- ↑GDPR: diffbot.com · aidoos.com
- ↑ISO 27001: diffbot.com
- ↑PCI DSS: diffbot.com
- ↑Published SLA: status.diffbot.com · status.diffbot.com
- ↑Rate limits: diffbot.com · docs.diffbot.com
- ↑Known restrictions: diffbot.com
Change history
- 2026-06-08 Llms Txt Present: (none) → No
- 2026-06-08 Rendering: (none) → static
- 2026-06-08 Status Page URL: (none) → https://status.diffbot.com
- 2026-06-08 Docs URL: (none) → https://docs.diffbot.com
- 2026-06-07 Summary Md: (none) → Diffbot turns the web into structured data for AI, with products for market int…
- 2026-06-07 SDK Packages: Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust → Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust
- 2026-06-07 Supported Actions: set to extract_analyze, extract_article, extract_discussion, extract_event, extract_im…
- 2026-06-07 Supported Regions: set to (none)
- 2026-06-07 Supported Languages: set to (none)
- 2026-06-07 Input Types: set to URL, HTML markup, plain text
- 2026-06-07 Output Types: set to JSON
- 2026-06-07 Webhooks Supported: set to Yes
- 2026-06-07 Sandbox Available: set to No
- 2026-06-07 SDK Languages: set to Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust
- 2026-06-07 SDK Packages: set to Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust
- 2026-06-07 MCP Server Available: set to Yes
- 2026-06-07 Pricing Model: set to hybrid
- 2026-06-07 Has Published Pricing: set to Yes
- 2026-06-07 Free Tier Available: set to Yes
- 2026-06-07 Free Tier Details: set to Free forever. 10,000 credits/month, 5 calls/min. No credit card required.
- 2026-06-07 Minimum Commitment: set to No contracts required
- 2026-06-07 Self Serve Signup: set to Yes
- 2026-06-07 Requires Sales Call: set to No
- 2026-06-07 Enterprise Plan Available: set to Yes
- 2026-06-07 GDPR: set to Yes
- 2026-06-07 SLA Published: set to No
- 2026-06-07 Data Retention Policy URL: set to https://www.diffbot.com/company/privacy/
- 2026-06-07 Documented Rate Limits: set to Free: 5 Calls Per Minute; Startup: 5 Calls Per Second; Plus: 25 Calls Per Secon…
- 2026-06-07 Rate Limit Requests: set to 5
- 2026-06-07 Rate Limit Window: set to minute
- 2026-06-07 Known Restrictions: set to sublicense, resell, rent, lease, transfer, assign, time share, or otherwise com…
- 2026-06-07 Auth Methods: set to api_key
- 2026-06-07 Auth Docs URL: set to https://docs.diffbot.com/reference/authentication
- 2026-06-07 API Style: set to rest
- 2026-06-07 Base URL: set to https://api.diffbot.com/v3
- 2026-06-07 API Version: set to v3
- 2026-06-07 Versioning Scheme: set to url
- 2026-06-07 Stability: set to ga
- 2026-06-07 MCP URL: set to https://github.com/diffbot/diffbot-mcp
- 2026-06-07 Idempotency Supported: set to No
- 2026-06-07 Error Format: set to vendor-specific
- 2026-06-07 Requires Verification: set to No
- 2026-06-07 Starting Price Usd: set to 299
- 2026-06-07 Price Basis: set to month
- 2026-06-07 Free Tier Limit: set to 10,000 credits/month
- 2026-06-07 Notable Customers: set to Snapchat, AstraZeneca, Klarna, Indeed, NBC, BuzzFeed, Notion, Quora, SemRush, S…
- 2026-06-07 Fields Not Found: set to supported_regions, soc2, iso_27001, hipaa, pci_dss, ga_date, launched_at
- 2026-06-07 Source Confidence: set to high
- 2026-06-07 Extractor: set to parallel:ultra
- 2026-06-07 Slug: set to diffbot