Diffbot

Web Data for your AI [1]

www.diffbot.com · Agent JSON · Last verified 2026-06-06 · Source confidence: high

Diffbot turns the web into structured data for AI, with products for market intelligence, news monitoring, machine learning, and e-commerce, including a knowledge graph. Its REST API offers API-key authentication, webhooks, SDKs in eleven languages, and an official MCP server. Pricing is published and self-serve on a hybrid (base plus usage) model: paid plans start at $299/month, and the free tier includes 10,000 credits each month. Diffbot is listed as GDPR compliant, and its customers include Snapchat, AstraZeneca, Klarna, and Indeed.

Scores

Scores are derived in a separate pass from the literal fields below; not yet computed for this profile.

Pricing & procurement

Pricing model
Hybrid (base + usage) [2]
Published pricing
Yes [3]
Free tier
Yes [4]
Free tier details
Free forever. 10,000 credits/month, 5 calls/min. No credit card required. [5]
Self-serve signup
Yes [6]
Requires sales call
No [7]
Enterprise plan
Yes [8]
Minimum commitment
No contracts required [9]
Published prices
Plan · Item · Per · Amount · Source
Free · 10,000 credits · month · $0 · source
Free · credit · credit · $0 · source
Startup · 250,000 credits · month · $299 · source
Startup · credit · credit · $0.001 · source
Plus · 1,000,000 credits · month · $899 · source
Plus · credit · credit · $0.0009 · source
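
As a rough illustration of how a hybrid (base + usage) bill could be estimated from the published prices, the sketch below adds the monthly base fee to a per-credit charge. It rests on one unconfirmed assumption: that the per-credit price applies only to credits used beyond the plan's monthly allotment. The profile does not state this, so check the vendor's pricing page before relying on it. The names `PLANS` and `estimate_monthly_cost` are hypothetical.

```python
from decimal import Decimal

# Figures copied from the published price table in this profile.
PLANS = {
    "Startup": {"base": Decimal("299"), "included": 250_000, "per_credit": Decimal("0.001")},
    "Plus": {"base": Decimal("899"), "included": 1_000_000, "per_credit": Decimal("0.0009")},
}


def estimate_monthly_cost(plan, credits_used):
    """Return base fee plus an ASSUMED overage charge, in USD.

    Assumption (not confirmed by the profile): the per-credit price is
    billed only for credits beyond the plan's included monthly credits.
    """
    p = PLANS[plan]
    extra_credits = max(0, credits_used - p["included"])
    return p["base"] + p["per_credit"] * extra_credits
```

Under that assumption, `estimate_monthly_cost("Startup", 850_000)` comes to $899, the same as the Plus base fee, which suggests where the two plans would break even.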

Capabilities

Supported actions
extract_analyze, extract_article, extract_discussion, extract_event, extract_image, extract_job, extract_list, extract_product, extract_video, enhance_get, enhance_post, bulk_enhance, knowledge_graph_search, knowledge_graph_combine, create_crawl, manage_crawl_job, retrieve_crawl_job_data, search_crawl_job_data, create_bulkjob, download_bulkjob_results, poll_bulkjob_status, list_bulkjobs_for_token, download_bulkjob_coverage_report, delete_bulkjob, download_single_bulkjob_result, stop_bulkjob, create_or_update_custom_api, custom_api_rulesets, extract_with_custom_api, delete_custom_api, retrieve_custom_apis, extract_content_not_available_online, extract_custom_headers, extract_custom_javascript [10]
Input types
URL, HTML markup, plain text [11]
Output types
JSON [12]
Webhooks
Yes [13]
Sandbox / test mode
No [14]
SDK languages
Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust [15]
MCP server
Yes [16]

Trust & compliance

SOC 2
Unknown [17]
HIPAA
Unknown [18]
GDPR
Yes [19]
ISO 27001
Unknown [20]
PCI DSS
Unknown [21]
Published SLA
No [22]
Rate limits
Free: 5 Calls Per Minute; Startup: 5 Calls Per Second; Plus: 25 Calls Per Second; Enterprise: 25+ Calls Per Second [23]
Known restrictions
sublicense, resell, rent, lease, transfer, assign, time share, or otherwise commercially exploit or make the Service available to any third party; reverse engineer, decompile or disassemble any portion of the Service; bypass any robot exclusion headers or other measures we take to restrict access to the Site or Service; use the API or the Data in any manner that violates the rights of any person, including but not limited to intellectual property rights, rights of privacy or rights of publicity; IN NO EVENT WILL COMPANY'S LIABILITY TO YOU EXCEED $10 [24]
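
The documented rate limits can be respected on the client side with a simple throttle. The sketch below spaces calls evenly (for example, one call every 12 seconds on the Free plan). This is a conservative design choice, not something the profile prescribes; the service may permit bursts within a window. The class name `Throttle` and the `PLAN_LIMITS` table are hypothetical, and the Enterprise tier is omitted because its limit is listed only as "25+".

```python
import time

# Documented limits from this profile, as (calls, window in seconds).
PLAN_LIMITS = {
    "Free": (5, 60.0),
    "Startup": (5, 1.0),
    "Plus": (25, 1.0),
}


class Throttle:
    """Spaces calls evenly so a plan's documented rate is not exceeded."""

    def __init__(self, plan, clock=time.monotonic, sleep=time.sleep):
        calls, window = PLAN_LIMITS[plan]
        self.min_interval = window / calls
        self._clock = clock      # injectable for testing
        self._sleep = sleep      # injectable for testing
        self._next_allowed = None

    def wait(self):
        """Block until the next call is allowed; return seconds waited."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        self._next_allowed = now + self.min_interval
        return delay
```

Calling `Throttle("Free").wait()` before each request would keep a single client at or below 5 calls per minute.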

Developer surface

Docs rendering: static

Integration

API style
rest
Base URL
https://api.diffbot.com/v3
Version
v3
Versioning
url
Stability
ga
Auth methods
api_key
Idempotency keys
No
Error format
vendor-specific
Rate limit
5 / minute (Free plan)
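
To make the integration fields concrete, the sketch below builds a request URL from the base URL listed above, with the API key passed as a query parameter. The parameter names `token` and `url`, and the `article` path used for the `extract_article` action, are assumptions here and should be checked against docs.diffbot.com. The helper `build_extract_url` is hypothetical, and no network call is made.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.diffbot.com/v3"  # base URL as listed in this profile


def build_extract_url(endpoint, page_url, token):
    """Build a GET URL for an extraction call.

    ASSUMPTIONS: the API key goes in a `token` query parameter and the
    target page in a `url` query parameter; verify both in the vendor docs.
    """
    query = urlencode({"token": token, "url": page_url})
    return f"{BASE_URL}/{endpoint}?{query}"
```

For example, `build_extract_url("article", "https://example.com/post", "YOUR_TOKEN")` yields a URL that any HTTP client could then fetch; the response is expected to be JSON, per the output types listed above.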

SDKs

  • Python · repo · updated 2026-06 · 124
  • Go · repo · updated 2025-06 · 10
  • Ruby diffbot-ruby-client (rubygems) · repo · updated 2024-05 · 17
  • Java · repo · updated 2019-03 · 9
  • C# · repo · updated 2022-06 · 10
  • Node.js · repo · updated 2017-05 · 23
  • PHP · repo · updated 2016-03 · 9
  • C · repo · updated 2019-03 · 0
  • Scala · repo · updated 2019-03 · 4
  • Clojure · repo · updated 2019-03 · 6
  • Rust · repo · updated 2017-07 · 4

Adoption & maturity

Notable customers
Snapchat, AstraZeneca, Klarna, Indeed, NBC, BuzzFeed, Notion, Quora, SemRush, Sequoia Capital, Andreessen Horowitz, Opera, Doximity, FINRA, Factset, Meltwater, SmartNews, Vice, InMoment, Instapaper

Other Scraping & Crawling APIs

  • Oxylabs

    The best proxy service platform with 175M+ Residential and 2M Datacenter IP proxies. Extract public data from any website with ease!

    Hybrid · free tier · public pricing · self-serve

  • Firecrawl

    The API to search, scrape, and interact with the web at scale.

    Subscription · free tier · public pricing · self-serve

  • Bright Data Web Unlocker

    Automate your CAPTCHA solving while scraping websites. Our advanced technology rotates IPs, tackles user agents, and solves CAPTCHAs with ease.

    Hybrid · public pricing · self-serve

  • ScraperAPI

    Collect data from any public website with our web scraping API, without worrying about proxies, browsers, or CAPTCHA handling.

    Hybrid · free tier · public pricing · self-serve

  • ScrapingBee

    ScrapingBee is the best web scraping API that handles proxies and headless browsers for you — so you can focus on extracting the data you need.

    Subscription · public pricing · self-serve

  • Zyte API

    Effortlessly scrape data with our all-in-one web scraping API. Unblocking, browser rendering and web data extraction in one full-stack web scraper.

    Hybrid · public pricing · self-serve


References

Each field above carries a numbered source; the sources are listed here by number.

  1. Description: diffbot.com
  2. Pricing model: diffbot.com · devtune.ai
  3. Published pricing: diffbot.com · xpay.sh
  4. Free tier: diffbot.com
  5. Free tier details: diffbot.com · devtune.ai
  6. Self-serve signup: diffbot.com · docs.diffbot.com
  7. Requires sales call: diffbot.com · docs.diffbot.com
  8. Enterprise plan: diffbot.com · devtune.ai
  9. Minimum commitment: diffbot.com
  10. Supported actions: docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com · docs.diffbot.com
  11. Input types: docs.diffbot.com · docs.diffbot.com · docs.diffbot.com
  12. Output types: github.com · docs.diffbot.com
  13. Webhooks: blog.diffbot.com · diffbot.com
  14. Sandbox: docs.diffbot.com · docs.diffbot.com
  15. SDK languages: github.com · github.com · github.com · github.com · github.com · github.com · github.com · github.com · github.com
  16. MCP server: github.com
  17. SOC 2: diffbot.com
  18. HIPAA: diffbot.com
  19. GDPR: diffbot.com · aidoos.com
  20. ISO 27001: diffbot.com
  21. PCI DSS: diffbot.com
  22. Published SLA: status.diffbot.com · status.diffbot.com
  23. Rate limits: diffbot.com · docs.diffbot.com
  24. Known restrictions: diffbot.com

Change history

Every field change, who made it, and when — from our audited data pipeline and editors.

  1. 2026-06-08 Llms Txt Present: (none) → No
  2. 2026-06-08 Rendering: (none) → static
  3. 2026-06-08 Status Page URL: (none) → https://status.diffbot.com
  4. 2026-06-08 Docs URL: (none) → https://docs.diffbot.com
  5. 2026-06-07 Summary Md: (none) → Diffbot turns the web into structured data for AI, with products for market int…
  6. 2026-06-07 SDK Packages: Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust → Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust
  7. 2026-06-07 Supported Actions: set to extract_analyze, extract_article, extract_discussion, extract_event, extract_im…
  8. 2026-06-07 Supported Regions: set to (none)
  9. 2026-06-07 Supported Languages: set to (none)
  10. 2026-06-07 Input Types: set to URL, HTML markup, plain text
  11. 2026-06-07 Output Types: set to JSON
  12. 2026-06-07 Webhooks Supported: set to Yes
  13. 2026-06-07 Sandbox Available: set to No
  14. 2026-06-07 SDK Languages: set to Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust
  15. 2026-06-07 SDK Packages: set to Python, Go, Ruby, Java, C#, Node.js, PHP, C, Scala, Clojure, Rust
  16. 2026-06-07 MCP Server Available: set to Yes
  17. 2026-06-07 Pricing Model: set to hybrid
  18. 2026-06-07 Has Published Pricing: set to Yes
  19. 2026-06-07 Free Tier Available: set to Yes
  20. 2026-06-07 Free Tier Details: set to Free forever. 10,000 credits/month, 5 calls/min. No credit card required.
  21. 2026-06-07 Minimum Commitment: set to No contracts required
  22. 2026-06-07 Self Serve Signup: set to Yes
  23. 2026-06-07 Requires Sales Call: set to No
  24. 2026-06-07 Enterprise Plan Available: set to Yes
  25. 2026-06-07 GDPR: set to Yes
  26. 2026-06-07 SLA Published: set to No
  27. 2026-06-07 Data Retention Policy URL: set to https://www.diffbot.com/company/privacy/
  28. 2026-06-07 Documented Rate Limits: set to Free: 5 Calls Per Minute; Startup: 5 Calls Per Second; Plus: 25 Calls Per Secon…
  29. 2026-06-07 Rate Limit Requests: set to 5
  30. 2026-06-07 Rate Limit Window: set to minute
  31. 2026-06-07 Known Restrictions: set to sublicense, resell, rent, lease, transfer, assign, time share, or otherwise com…
  32. 2026-06-07 Auth Methods: set to api_key
  33. 2026-06-07 Auth Docs URL: set to https://docs.diffbot.com/reference/authentication
  34. 2026-06-07 API Style: set to rest
  35. 2026-06-07 Base URL: set to https://api.diffbot.com/v3
  36. 2026-06-07 API Version: set to v3
  37. 2026-06-07 Versioning Scheme: set to url
  38. 2026-06-07 Stability: set to ga
  39. 2026-06-07 MCP URL: set to https://github.com/diffbot/diffbot-mcp
  40. 2026-06-07 Idempotency Supported: set to No
  41. 2026-06-07 Error Format: set to vendor-specific
  42. 2026-06-07 Requires Verification: set to No
  43. 2026-06-07 Starting Price Usd: set to 299
  44. 2026-06-07 Price Basis: set to month
  45. 2026-06-07 Free Tier Limit: set to 10,000 credits/month
  46. 2026-06-07 Notable Customers: set to Snapchat, AstraZeneca, Klarna, Indeed, NBC, BuzzFeed, Notion, Quora, SemRush, S…
  47. 2026-06-07 Fields Not Found: set to supported_regions, soc2, iso_27001, hipaa, pci_dss, ga_date, launched_at
  48. 2026-06-07 Source Confidence: set to high
  49. 2026-06-07 Extractor: set to parallel:ultra
  50. 2026-06-07 Slug: set to diffbot