Crawlbase

"Web data infrastructure for developers, enterprises & LLMs" [1]

crawlbase.com · By Crawlbase · Last verified 2026-06-14 · Source confidence: high

Crawlbase is a web data infrastructure platform, launched in 2017, that provides scraping and crawling APIs for developers, enterprises, and AI/LLM training pipelines, with support for JavaScript rendering, CAPTCHA solving, anti-bot bypass, and structured data extraction. It draws on 140 million rotating residential proxies and 98 million datacenter proxies across 195 countries for geo-targeting. Pricing starts at $3 per 1,000 requests with a free tier of 1,000 requests requiring no credit card, and the REST API ships with SDKs for seven languages including Python, Node.js, and Go. Notable customers include Intel, Airbnb, Shopify, and Expedia, and a published SLA and GDPR compliance are in place.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box

Avoid if: You have strict compliance requirements

Scores

  • Agent friendliness: 75 / 100
  • Pricing transparency: 100 / 100
  • Setup speed: 85 / 100
  • Docs quality: 60 / 100
  • Procurement ease: 100 / 100
  • Trust readiness: 35 / 100

Scores are computed deterministically from this profile's published, sourced fields (pricing, compliance, capabilities, docs and developer-surface signals) - not from reviews or paid placement. Each axis is 0-100; an unknown signal scores 0 for that axis. Procurement ease is the inverse of buying friction (higher = easier to adopt).

Pricing & procurement

Pricing model
Hybrid (base + usage) [2]
Published pricing
Yes [3]
Free tier
Yes [4]
Free tier details
Cloud Storage Free plan: $0/month with 10,000 requests and 14-day retention (recurring tier). Smart AI Proxy Free Trial plan: $0/month with 5,000 credits and 5 threads (labeled 'Free Trial' on pricing page). Crawling API: 1,000 free requests on signup (one-time, not recurring). [5]
Self-serve signup
Yes [6]
Requires sales call
No
Enterprise plan
Yes [7]
Published prices
Plan | Item | Per | Amount
Crawling API - Standard (0-1k) | successful requests | 1,000 requests | $3
Crawling API - Standard (next 10k) | successful requests | 1,000 requests | $2
Crawling API - Standard (next 100k) | successful requests | 1,000 requests | $0.6
Crawling API - Standard (next 1M) | successful requests | 1,000 requests | $0.5
Crawling API - Standard (next 10M) | successful requests | 1,000 requests | $0.1
Crawling API - Standard (next 100M) | successful requests | 1,000 requests | $0.05
Crawling API - Standard (next 1B) | successful requests | 1,000 requests | $0.04
Crawling API - Standard (1B+) | successful requests | 1,000 requests | $0.02
Crawling API - Moderate (0-1k) | successful requests | 1,000 requests | $4.5
Crawling API - Moderate (next 10k) | successful requests | 1,000 requests | $3
Crawling API - Moderate (next 100k) | successful requests | 1,000 requests | $0.9
Crawling API - Moderate (next 1M) | successful requests | 1,000 requests | $0.75
Crawling API - Moderate (next 10M) | successful requests | 1,000 requests | $0.15
Crawling API - Moderate (next 100M) | successful requests | 1,000 requests | $0.07
Crawling API - Moderate (next 1B) | successful requests | 1,000 requests | $0.06
Crawling API - Moderate (1B+) | successful requests | 1,000 requests | $0.03
Crawling API - Complex (0-1k) | successful requests | 1,000 requests | $6
Crawling API - Complex (next 10k) | successful requests | 1,000 requests | $4
Crawling API - Complex (next 100k) | successful requests | 1,000 requests | $1.2
Crawling API - Complex (next 1M) | successful requests | 1,000 requests | $1
Crawling API - Complex (next 10M) | successful requests | 1,000 requests | $0.2
Crawling API - Complex (next 100M) | successful requests | 1,000 requests | $0.1
Crawling API - Complex (next 1B) | successful requests | 1,000 requests | $0.08
Crawling API - Complex (1B+) | successful requests | 1,000 requests | $0.04
Crawling API - LinkedIn | successful requests (public pages only) | 1,000 requests | $15
Smart AI Proxy - Free Trial | subscription | month | $0
Smart AI Proxy - Free Trial | credits included | 5,000 credits/month | $0
Smart AI Proxy - Starter | subscription | month | $149
Smart AI Proxy - Starter (annual) | subscription | year | $1599
Smart AI Proxy - Starter | credits included | 200,000 credits/month | -
Smart AI Proxy - Advanced | subscription | month | $229
Smart AI Proxy - Advanced (annual) | subscription | year | $2249
Smart AI Proxy - Advanced | credits included | 1,000,000 credits/month | -
Smart AI Proxy - Premium | subscription | month | $449
Smart AI Proxy - Premium (annual) | subscription | year | $4140
Smart AI Proxy - Premium | credits included | 3,000,000 credits/month | -
Cloud Storage - Free | subscription | month | $0
Cloud Storage - Free | requests included | 10,000 requests, 14-day retention | $0
Cloud Storage - Developer | subscription | month | $29
Cloud Storage - Developer | requests included | 100,000 requests, 30-day retention | -
Cloud Storage - Business | subscription | month | $249
Cloud Storage - Business | requests included | 1,000,000 requests, 30-day retention | -
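
The Crawling API price bands above can be turned into a rough monthly estimate. The sketch below uses the Standard-tier figures from the published prices and assumes the bands are graduated, meaning each price applies only to the requests that fall inside its band, and that the "1B+" band starts once all earlier bands are used up. Neither assumption is confirmed by this profile, and `estimate_cost` is a hypothetical helper, not part of any Crawlbase SDK.

```python
# Standard-tier Crawling API bands from this profile:
# (band size in successful requests, USD per 1,000 requests).
# ASSUMPTION: bands are graduated, so each price covers only its own band.
STANDARD_BANDS = [
    (1_000, 3.00),
    (10_000, 2.00),
    (100_000, 0.60),
    (1_000_000, 0.50),
    (10_000_000, 0.10),
    (100_000_000, 0.05),
    (1_000_000_000, 0.04),
    (None, 0.02),  # "1B+" band, treated here as having no upper bound
]


def estimate_cost(successful_requests, bands=STANDARD_BANDS):
    """Return an estimated USD cost for a number of successful requests."""
    remaining = successful_requests
    total = 0.0
    for size, price_per_1k in bands:
        if remaining <= 0:
            break
        # Take as many requests as fit in this band (all of them if unbounded).
        in_band = remaining if size is None else min(remaining, size)
        total += in_band / 1000 * price_per_1k
        remaining -= in_band
    return round(total, 2)
```

Under these assumptions, 50,000 successful requests would cost 1,000 at $3, 10,000 at $2, and 39,000 at $0.6 per thousand. Swapping in the Moderate or Complex figures only requires a different band list.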

Capabilities

  • JavaScript rendering
  • Residential proxies
  • Structured / AI extraction
  • Site crawling
  • SERP scraping
  • Anti-bot bypass
Supported actions
scrape, crawl, js_rendering, captcha_solving, anti_bot_bypass, screenshot, pdf_rendering, structured_data_extraction, prebuilt_scrapers, async_jobs, proxy_rotation, residential_proxies, datacenter_proxies, markdown_output, ai_extraction, cloud_storage, webhook_delivery, geo_targeting, sticky_sessions, tor_network [8]
Regions
140M rotating residential proxies, 98M datacenter proxies, 195 countries supported for geo-targeting (Crawling API), 45+ countries supported (Smart AI Proxy Premium), 23 countries supported for country routing parameter
Input types
target URL, country/geo code, device type, custom headers, cookies, CSS click selector, prebuilt scraper id, render JS flag, async flag, callback/webhook URL, output format flag, scroll flag, screenshot flag
Output types
raw HTML, rendered HTML, Markdown, JSON (structured), screenshot (JPEG), PDF, parsed fields (prebuilt scrapers), webhook delivery, cloud-stored pages
Webhooks
Yes [9]
Sandbox / test mode
No [10]
SDK languages
Python, Node.js, Ruby, PHP, Go, Java, C#/.NET [11]
MCP server
Yes [12]

Trust & compliance

SOC 2
None [13]
HIPAA
No [14]
GDPR
Yes [15]
ISO 27001
No [16]
PCI DSS
No [17]
Published SLA
Yes [18]
Rate limits
Default concurrency limit in Crawling API proxy mode is 20 requests per second (~1.7M req/day). Smart AI Proxy thread limits by plan: Free=5, Starter=20, Advanced=40, Premium=80. [19]
Known restrictions
  • LinkedIn crawling is limited to public pages only and is excluded from free credits.
  • POST/PUT methods: normal token only; accounts caught using POST for spam, credential stuffing, or other malicious traffic will be suspended.
  • Prohibited uses: spam, comment injection, fraudulent form submissions, scripted account creation, credential stuffing.
  • Scroll billing: the first 8 seconds count as 1 request; each additional 5 seconds adds 1 billed request.
  • Only successful requests are billed (pc_status=200 AND original_status in 200, 201, 204, 301, 302, 404, 410).
  • The Google SERP scraper is a prebuilt scraper within the Crawling API, not a standalone SERP API product. [20]
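
The two billing rules listed under known restrictions (scroll billing and pay-only-for-success) can be expressed as a pair of small checks. This is a minimal sketch: the function names are hypothetical, and it assumes that a partial 5-second scroll interval is billed as a full one, which the profile does not specify.

```python
import math

# original_status values the profile lists as billable when pc_status is 200.
BILLABLE_ORIGINAL_STATUSES = {200, 201, 204, 301, 302, 404, 410}


def is_billable(pc_status, original_status):
    """True when a response counts as a successful (billed) request."""
    return pc_status == 200 and original_status in BILLABLE_ORIGINAL_STATUSES


def scroll_billed_requests(scroll_seconds):
    """Billed request count for a scroll of the given duration."""
    # The first 8 seconds cost 1 request; each further 5 seconds adds 1.
    # ASSUMPTION: a partial 5-second interval is rounded up to a full one.
    if scroll_seconds <= 8:
        return 1
    return 1 + math.ceil((scroll_seconds - 8) / 5)
```

Note that a 404 or 410 from the target site still counts as billable under the listed statuses, so a client that wants to avoid paying for dead links would need to filter URLs before sending them.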

Developer surface

Docs rendering: static · markdown variants served · llms.txt present

Integration

API style
rest
Base URL
https://api.crawlbase.com/
Versioning
none
Stability
ga
Auth methods
api_key
Idempotency keys
No
Error format
vendor-specific
Rate limit
20 / concurrent
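
The integration fields above (REST, API-key auth, a single base URL, a limit of 20 concurrent requests) suggest a client shaped roughly like the sketch below. It only builds request URLs and caps in-flight calls; it does not perform any network I/O itself. The `token` and `url` query parameter names are assumptions not confirmed by this profile, and `build_request_url` and `fetch_all` are hypothetical helpers rather than SDK functions.

```python
import asyncio
from urllib.parse import urlencode

BASE_URL = "https://api.crawlbase.com/"  # base URL from this profile
MAX_CONCURRENCY = 20                     # rate-limit field from this profile


def build_request_url(token, target_url, **options):
    """Build a GET URL for one page fetch."""
    # ASSUMPTION: the API key is sent as `token` and the page as `url`.
    params = {"token": token, "url": target_url, **options}
    return BASE_URL + "?" + urlencode(params)


async def fetch_all(urls, fetch, limit=MAX_CONCURRENCY):
    """Await `fetch(url)` for every URL with at most `limit` calls in flight."""
    semaphore = asyncio.Semaphore(limit)

    async def guarded(url):
        # The semaphore blocks here once `limit` fetches are running.
        async with semaphore:
            return await fetch(url)

    # gather() preserves input order in its result list.
    return await asyncio.gather(*(guarded(u) for u in urls))
```

Here `fetch` is any coroutine function supplied by the caller, for example a thin wrapper around an async HTTP client. Keeping the cap in one place makes it easy to raise if a plan allows more threads.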

SDKs

  • Python: crawlbase
  • Node.js: crawlbase
  • Ruby: crawlbase
  • PHP: crawlbase/crawlbase
  • Go: github.com/crawlbase/crawlbase-go
  • Java: com.crawlbase:crawlbase-java
  • C#/.NET: CrawlbaseAPI

Adoption & maturity

Launched
2017-01-01
Notable customers
Intel, Pinterest, Airbnb, Honda, Amgen, Huawei, Shopify, Expedia, H&M, Nike, Oracle, Stanford University, LG

Other Scraping & Crawling APIs

  • ScrapFly

    "Scrape any site, drive any browser, power any agent. One API key."

    Subscription · public pricing · self-serve

  • Bright Data Web Scraper API

    "The most reliable Web Scraping API. Scrape any website with automatic proxy rotation, anti-bot bypass, and JavaScript rendering."

    Subscription · free tier · public pricing · self-serve

  • Oxylabs

    The best proxy service platform with 175M+ Residential and 2M Datacenter IP proxies. Extract public data from any website with ease!

    Hybrid · free tier · public pricing · self-serve

  • Apify

    Cloud platform for web scraping, browser automation, AI agents, and data for AI

    Hybrid · free tier · public pricing · self-serve

  • Diffbot

    Web Data for your AI

    Hybrid · free tier · public pricing · self-serve

  • Firecrawl

    The API to search, scrape, and interact with the web at scale.

    Subscription · free tier · public pricing · self-serve

References

Each field above carries a numbered source.

  1. Description: crawlbase.com
  2. Pricing model: crawlbase.com · crawlbase.com
  3. Published pricing: crawlbase.com
  4. Free tier: crawlbase.com · crawlbase.com
  5. Free tier details: crawlbase.com · crawlbase.com
  6. Self-serve signup: crawlbase.com
  7. Enterprise plan: crawlbase.com
  8. Supported actions: crawlbase.com · crawlbase.com
  9. Webhooks: crawlbase.com
  10. Sandbox: crawlbase.com
  11. SDK languages: crawlbase.com
  12. MCP server: crawlbase.com
  13. SOC 2: crawlbase.com · crawlbase.com
  14. HIPAA: crawlbase.com
  15. GDPR: crawlbase.com · crawlbase.com
  16. ISO 27001: crawlbase.com · crawlbase.com
  17. PCI DSS: crawlbase.com
  18. Published SLA: crawlbase.com · crawlbase.com
  19. Rate limits: crawlbase.com · crawlbase.com
  20. Known restrictions: crawlbase.com · crawlbase.com

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-15 Score Docs Quality: 25 → 60
  2. 2026-06-15 Score Agent Friendliness: 45 → 75
  3. 2026-06-14 Markdown Docs Served: (none) → Yes
  4. 2026-06-14 API Reference URL: (none) → https://crawlbase.com/docs/api-reference
  5. 2026-06-14 Robots Allows Agents: (none) → Yes
  6. 2026-06-14 Has Structured Data: (none) → Yes
  7. 2026-06-14 Markdown Docs URL: (none) → https://crawlbase.com/docs.md
  8. 2026-06-14 Capabilities: {} → {"serp":true,"crawl":true,"anti_bot":true,"js_rendering":true,"residential_prox…
  9. 2026-06-14 Summary Md: (none) → Crawlbase is a web data infrastructure platform, launched in 2017, that provide…
  10. 2026-06-14 Score Docs Quality: (none) → 25
  11. 2026-06-14 Score Procurement Friction: (none) → 100
  12. 2026-06-14 Score Trust Readiness: (none) → 35
  13. 2026-06-14 Best For: (none) → Prototypes and side projects - free to start, no sales call, AI agents and auto…
  14. 2026-06-14 Avoid If: (none) → You have strict compliance requirements
  15. 2026-06-14 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
  16. 2026-06-14 Score Setup Speed: (none) → 85
  17. 2026-06-14 Score Pricing Transparency: (none) → 100
  18. 2026-06-14 Score Agent Friendliness: (none) → 45
  19. 2026-06-14 Docs URL: (none) → https://crawlbase.com/docs
  20. 2026-06-14 Status Page URL: (none) → https://status.crawlbase.com
  21. 2026-06-14 Rendering: (none) → static
  22. 2026-06-14 Llms Txt URL: (none) → https://crawlbase.com/llms.txt
  23. 2026-06-14 Llms Txt Present: (none) → Yes
  24. 2026-06-14 Enterprise Plan Available: set to Yes
  25. 2026-06-14 SOC 2: set to none
  26. 2026-06-14 HIPAA: set to No
  27. 2026-06-14 GDPR: set to Yes
  28. 2026-06-14 PCI DSS: set to No
  29. 2026-06-14 SLA Published: set to Yes
  30. 2026-06-14 Data Retention Policy URL: set to https://crawlbase.com/privacy
  31. 2026-06-14 Documented Rate Limits: set to Default concurrency limit in Crawling API proxy mode is 20 requests per second …
  32. 2026-06-14 Rate Limit Requests: set to 20
  33. 2026-06-14 Rate Limit Window: set to concurrent
  34. 2026-06-14 Known Restrictions: set to LinkedIn crawling limited to public pages only and excluded from free credits, …
  35. 2026-06-14 Auth Methods: set to api_key
  36. 2026-06-14 Auth Docs URL: set to https://crawlbase.com/docs/authentication
  37. 2026-06-14 API Style: set to rest
  38. 2026-06-14 Base URL: set to https://api.crawlbase.com/
  39. 2026-06-14 Versioning Scheme: set to none
  40. 2026-06-14 Stability: set to ga
  41. 2026-06-14 MCP URL: set to https://crawlbase.com/mcp
  42. 2026-06-14 Quickstart URL: set to https://crawlbase.com/docs/quick-start
  43. 2026-06-14 Idempotency Supported: set to No
  44. 2026-06-14 Error Format: set to vendor-specific
  45. 2026-06-14 Requires Verification: set to No
  46. 2026-06-14 Starting Price Usd: set to 3
  47. 2026-06-14 Price Basis: set to 1,000 requests
  48. 2026-06-14 Free Tier Limit: set to 1,000 requests (no credit card required)
  49. 2026-06-14 ISO 27001: set to No
  50. 2026-06-14 Notable Customers: set to Intel, Pinterest, Airbnb, Honda, Amgen, Huawei, Shopify, Expedia, H&M, Nike, Or…

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/crawlbase \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/crawlbase/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'
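
For callers that prefer a script over curl, the field-correction call above can be assembled with Python's standard library. This is a sketch that only builds the request object; the endpoint and JSON keys are taken from the curl example, while `build_suggestion` and its parameter names are hypothetical.

```python
import json
from urllib.request import Request


def build_suggestion(field, value, source_url, excerpt, note):
    """Build (but do not send) a POST request proposing a field correction."""
    payload = {
        "value": value,
        "citations": [{"url": source_url, "excerpt": excerpt}],
        "note": note,
    }
    return Request(
        f"https://apio.sh/api/suggest/crawlbase/{field}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit it. Because submissions are rate-limited and reviewed before going live, a script should send them sparingly.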