Crawlbase

"Web data infrastructure for developers, enterprises & LLMs" [1]

Scraping & Crawling APIs

crawlbase.com · By Crawlbase · Agent JSON · Suggest an edit · Last verified 2026-06-14 · Source confidence: high

Crawlbase is a web data infrastructure platform, launched in 2017, that provides scraping and crawling APIs for developers, enterprises, and AI/LLM training pipelines, with support for JavaScript rendering, CAPTCHA solving, anti-bot bypass, and structured data extraction. It draws on 140 million rotating residential proxies and 98 million datacenter proxies across 195 countries for geo-targeting. Pricing starts at $3 per 1,000 requests with a free tier of 1,000 requests requiring no credit card, and the REST API ships with SDKs for seven languages including Python, Node.js, and Go. Notable customers include Intel, Airbnb, Shopify, and Expedia, and a published SLA and GDPR compliance are in place.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box

Avoid if: You have strict compliance requirements

Scores

75 / 100
Agent friendliness
100 / 100
Pricing transparency
85 / 100
Setup speed
60 / 100
Docs quality
100 / 100
Procurement ease
35 / 100
Trust readiness

Scores are computed deterministically from this profile's published, sourced fields (pricing, compliance, capabilities, docs and developer-surface signals) - not from reviews or paid placement. Each axis is 0-100; an unknown signal scores 0 for that axis. Procurement ease is the inverse of buying friction (higher = easier to adopt).

Pricing & procurement

Pricing model: Hybrid (base + usage) [2]
Published pricing: Yes [3]
Free tier: Yes [4]
Free tier details: Cloud Storage Free plan: $0/month with 10,000 requests and 14-day retention (recurring tier). Smart AI Proxy Free Trial plan: $0/month with 5,000 credits and 5 threads (labeled 'Free Trial' on pricing page). Crawling API: 1,000 free requests on signup (one-time, not recurring). [5]
Self-serve signup: Yes [6]
Requires sales call: No
Enterprise plan: Yes [7]

Published prices
Plan	Item	Per	Amount	Source
Crawling API - Standard (0-1k)	successful requests	1,000 requests	$3	source
Crawling API - Standard (next 10k)	successful requests	1,000 requests	$2	source
Crawling API - Standard (next 100k)	successful requests	1,000 requests	$0.6	source
Crawling API - Standard (next 1M)	successful requests	1,000 requests	$0.5	source
Crawling API - Standard (next 10M)	successful requests	1,000 requests	$0.1	source
Crawling API - Standard (next 100M)	successful requests	1,000 requests	$0.05	source
Crawling API - Standard (next 1B)	successful requests	1,000 requests	$0.04	source
Crawling API - Standard (1B+)	successful requests	1,000 requests	$0.02	source
Crawling API - Moderate (0-1k)	successful requests	1,000 requests	$4.5	source
Crawling API - Moderate (next 10k)	successful requests	1,000 requests	$3	source
Crawling API - Moderate (next 100k)	successful requests	1,000 requests	$0.9	source
Crawling API - Moderate (next 1M)	successful requests	1,000 requests	$0.75	source
Crawling API - Moderate (next 10M)	successful requests	1,000 requests	$0.15	source
Crawling API - Moderate (next 100M)	successful requests	1,000 requests	$0.07	source
Crawling API - Moderate (next 1B)	successful requests	1,000 requests	$0.06	source
Crawling API - Moderate (1B+)	successful requests	1,000 requests	$0.03	source
Crawling API - Complex (0-1k)	successful requests	1,000 requests	$6	source
Crawling API - Complex (next 10k)	successful requests	1,000 requests	$4	source
Crawling API - Complex (next 100k)	successful requests	1,000 requests	$1.2	source
Crawling API - Complex (next 1M)	successful requests	1,000 requests	$1	source
Crawling API - Complex (next 10M)	successful requests	1,000 requests	$0.2	source
Crawling API - Complex (next 100M)	successful requests	1,000 requests	$0.1	source
Crawling API - Complex (next 1B)	successful requests	1,000 requests	$0.08	source
Crawling API - Complex (1B+)	successful requests	1,000 requests	$0.04	source
Crawling API - LinkedIn	successful requests (public pages only)	1,000 requests	$15	source
Smart AI Proxy - Free Trial	subscription	month	$0	source
Smart AI Proxy - Free Trial	credits included	5,000 credits/month	$0	source
Smart AI Proxy - Starter	subscription	month	$149	source
Smart AI Proxy - Starter (annual)	subscription	year	$1599	source
Smart AI Proxy - Starter	credits included	200,000 credits/month	-	source
Smart AI Proxy - Advanced	subscription	month	$229	source
Smart AI Proxy - Advanced (annual)	subscription	year	$2249	source
Smart AI Proxy - Advanced	credits included	1,000,000 credits/month	-	source
Smart AI Proxy - Premium	subscription	month	$449	source
Smart AI Proxy - Premium (annual)	subscription	year	$4140	source
Smart AI Proxy - Premium	credits included	3,000,000 credits/month	-	source
Cloud Storage - Free	subscription	month	$0	source
Cloud Storage - Free	requests included	10,000 requests, 14-day retention	$0	source
Cloud Storage - Developer	subscription	month	$29	source
Cloud Storage - Developer	requests included	100,000 requests, 30-day retention	-	source
Cloud Storage - Business	subscription	month	$249	source
Cloud Storage - Business	requests included	1,000,000 requests, 30-day retention	-	source

Capabilities

JavaScript rendering
Residential proxies
Structured / AI extraction
Site crawling
SERP scraping
Anti-bot bypass

Supported actions: scrape, crawl, js_rendering, captcha_solving, anti_bot_bypass, screenshot, pdf_rendering, structured_data_extraction, prebuilt_scrapers, async_jobs, proxy_rotation, residential_proxies, datacenter_proxies, markdown_output, ai_extraction, cloud_storage, webhook_delivery, geo_targeting, sticky_sessions, tor_network [8]crawlbase.com/docs/crawling-api/parameters/“screenshot: Boolean for JPEG capture; pdf: Boolean to render as PDF file instead of HTML; format: html | json | md; scraper: Apply built-in structured data extractor; async: Boolean to queue request; callback: Webhook URL to receive crawl results; store: Boolean to persist crawled page in Cloud Storage”crawlbase.com/docs/crawling-api/“Solves Cloudflare, PerimeterX, DataDome, hCaptcha, and other common challenges”
Regions: 140M rotating residential proxies, 98M datacenter proxies, 195 countries supported for geo-targeting (Crawling API), 45+ countries supported (Smart AI Proxy Premium), 23 countries supported for country routing parameter
Input types: target URL, country/geo code, device type, custom headers, cookies, CSS click selector, prebuilt scraper id, render JS flag, async flag, callback/webhook URL, output format flag, scroll flag, screenshot flag
Output types: raw HTML, rendered HTML, Markdown, JSON (structured), screenshot (JPEG), PDF, parsed fields (prebuilt scrapers), webhook delivery, cloud-stored pages
Webhooks: Yes [9]
Sandbox / test mode: No [10]
SDK languages: Python, Node.js, Ruby, PHP, Go, Java, C#/.NET [11]
MCP server: Yes [12]

Trust & compliance

SOC 2: None [13]
HIPAA: No [14]
GDPR: Yes [15]
ISO 27001: No [16]
PCI DSS: No [17]
Published SLA: Yes [18]
Rate limits: Default concurrency limit in Crawling API proxy mode is 20 requests per second (~1.7M req/day). Smart AI Proxy thread limits by plan: Free=5, Starter=20, Advanced=40, Premium=80. [19]
Known restrictions: LinkedIn crawling limited to public pages only and excluded from free credits, POST/PUT methods: normal token only; accounts caught using POST for spam, credential stuffing, or other malicious traffic will be suspended, Prohibited uses: spam, comment injection, fraudulent form submissions, scripted account creation, credential stuffing, Scroll billing: first 8 seconds = 1 request; each additional 5 seconds adds 1 billed request, Only pay for successful requests (pc_status=200 AND original_status in 200,201,204,301,302,404,410), Google SERP scraper is a prebuilt scraper within the Crawling API, not a standalone SERP API product [20]crawlbase.com/pricing“LinkedIn charges fixed $15 per 1,000 requests with restrictions on public pages only; free credits excluded”crawlbase.com/docs/crawling-api/response-codes/“Scroll billing: First 8 seconds = 1 request; every additional 5 seconds adds 1 more billed request. Only billable when both conditions met: pc_status = 200 AND original_status is one of: 200, 201, 204, 301, 302, 404, or 410”

Developer surface

Docs rendering: static · markdown variants served · llms.txt present

Integration

API style: rest
Base URL: https://api.crawlbase.com/
Versioning: none
Stability: ga
Auth methods: api_key
Idempotency keys: No
Error format: vendor-specific
Rate limit: 20 / concurrent

SDKs

Python crawlbase · repo
Node.js crawlbase · repo
Ruby crawlbase · repo
PHP crawlbase/crawlbase · repo
Go github.com/crawlbase/crawlbase-go · repo
Java com.crawlbase:crawlbase-java · repo
C#/.NET CrawlbaseAPI · repo

Adoption & maturity

Launched: 2017-01-01
Notable customers: Intel, Pinterest, Airbnb, Honda, Amgen, Huawei, Shopify, Expedia, H&M, Nike, Oracle, Stanford University, LG

Other Scraping & Crawling APIs

ScrapFly
"Scrape any site, drive any browser, power any agent. One API key."
Subscription · public pricing · self-serve
Bright Data Web Scraper API
"The most reliable Web Scraping API. Scrape any website with automatic proxy rotation, anti-bot bypass, and JavaScript rendering."
Subscription · free tier · public pricing · self-serve
Oxylabs
The best proxy service platform with 175M+ Residential and 2M Datacenter IP proxies. Extract public data from any website with ease!
Hybrid · free tier · public pricing · self-serve
Apify
Cloud platform for web scraping, browser automation, AI agents, and data for AI
Hybrid · free tier · public pricing · self-serve
Diffbot
Web Data for your AI
Hybrid · free tier · public pricing · self-serve
Firecrawl
The API to search, scrape, and interact with the web at scale.
Subscription · free tier · public pricing · self-serve

Crawlbase alternatives · Crawlbase vs ScrapFly · All Scraping & Crawling APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

↑Description: crawlbase.com
↑Pricing model: crawlbase.com · crawlbase.com
↑Published pricing: crawlbase.com
↑Free tier: crawlbase.com · crawlbase.com
↑Free tier details: crawlbase.com · crawlbase.com
↑Self-serve signup: crawlbase.com
↑Enterprise plan: crawlbase.com
↑Supported actions: crawlbase.com · crawlbase.com
↑Webhooks: crawlbase.com
↑Sandbox: crawlbase.com
↑SDK languages: crawlbase.com
↑MCP server: crawlbase.com
↑SOC 2: crawlbase.com · crawlbase.com
↑HIPAA: crawlbase.com
↑GDPR: crawlbase.com · crawlbase.com
↑ISO 27001: crawlbase.com · crawlbase.com
↑PCI DSS: crawlbase.com
↑Published SLA: crawlbase.com · crawlbase.com
↑Rate limits: crawlbase.com · crawlbase.com
↑Known restrictions: crawlbase.com · crawlbase.com

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-15 Score Docs Quality: 25 → 60
2026-06-15 Score Agent Friendliness: 45 → 75
2026-06-14 Markdown Docs Served: (none) → Yes
2026-06-14 API Reference URL: (none) → https://crawlbase.com/docs/api-reference
2026-06-14 Robots Allows Agents: (none) → Yes
2026-06-14 Has Structured Data: (none) → Yes
2026-06-14 Markdown Docs URL: (none) → https://crawlbase.com/docs.md
2026-06-14 Capabilities: {} → {"serp":true,"crawl":true,"anti_bot":true,"js_rendering":true,"residential_prox…
2026-06-14 Summary Md: (none) → Crawlbase is a web data infrastructure platform, launched in 2017, that provide…
2026-06-14 Score Docs Quality: (none) → 25
2026-06-14 Score Procurement Friction: (none) → 100
2026-06-14 Score Trust Readiness: (none) → 35
2026-06-14 Best For: (none) → Prototypes and side projects - free to start, no sales call, AI agents and auto…
2026-06-14 Avoid If: (none) → You have strict compliance requirements
2026-06-14 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-14 Score Setup Speed: (none) → 85
2026-06-14 Score Pricing Transparency: (none) → 100
2026-06-14 Score Agent Friendliness: (none) → 45
2026-06-14 Docs URL: (none) → https://crawlbase.com/docs
2026-06-14 Status Page URL: (none) → https://status.crawlbase.com
2026-06-14 Rendering: (none) → static
2026-06-14 Llms Txt URL: (none) → https://crawlbase.com/llms.txt
2026-06-14 Llms Txt Present: (none) → Yes
2026-06-14 Enterprise Plan Available: set to Yes
2026-06-14 SOC 2: set to none
2026-06-14 HIPAA: set to No
2026-06-14 GDPR: set to Yes
2026-06-14 PCI DSS: set to No
2026-06-14 SLA Published: set to Yes
2026-06-14 Data Retention Policy URL: set to https://crawlbase.com/privacy
2026-06-14 Documented Rate Limits: set to Default concurrency limit in Crawling API proxy mode is 20 requests per second …
2026-06-14 Rate Limit Requests: set to 20
2026-06-14 Rate Limit Window: set to concurrent
2026-06-14 Known Restrictions: set to LinkedIn crawling limited to public pages only and excluded from free credits, …
2026-06-14 Auth Methods: set to api_key
2026-06-14 Auth Docs URL: set to https://crawlbase.com/docs/authentication
2026-06-14 API Style: set to rest
2026-06-14 Base URL: set to https://api.crawlbase.com/
2026-06-14 Versioning Scheme: set to none
2026-06-14 Stability: set to ga
2026-06-14 MCP URL: set to https://crawlbase.com/mcp
2026-06-14 Quickstart URL: set to https://crawlbase.com/docs/quick-start
2026-06-14 Idempotency Supported: set to No
2026-06-14 Error Format: set to vendor-specific
2026-06-14 Requires Verification: set to No
2026-06-14 Starting Price Usd: set to 3
2026-06-14 Price Basis: set to 1,000 requests
2026-06-14 Free Tier Limit: set to 1,000 requests (no credit card required)
2026-06-14 ISO 27001: set to No
2026-06-14 Notable Customers: set to Intel, Pinterest, Airbnb, Honda, Amgen, Huawei, Shopify, Expedia, H&M, Nike, Or…

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/crawlbase \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/crawlbase/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →

Best for / Avoid if

Scores

Pricing & procurement

Capabilities

Trust & compliance

Developer surface

Integration

Adoption & maturity

Other Scraping & Crawling APIs

ScrapFly

Bright Data Web Scraper API

Oxylabs

Apify

Diffbot

Firecrawl

References

Change history

Suggest an edit / leave a review