Requesty
"A unified AI gateway, LLM router, and OpenAI-compatible API for 400+ AI models" [1]
Requesty is a unified AI gateway and LLM router that provides a single OpenAI-compatible endpoint for accessing over 400 AI models, with automatic failover, load balancing, and prompt caching built in. It is aimed at teams and enterprises that want cost control and observability across multiple LLM providers, offering real-time cost and latency dashboards, RBAC, spend limits, and model whitelists. Pricing is usage-based at a 5% markup on base model costs for pay-as-you-go accounts, with a free tier capped at 200 requests per day. Customers include Shopify, Siemens, Pfizer, and PwC, and the service runs across EU, US, and APAC regions with GDPR compliance and a published SLA.
Best for / Avoid if
Best for: Prototypes and side projects - free to start, no sales call; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box
Pricing & procurement
- Pricing model
- Usage-based [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✓ Yes [4]
- Free tier details
- Free tier with access to all free models and 200 requests per day; no credit card required. Includes routing, caching, spend tracking, and EU data residency. [5]
- Self-serve signup
- ✓ Yes [6]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [7]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Free | Gateway usage fee | 200 requests/day on free models | $0 | source |
| Pay-As-You-Go | Gateway usage markup | % of spend on base model costs | 5% | source |
| Enterprise | Gateway usage fee | custom (contact sales) | - | source |
Capabilities
- Supported actions
- unified_chat_completions, openai_compatible_api, anthropic_compatible_api, model_routing, automatic_fallback, load_balancing, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, tracing, guardrails, pii_redaction, byo_provider_keys, rbac, model_whitelists, audit_logs, sso_integration, embeddings_routing, image_generation_routing, text_to_speech_routing, speech_to_text_routing, geo_based_routing, mcp_gateway, request_metadata_tagging, semantic_caching [8]
- Regions
- EU (Frankfurt), US (Virginia), APAC (Singapore) [9]
- Input types
- chat completions, embeddings, image generation, text to speech, speech to text, model listing [10]
- Output types
- OpenAI-compatible JSON response, streaming (SSE), Anthropic-compatible response
- Webhooks
- ✗ No
- Sandbox / test mode
- ✗ No
- SDK languages
- Python, Node.js, Node.js / TypeScript [11]
- MCP server
- ✓ Yes [12]
Trust & compliance
- SOC 2
- In progress [13]
- HIPAA
- – Unknown [14]
- GDPR
- ✓ Yes [15]
- ISO 27001
- – Unknown [16]
- PCI DSS
- – Unknown
- Published SLA
- ✓ Yes [17]
- Rate limits
- Free tier: 200 requests per day. Requesty does not impose its own rate limits on paid plans; per-key, per-team, and per-model limits are configurable by the user. [18]
- Known restrictions
- Free tier capped at 200 requests per day and limited to free models only, Pay-as-you-go tier adds a 5% markup on base model costs, Enterprise pricing requires contacting sales, SOC 2 Type II certification in progress (expected Q2 2026 per security page, contradicted by certified claim on enterprise page), No webhooks documented, No sandbox/test environment - free tier serves as the live environment
Developer surface
Integration
Adoption & maturity
- Launched
- 2023-01-01
- Notable customers
- Shopify, Amadeus, Chargebee, Contentful, Demandbase, Pfizer, PWC, Capgemini, Sage, Siemens, Appnovation
Other AI Gateway & LLM Routing APIs
Vercel AI Gateway
"AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."
Portkey
"Production Stack for Gen AI Builders"
Bifrost (Maxim AI)
"The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."
Cloudflare AI Gateway
"Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."
TrueFoundry AI Gateway
"A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."
Helicone
"Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.
References
- ↑Description: requesty.ai
- ↑Pricing model: requesty.ai · requesty.ai
- ↑Published pricing: requesty.ai
- ↑Free tier: requesty.ai
- ↑Free tier details: requesty.ai
- ↑Self-serve signup: requesty.ai
- ↑Enterprise plan: requesty.ai
- ↑Supported actions: docs.requesty.ai · requesty.ai
- ↑Regions: requesty.ai
- ↑Input types: docs.requesty.ai
- ↑SDK languages: docs.requesty.ai
- ↑MCP server: docs.requesty.ai
- ↑SOC 2: requesty.ai · requesty.ai
- ↑HIPAA: requesty.ai
- ↑GDPR: requesty.ai · requesty.ai
- ↑ISO 27001: requesty.ai
- ↑Published SLA: requesty.ai · requesty.ai
- ↑Rate limits: requesty.ai · docs.requesty.ai
Change history
- 2026-06-21 Capabilities: {} → {"guardrails":true,"observability":true,"spend_controls":true,"fallback_routing…
- 2026-06-21 Summary Md: (none) → Requesty is a unified AI gateway and LLM router that provides a single OpenAI-c…
- 2026-06-21 Score Setup Speed: (none) → 85
- 2026-06-21 Score Docs Quality: (none) → 25
- 2026-06-21 Score Procurement Friction: (none) → 90
- 2026-06-21 Score Trust Readiness: (none) → 43
- 2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, AI agents and auto…
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Score Agent Friendliness: (none) → 55
- 2026-06-21 Score Pricing Transparency: (none) → 75
- 2026-06-21 Llms Txt Present: (none) → Yes
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Has Structured Data: (none) → Yes
- 2026-06-21 Robots Allows Agents: (none) → No
- 2026-06-21 Status Page URL: (none) → https://status.requesty.ai
- 2026-06-21 Docs URL: (none) → https://docs.requesty.ai/quickstart
- 2026-06-21 Llms Txt URL: (none) → https://www.requesty.ai/llms.txt
- 2026-06-21 Has Published Pricing: set to Yes
- 2026-06-21 Free Tier Available: set to Yes
- 2026-06-21 Free Tier Details: set to Free tier with access to all free models and 200 requests per day; no credit ca…
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to in_progress
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 SLA Published: set to Yes
- 2026-06-21 SLA URL: set to https://www.requesty.ai/enterprise
- 2026-06-21 Data Retention Policy URL: set to https://www.requesty.ai/privacy
- 2026-06-21 Documented Rate Limits: set to Free tier: 200 requests per day. Requesty does not impose its own rate limits o…
- 2026-06-21 Known Restrictions: set to Free tier capped at 200 requests per day and limited to free models only, Pay-a…
- 2026-06-21 Auth Methods: set to api_key
- 2026-06-21 Auth Docs URL: set to https://docs.requesty.ai/quickstart
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://router.requesty.ai/v1
- 2026-06-21 API Version: set to v1
- 2026-06-21 Versioning Scheme: set to url
- 2026-06-21 Stability: set to ga
- 2026-06-21 MCP URL: set to https://router.requesty.ai/mcp
- 2026-06-21 Quickstart URL: set to https://docs.requesty.ai/quickstart
- 2026-06-21 Idempotency Supported: set to No
- 2026-06-21 Error Format: set to vendor-specific
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Slug: set to requesty
- 2026-06-21 Free Tier Limit: set to 200 requests/day on free models
- 2026-06-21 Launched At: set to 2023-01-01
- 2026-06-21 Notable Customers: set to Shopify, Amadeus, Chargebee, Contentful, Demandbase, Pfizer, PWC, Capgemini, Sa…
- 2026-06-21 Fields Not Found: set to hipaa, iso_27001, pci_dss, webhooks_supported, sandbox_available, documented_ra…
- 2026-06-21 Source Confidence: set to high
- 2026-06-21 Extractor: set to claude-subagent:sonnet
- 2026-06-21 Last Verified At: set to 2026-06-21T00:00:00.000Z
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/requesty \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/requesty/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'