Requesty

"A unified AI gateway, LLM router, and OpenAI-compatible API for 400+ AI models" [1]

AI Gateway & LLM Routing APIs

www.requesty.ai · By Requesty · Agent JSON · Suggest an edit · Last verified 2026-06-21 · Source confidence: high

Requesty is a unified AI gateway and LLM router that provides a single OpenAI-compatible endpoint for accessing over 400 AI models, with automatic failover, load balancing, and prompt caching built in. It is aimed at teams and enterprises that want cost control and observability across multiple LLM providers, offering real-time cost and latency dashboards, RBAC, spend limits, and model whitelists. Pricing is usage-based at a 5% markup on base model costs for pay-as-you-go accounts, with a free tier capped at 200 requests per day. Customers include Shopify, Siemens, Pfizer, and PwC, and the service runs across EU, US, and APAC regions with GDPR compliance and a published SLA.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; AI agents and automation - an agent-ready surface (MCP / llms.txt); Teams needing broad API coverage out of the box

Pricing & procurement

Pricing model: Usage-based [2]
Published pricing: Yes [3]
Free tier: Yes [4]
Free tier details: Free tier with access to all free models and 200 requests per day; no credit card required. Includes routing, caching, spend tracking, and EU data residency. [5]
Self-serve signup: Yes [6]
Requires sales call: No
Enterprise plan: Yes [7]

Published prices
Plan	Item	Per	Amount	Source
Free	Gateway usage fee	200 requests/day on free models	$0	source
Pay-As-You-Go	Gateway usage markup	% of spend on base model costs	5%	source
Enterprise	Gateway usage fee	custom (contact sales)	-	source

Capabilities

Semantic caching
Fallback / routing
Spend controls
Observability
Guardrails

Supported actions: unified_chat_completions, openai_compatible_api, anthropic_compatible_api, model_routing, automatic_fallback, load_balancing, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, tracing, guardrails, pii_redaction, byo_provider_keys, rbac, model_whitelists, audit_logs, sso_integration, embeddings_routing, image_generation_routing, text_to_speech_routing, speech_to_text_routing, geo_based_routing, mcp_gateway, request_metadata_tagging, semantic_caching [8]
Regions: EU (Frankfurt), US (Virginia), APAC (Singapore) [9]
Input types: chat completions, embeddings, image generation, text to speech, speech to text, model listing [10]docs.requesty.ai/api-reference/inference-apis“Chat - Generate text completions and conversations using OpenAI Chat Completions, Anthropic Messages, or the Responses API. Embeddings - Vector embeddings for semantic search and RAG applications. Text to Speech - Converting text into spoken audio. Speech to Text - Transcribing audio files to text. Images - Generate and edit images using DALL-E, Stable Diffusion, and other image models.”
Output types: OpenAI-compatible JSON response, streaming (SSE), Anthropic-compatible response
Webhooks: No
Sandbox / test mode: No
SDK languages: Python, Node.js, Node.js / TypeScript [11]
MCP server: Yes [12]

Trust & compliance

SOC 2: In progress [13]
HIPAA: Unknown [14]
GDPR: Yes [15]
ISO 27001: Unknown [16]
PCI DSS: Unknown
Published SLA: Yes [17]
Rate limits: Free tier: 200 requests per day. Requesty does not impose its own rate limits on paid plans; per-key, per-team, and per-model limits are configurable by the user. [18]
Known restrictions: Free tier capped at 200 requests per day and limited to free models only, Pay-as-you-go tier adds a 5% markup on base model costs, Enterprise pricing requires contacting sales, SOC 2 Type II certification in progress (expected Q2 2026 per security page, contradicted by certified claim on enterprise page), No webhooks documented, No sandbox/test environment - free tier serves as the live environment

Developer surface

Docs rendering: static · llms.txt present

Integration

API style: rest
Base URL: https://router.requesty.ai/v1
Version: v1
Versioning: url
Stability: ga
Auth methods: api_key
Idempotency keys: No
Error format: vendor-specific

SDKs

Python openai · repo
Node.js openai · repo
Node.js / TypeScript @requesty/ai-sdk · repo

Adoption & maturity

Launched: 2023-01-01
Notable customers: Shopify, Amadeus, Chargebee, Contentful, Demandbase, Pfizer, PWC, Capgemini, Sage, Siemens, Appnovation

Other AI Gateway & LLM Routing APIs

Vercel AI Gateway
"AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."
Usage · free tier · public pricing · self-serve
Portkey
"Production Stack for Gen AI Builders"
Hybrid · free tier · public pricing · self-serve
Bifrost (Maxim AI)
"The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."
Sales-led · free tier · public pricing · self-serve
Cloudflare AI Gateway
"Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."
Hybrid · free tier · public pricing · self-serve
TrueFoundry AI Gateway
"A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."
Hybrid · free tier · public pricing · self-serve
Helicone
"Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.
Hybrid · free tier · public pricing · self-serve

Requesty alternatives · Requesty vs Vercel AI Gateway · All AI Gateway & LLM Routing APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

↑Description: requesty.ai
↑Pricing model: requesty.ai · requesty.ai
↑Published pricing: requesty.ai
↑Free tier: requesty.ai
↑Free tier details: requesty.ai
↑Self-serve signup: requesty.ai
↑Enterprise plan: requesty.ai
↑Supported actions: docs.requesty.ai · requesty.ai
↑Regions: requesty.ai
↑Input types: docs.requesty.ai
↑SDK languages: docs.requesty.ai
↑MCP server: docs.requesty.ai
↑SOC 2: requesty.ai · requesty.ai
↑HIPAA: requesty.ai
↑GDPR: requesty.ai · requesty.ai
↑ISO 27001: requesty.ai
↑Published SLA: requesty.ai · requesty.ai
↑Rate limits: requesty.ai · docs.requesty.ai

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"guardrails":true,"observability":true,"spend_controls":true,"fallback_routing…
2026-06-21 Summary Md: (none) → Requesty is a unified AI gateway and LLM router that provides a single OpenAI-c…
2026-06-21 Score Setup Speed: (none) → 85
2026-06-21 Score Docs Quality: (none) → 25
2026-06-21 Score Procurement Friction: (none) → 90
2026-06-21 Score Trust Readiness: (none) → 43
2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, AI agents and auto…
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Score Agent Friendliness: (none) → 55
2026-06-21 Score Pricing Transparency: (none) → 75
2026-06-21 Llms Txt Present: (none) → Yes
2026-06-21 Rendering: (none) → static
2026-06-21 Has Structured Data: (none) → Yes
2026-06-21 Robots Allows Agents: (none) → No
2026-06-21 Status Page URL: (none) → https://status.requesty.ai
2026-06-21 Docs URL: (none) → https://docs.requesty.ai/quickstart
2026-06-21 Llms Txt URL: (none) → https://www.requesty.ai/llms.txt
2026-06-21 Has Published Pricing: set to Yes
2026-06-21 Free Tier Available: set to Yes
2026-06-21 Free Tier Details: set to Free tier with access to all free models and 200 requests per day; no credit ca…
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to No
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to in_progress
2026-06-21 GDPR: set to Yes
2026-06-21 SLA Published: set to Yes
2026-06-21 SLA URL: set to https://www.requesty.ai/enterprise
2026-06-21 Data Retention Policy URL: set to https://www.requesty.ai/privacy
2026-06-21 Documented Rate Limits: set to Free tier: 200 requests per day. Requesty does not impose its own rate limits o…
2026-06-21 Known Restrictions: set to Free tier capped at 200 requests per day and limited to free models only, Pay-a…
2026-06-21 Auth Methods: set to api_key
2026-06-21 Auth Docs URL: set to https://docs.requesty.ai/quickstart
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to https://router.requesty.ai/v1
2026-06-21 API Version: set to v1
2026-06-21 Versioning Scheme: set to url
2026-06-21 Stability: set to ga
2026-06-21 MCP URL: set to https://router.requesty.ai/mcp
2026-06-21 Quickstart URL: set to https://docs.requesty.ai/quickstart
2026-06-21 Idempotency Supported: set to No
2026-06-21 Error Format: set to vendor-specific
2026-06-21 Requires Verification: set to No
2026-06-21 Slug: set to requesty
2026-06-21 Free Tier Limit: set to 200 requests/day on free models
2026-06-21 Launched At: set to 2023-01-01
2026-06-21 Notable Customers: set to Shopify, Amadeus, Chargebee, Contentful, Demandbase, Pfizer, PWC, Capgemini, Sa…
2026-06-21 Fields Not Found: set to hipaa, iso_27001, pci_dss, webhooks_supported, sandbox_available, documented_ra…
2026-06-21 Source Confidence: set to high
2026-06-21 Extractor: set to claude-subagent:sonnet
2026-06-21 Last Verified At: set to 2026-06-21T00:00:00.000Z

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/requesty \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/requesty/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →

Best for / Avoid if

Pricing & procurement

Capabilities

Trust & compliance

Developer surface

Integration

Adoption & maturity

Other AI Gateway & LLM Routing APIs

Vercel AI Gateway

Portkey

Bifrost (Maxim AI)

Cloudflare AI Gateway

TrueFoundry AI Gateway

Helicone

References

Change history

Suggest an edit / leave a review