LLM Gateway
"Route requests across 280+ models, track costs in real-time, and switch providers without changing your code." [1]
LLM Gateway is a unified routing layer that sends requests across 280+ models through a single OpenAI-compatible endpoint, handling automatic failover, load balancing, response caching, and per-key spend limits without requiring code changes when switching providers. It targets teams managing multi-provider LLM costs, offering bring-your-own-keys with no markup plus a 5% gateway fee on credits for the hosted service, or a self-hostable AGPLv3 open-source build for teams that want full control. Pricing is usage-based with a free tier covering three rate-limited models, and enterprise plans add SSO, audit logs, and custom rate limits. The service is SOC 2 Type 2 certified, GDPR compliant, and counts Samsung and Harvard among its customers.
Best for / Avoid if
Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt)
Pricing & procurement
- Pricing model
- Usage-based [2]
- Published pricing
- ✓ Yes
- Free tier
- ✓ Yes [3]
- Free tier details
- Free plan at $0 forever: access to 280+ models across 35+ providers via BYOK, 3 free (rate-limited) models, 30-day activity log retention, chat and API access. Free models rate-limited to 5 requests per 10 minutes (upgrades to 20 req/min once any credits are added). Open-source self-host also available under AGPLv3. [4]
- Self-serve signup
- ✓ Yes [5]
- Requires sales call
- ✗ No
- Enterprise plan
- ✓ Yes [6]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Free | Gateway usage fee | month | $0 | source |
| Paid (BYOK) | Gateway usage fee with own provider keys | % of spend | 0% + $0 | source |
| Paid (LLM Gateway credits) | Gateway platform fee on credit usage | % of credit spend | 5% | source |
| Paid | International card surcharge | % of credit top-up (non-US cards only) | 1.5% | source |
| DevPass Lite | Chat/coding subscription (Lite) | month | $29 | source |
| DevPass Pro | Chat/coding subscription (Pro) | month | $79 | source |
| DevPass Max | Chat/coding subscription (Max) | month | $179 | source |
| Enterprise | Gateway usage — custom pricing | custom | - | source |
| Self-hosted (open source, AGPLv3) | $0 | source |
Capabilities
- Supported actions
- unified_chat_completions, openai_compatible_api, model_routing, automatic_fallback, load_balancing, response_caching, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, request_analytics, cost_tracking, guardrails, pii_redaction, virtual_keys, byo_provider_keys, embeddings, image_generation, audio_speech_synthesis, video_generation, content_moderation, custom_providers, provider_compliance_policies, ip_restrictions, sso_saml_oidc, audit_logs, mcp_server [7]
- Regions
- US, EU, APAC [8]
- Input types
- chat completions, embeddings, image generation, audio speech synthesis, video generation, content moderation
- Output types
- streaming (SSE), JSON, OpenAI-compatible response
- Webhooks
- ✓ Yes [9]
- Sandbox / test mode
- ✗ No [10]
- SDK languages
- Node.js / TypeScript (Vercel AI SDK), Python / any (OpenAI SDK drop-in)
- MCP server
- ✓ Yes [11]
Trust & compliance
- SOC 2
- SOC 2 Type II [12]
- HIPAA
- ✗ No [13]
- GDPR
- ✓ Yes [14]
- ISO 27001
- ✗ No [15]
- PCI DSS
- ✗ No [16]
- Published SLA
- ✓ Yes [17]
- Rate limits
- Free models (no credits): 5 requests per 10 minutes. Free models (with any credits added): 20 requests per minute. Paid models: no rate limiting. Enterprise: custom rate limits. [18]
- Known restrictions
- 5% gateway fee applied to credit usage on paid plans, Non-US cards incur additional 1.5% international fee, IP restriction rules (CIDR-based) are Enterprise-only, No formal SLA for free/standard PAYG accounts - SLA applies only if expressly stated in a separate written agreement, Self-hosted AGPLv3 open-source version lacks enterprise features (SSO, audit logs, spend controls, guardrails), Image disk saving only works on self-hosted instances with UPLOAD_DIR configured, HIPAA and ISO 27001 are routing-policy features (restrict to compliant providers), not LLM Gateway's own certifications [19]
Developer surface
Integration
Adoption & maturity
- Launched
- 2025-05-01
- GA
- 2025-05-01
- Notable customers
- Samsung, Harvard, Coloop.ai, FieldKo
Other AI Gateway & LLM Routing APIs
Vercel AI Gateway
"AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."
Portkey
"Production Stack for Gen AI Builders"
Bifrost (Maxim AI)
"The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."
Cloudflare AI Gateway
"Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."
TrueFoundry AI Gateway
"A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."
Helicone
"Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.
References
- ↑Description: llmgateway.io
- ↑Pricing model: llmgateway.io · llmgateway.io
- ↑Free tier: llmgateway.io · llmgateway.io
- ↑Free tier details: llmgateway.io · docs.llmgateway.io
- ↑Self-serve signup: llmgateway.io
- ↑Enterprise plan: llmgateway.io
- ↑Supported actions: docs.llmgateway.io · llmgateway.io
- ↑Regions: llmgateway.io
- ↑Webhooks: llmgateway.io
- ↑Sandbox: llmgateway.io
- ↑MCP server: llmgateway.io · docs.llmgateway.io
- ↑SOC 2: llmgateway.io · security.llmgateway.io
- ↑HIPAA: security.llmgateway.io · llmgateway.io
- ↑GDPR: llmgateway.io · security.llmgateway.io
- ↑ISO 27001: security.llmgateway.io · llmgateway.io
- ↑PCI DSS: security.llmgateway.io
- ↑Published SLA: llmgateway.io · llmgateway.io
- ↑Rate limits: docs.llmgateway.io
- ↑Known restrictions: llmgateway.io · llmgateway.io
Change history
- 2026-06-21 Capabilities: {} → {"guardrails":true,"self_hosted":true,"observability":true,"spend_controls":tru…
- 2026-06-21 Summary Md: (none) → LLM Gateway is a unified routing layer that sends requests across 280+ models t…
- 2026-06-21 Score Agent Friendliness: (none) → 65
- 2026-06-21 Score Pricing Transparency: (none) → 75
- 2026-06-21 Score Setup Speed: (none) → 80
- 2026-06-21 Score Docs Quality: (none) → 35
- 2026-06-21 Score Procurement Friction: (none) → 90
- 2026-06-21 Score Trust Readiness: (none) → 60
- 2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Rendering: (none) → static
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 Status Page URL: (none) → https://status.llmgateway.io
- 2026-06-21 Changelog URL: (none) → https://llmgateway.io/changelog
- 2026-06-21 Has Structured Data: (none) → Yes
- 2026-06-21 Docs URL: (none) → https://docs.llmgateway.io/
- 2026-06-21 Llms Txt Present: (none) → Yes
- 2026-06-21 Llms Txt URL: (none) → https://llmgateway.io/llms.txt
- 2026-06-21 Has Published Pricing: set to Yes
- 2026-06-21 Free Tier Available: set to Yes
- 2026-06-21 Free Tier Details: set to Free plan at $0 forever: access to 280+ models across 35+ providers via BYOK, 3…
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to No
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_2
- 2026-06-21 HIPAA: set to No
- 2026-06-21 GDPR: set to Yes
- 2026-06-21 ISO 27001: set to No
- 2026-06-21 PCI DSS: set to No
- 2026-06-21 SLA Published: set to Yes
- 2026-06-21 SLA URL: set to https://status.llmgateway.io/
- 2026-06-21 Status: set to published
- 2026-06-21 Data Retention Policy URL: set to https://llmgateway.io/legal/privacy
- 2026-06-21 Documented Rate Limits: set to Free models (no credits): 5 requests per 10 minutes. Free models (with any cred…
- 2026-06-21 Rate Limit Requests: set to 5
- 2026-06-21 Rate Limit Window: set to minute
- 2026-06-21 Known Restrictions: set to 5% gateway fee applied to credit usage on paid plans, Non-US cards incur additi…
- 2026-06-21 Auth Methods: set to api_key
- 2026-06-21 Auth Docs URL: set to https://docs.llmgateway.io/features/api-keys
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to https://api.llmgateway.io/v1
- 2026-06-21 API Version: set to v1
- 2026-06-21 Versioning Scheme: set to url
- 2026-06-21 Stability: set to ga
- 2026-06-21 MCP URL: set to https://api.llmgateway.io/mcp
- 2026-06-21 Quickstart URL: set to https://docs.llmgateway.io/quick-start
- 2026-06-21 Error Format: set to openai-compatible
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Price Basis: set to % of spend
- 2026-06-21 Free Tier Limit: set to 3 free models (rate-limited); open-source self-host available (AGPLv3)
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/llmgateway \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/llmgateway/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'