LLM Gateway

"Route requests across 280+ models, track costs in real-time, and switch providers without changing your code." [1]

AI Gateway & LLM Routing APIs

llmgateway.io · By LLM Gateway · Agent JSON · Suggest an edit · Last verified 2026-06-21 · Source confidence: high

LLM Gateway is a unified routing layer that sends requests across 280+ models through a single OpenAI-compatible endpoint, handling automatic failover, load balancing, response caching, and per-key spend limits without requiring code changes when switching providers. It targets teams managing multi-provider LLM costs, offering bring-your-own-keys with no markup plus a 5% gateway fee on credits for the hosted service, or a self-hostable AGPLv3 open-source build for teams that want full control. Pricing is usage-based with a free tier covering three rate-limited models, and enterprise plans add SSO, audit logs, and custom rate limits. The service is SOC 2 Type 2 certified, GDPR compliant, and counts Samsung and Harvard among its customers.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt)

Pricing & procurement

Pricing model: Usage-based [2]
Published pricing: Yes
Free tier: Yes [3]
Free tier details: Free plan at $0 forever: access to 280+ models across 35+ providers via BYOK, 3 free (rate-limited) models, 30-day activity log retention, chat and API access. Free models rate-limited to 5 requests per 10 minutes (upgrades to 20 req/min once any credits are added). Open-source self-host also available under AGPLv3. [4]
Self-serve signup: Yes [5]
Requires sales call: No
Enterprise plan: Yes [6]

Published prices
Plan	Item	Per	Amount	Source
Free	Gateway usage fee	month	$0	source
Paid (BYOK)	Gateway usage fee with own provider keys	% of spend	0% + $0	source
Paid (LLM Gateway credits)	Gateway platform fee on credit usage	% of credit spend	5%	source
Paid	International card surcharge	% of credit top-up (non-US cards only)	1.5%	source
DevPass Lite	Chat/coding subscription (Lite)	month	$29	source
DevPass Pro	Chat/coding subscription (Pro)	month	$79	source
DevPass Max	Chat/coding subscription (Max)	month	$179	source
Enterprise	Gateway usage — custom pricing	custom	-	source
	Self-hosted (open source, AGPLv3)		$0	source

Capabilities

Fallback / routing
Spend controls
Observability
Guardrails
Self-hosted

Supported actions: unified_chat_completions, openai_compatible_api, model_routing, automatic_fallback, load_balancing, response_caching, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, request_analytics, cost_tracking, guardrails, pii_redaction, virtual_keys, byo_provider_keys, embeddings, image_generation, audio_speech_synthesis, video_generation, content_moderation, custom_providers, provider_compliance_policies, ip_restrictions, sso_saml_oidc, audit_logs, mcp_server [7]
Regions: US, EU, APAC [8]
Input types: chat completions, embeddings, image generation, audio speech synthesis, video generation, content moderation
Output types: streaming (SSE), JSON, OpenAI-compatible response
Webhooks: Yes [9]
Sandbox / test mode: No [10]
SDK languages: Node.js / TypeScript (Vercel AI SDK), Python / any (OpenAI SDK drop-in)
MCP server: Yes [11]

Trust & compliance

SOC 2: SOC 2 Type II [12]
HIPAA: No [13]
GDPR: Yes [14]
ISO 27001: No [15]
PCI DSS: No [16]
Published SLA: Yes [17]
Rate limits: Free models (no credits): 5 requests per 10 minutes. Free models (with any credits added): 20 requests per minute. Paid models: no rate limiting. Enterprise: custom rate limits. [18]
Known restrictions: 5% gateway fee applied to credit usage on paid plans, Non-US cards incur additional 1.5% international fee, IP restriction rules (CIDR-based) are Enterprise-only, No formal SLA for free/standard PAYG accounts - SLA applies only if expressly stated in a separate written agreement, Self-hosted AGPLv3 open-source version lacks enterprise features (SSO, audit logs, spend controls, guardrails), Image disk saving only works on self-hosted instances with UPLOAD_DIR configured, HIPAA and ISO 27001 are routing-policy features (restrict to compliant providers), not LLM Gateway's own certifications [19]

Developer surface

Docs rendering: static · llms.txt present

Integration

API style: rest
Base URL: https://api.llmgateway.io/v1
Version: v1
Versioning: url
Stability: ga
Auth methods: api_key
Error format: openai-compatible
Rate limit: 5 / minute

SDKs

Node.js / TypeScript (Vercel AI SDK) @llmgateway/ai-sdk-provider · repo
Python / any (OpenAI SDK drop-in) openai · repo

Adoption & maturity

Launched: 2025-05-01
GA: 2025-05-01
Notable customers: Samsung, Harvard, Coloop.ai, FieldKo

Other AI Gateway & LLM Routing APIs

Vercel AI Gateway
"AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."
Usage · free tier · public pricing · self-serve
Portkey
"Production Stack for Gen AI Builders"
Hybrid · free tier · public pricing · self-serve
Bifrost (Maxim AI)
"The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."
Sales-led · free tier · public pricing · self-serve
Cloudflare AI Gateway
"Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."
Hybrid · free tier · public pricing · self-serve
TrueFoundry AI Gateway
"A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."
Hybrid · free tier · public pricing · self-serve
Helicone
"Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.
Hybrid · free tier · public pricing · self-serve

LLM Gateway alternatives · LLM Gateway vs Vercel AI Gateway · All AI Gateway & LLM Routing APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

↑Description: llmgateway.io
↑Pricing model: llmgateway.io · llmgateway.io
↑Free tier: llmgateway.io · llmgateway.io
↑Free tier details: llmgateway.io · docs.llmgateway.io
↑Self-serve signup: llmgateway.io
↑Enterprise plan: llmgateway.io
↑Supported actions: docs.llmgateway.io · llmgateway.io
↑Regions: llmgateway.io
↑Webhooks: llmgateway.io
↑Sandbox: llmgateway.io
↑MCP server: llmgateway.io · docs.llmgateway.io
↑SOC 2: llmgateway.io · security.llmgateway.io
↑HIPAA: security.llmgateway.io · llmgateway.io
↑GDPR: llmgateway.io · security.llmgateway.io
↑ISO 27001: security.llmgateway.io · llmgateway.io
↑PCI DSS: security.llmgateway.io
↑Published SLA: llmgateway.io · llmgateway.io
↑Rate limits: docs.llmgateway.io
↑Known restrictions: llmgateway.io · llmgateway.io

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"guardrails":true,"self_hosted":true,"observability":true,"spend_controls":tru…
2026-06-21 Summary Md: (none) → LLM Gateway is a unified routing layer that sends requests across 280+ models t…
2026-06-21 Score Agent Friendliness: (none) → 65
2026-06-21 Score Pricing Transparency: (none) → 75
2026-06-21 Score Setup Speed: (none) → 80
2026-06-21 Score Docs Quality: (none) → 35
2026-06-21 Score Procurement Friction: (none) → 90
2026-06-21 Score Trust Readiness: (none) → 60
2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Rendering: (none) → static
2026-06-21 Robots Allows Agents: (none) → Yes
2026-06-21 Status Page URL: (none) → https://status.llmgateway.io
2026-06-21 Changelog URL: (none) → https://llmgateway.io/changelog
2026-06-21 Has Structured Data: (none) → Yes
2026-06-21 Docs URL: (none) → https://docs.llmgateway.io/
2026-06-21 Llms Txt Present: (none) → Yes
2026-06-21 Llms Txt URL: (none) → https://llmgateway.io/llms.txt
2026-06-21 Has Published Pricing: set to Yes
2026-06-21 Free Tier Available: set to Yes
2026-06-21 Free Tier Details: set to Free plan at $0 forever: access to 280+ models across 35+ providers via BYOK, 3…
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to No
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to type_2
2026-06-21 HIPAA: set to No
2026-06-21 GDPR: set to Yes
2026-06-21 ISO 27001: set to No
2026-06-21 PCI DSS: set to No
2026-06-21 SLA Published: set to Yes
2026-06-21 SLA URL: set to https://status.llmgateway.io/
2026-06-21 Status: set to published
2026-06-21 Data Retention Policy URL: set to https://llmgateway.io/legal/privacy
2026-06-21 Documented Rate Limits: set to Free models (no credits): 5 requests per 10 minutes. Free models (with any cred…
2026-06-21 Rate Limit Requests: set to 5
2026-06-21 Rate Limit Window: set to minute
2026-06-21 Known Restrictions: set to 5% gateway fee applied to credit usage on paid plans, Non-US cards incur additi…
2026-06-21 Auth Methods: set to api_key
2026-06-21 Auth Docs URL: set to https://docs.llmgateway.io/features/api-keys
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to https://api.llmgateway.io/v1
2026-06-21 API Version: set to v1
2026-06-21 Versioning Scheme: set to url
2026-06-21 Stability: set to ga
2026-06-21 MCP URL: set to https://api.llmgateway.io/mcp
2026-06-21 Quickstart URL: set to https://docs.llmgateway.io/quick-start
2026-06-21 Error Format: set to openai-compatible
2026-06-21 Requires Verification: set to No
2026-06-21 Price Basis: set to % of spend
2026-06-21 Free Tier Limit: set to 3 free models (rate-limited); open-source self-host available (AGPLv3)

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/llmgateway \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/llmgateway/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →

Best for / Avoid if

Pricing & procurement

Capabilities

Trust & compliance

Developer surface

Integration

Adoption & maturity

Other AI Gateway & LLM Routing APIs

Vercel AI Gateway

Portkey

Bifrost (Maxim AI)

Cloudflare AI Gateway

TrueFoundry AI Gateway

Helicone

References

Change history

Suggest an edit / leave a review