LiteLLM

"LLM Gateway (OpenAI Proxy) to manage authentication, loadbalancing, and spend tracking across 100+ LLMs. All in the OpenAI format." [1]

AI Gateway & LLM Routing APIs

www.litellm.ai · By LiteLLM · Agent JSON · Suggest an edit · Last verified 2026-06-21 · Source confidence: high

LiteLLM is an open-source LLM gateway that provides a unified OpenAI-compatible API across 100+ model providers, handling load balancing, automatic failover, semantic caching, rate limiting, and spend tracking in one proxy layer. It is self-hosted rather than offered as a managed SaaS, making it suited for teams that need centralized governance over multiple LLM deployments without vendor lock-in. The free tier covers self-hosting with SSO available up to five users; enterprise licensing is required for features such as audit logs, SCIM, per-key guardrails, and batch cost tracking. LiteLLM holds SOC 2 Type I and ISO 27001 certifications, with SDKs for Python and Node.js and support for API key, JWT, and OAuth2 authentication.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; Teams needing broad API coverage out of the box

Pricing & procurement

Pricing model: Contact sales [2]
Published pricing: Yes [3]
Free tier: Yes [4]
Free tier details: Open-source self-hosted version is free ($0) with 100+ LLM integrations, load balancing, virtual keys, budgets, teams, and LLM guardrails. Available on GitHub with 51K+ stars and 240M+ Docker pulls. No data or telemetry stored on LiteLLM servers when self-hosted.
Self-serve signup: Yes [5]
Requires sales call: Yes [6]
Enterprise plan: Yes [7]

Published prices
Plan	Item	Per	Amount	Source
Open Source	Self-hosted open-source gateway	month	$0	source
Enterprise	Enterprise license (cloud or self-hosted)	custom quote	-	source

Capabilities

Semantic caching
Fallback / routing
Spend controls
Observability
Guardrails
Self-hosted

Supported actions: unified_chat_completions, openai_compatible_api, model_routing, automatic_fallback, load_balancing, semantic_caching, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, tracing, guardrails, pii_redaction, virtual_keys, byo_provider_keys, prompt_management, embeddings_routing, image_generation_routing, audio_transcription_routing, text_to_speech_routing, batch_api, tag_based_routing, latency_based_routing, cost_based_routing, usage_based_routing, context_window_fallback, content_policy_fallback, key_rotation, sso_oidc_saml, audit_logs, secret_manager_integration, prometheus_metrics, opentelemetry_tracing, multi_tenant_organizations, mcp_gateway [8]
Input types: chat completions, text completions, embeddings, image generation, audio transcription, text to speech, batch [9]
Output types: streaming (SSE), OpenAI-compatible response, JSON
Webhooks: Yes [10]
Sandbox / test mode: No
SDK languages: Python, Node.js [11]
MCP server: No [12]

Trust & compliance

SOC 2: SOC 2 Type I [13]
HIPAA: No [14]
GDPR: No [15]
ISO 27001: Yes [16]
PCI DSS: No [17]
Published SLA: Yes [18]
Known restrictions: Enterprise features (SSO, audit logs, SCIM, per-key guardrails, batch cost tracking) require Enterprise license, SOC 2 Type I and ISO 27001 reports available only on Enterprise plan upon request; SOC 2 Type II and ISO 27001 certifications undergoing recertification via Vanta as of March 2026 after prior Delve-issued certs were questioned, SSO free for up to 5 users; enterprise licensing required beyond that, PostgreSQL required for virtual keys and spend tracking in self-hosted deployments, Production minimum: 4 CPU cores and 8 GB RAM for self-hosted, No cloud-hosted SaaS gateway - product is self-hosted or customer-deployed, Enterprise pricing is contact-sales only; no public per-seat or per-request gateway fees [19]

Developer surface

Docs rendering: client_rendered

Integration

API style: rest
Base URL: http://0.0.0.0:4000
Versioning: none
Stability: ga
Auth methods: api_key, jwt, oauth2
Idempotency keys: No
Error format: openai-compatible

SDKs

Python litellm · repo
Python litellm[proxy] · repo
Node.js litellm · repo

Adoption & maturity

Launched: 2023-01-01
Notable customers: Netflix, Lemonade

Other AI Gateway & LLM Routing APIs

Vercel AI Gateway
"AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."
Usage · free tier · public pricing · self-serve
Portkey
"Production Stack for Gen AI Builders"
Hybrid · free tier · public pricing · self-serve
Bifrost (Maxim AI)
"The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."
Sales-led · free tier · public pricing · self-serve
Cloudflare AI Gateway
"Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."
Hybrid · free tier · public pricing · self-serve
TrueFoundry AI Gateway
"A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."
Hybrid · free tier · public pricing · self-serve
Helicone
"Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.
Hybrid · free tier · public pricing · self-serve

LiteLLM alternatives · LiteLLM vs Vercel AI Gateway · All AI Gateway & LLM Routing APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

↑Description: litellm.ai
↑Pricing model: docs.litellm.ai · litellm.ai
↑Published pricing: litellm.ai
↑Free tier: litellm.ai · docs.litellm.ai
↑Self-serve signup: github.com
↑Requires sales call: docs.litellm.ai · litellm.ai
↑Enterprise plan: docs.litellm.ai · litellm.ai
↑Supported actions: docs.litellm.ai · docs.litellm.ai
↑Input types: docs.litellm.ai
↑Webhooks: docs.litellm.ai
↑SDK languages: github.com
↑MCP server: docs.litellm.ai
↑SOC 2: docs.litellm.ai · docs.litellm.ai
↑HIPAA: docs.litellm.ai
↑GDPR: docs.litellm.ai
↑ISO 27001: docs.litellm.ai · docs.litellm.ai
↑PCI DSS: docs.litellm.ai
↑Published SLA: docs.litellm.ai
↑Known restrictions: docs.litellm.ai · docs.litellm.ai

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

2026-06-21 Capabilities: {} → {"guardrails":true,"self_hosted":true,"observability":true,"spend_controls":tru…
2026-06-21 Summary Md: (none) → LiteLLM is an open-source LLM gateway that provides a unified OpenAI-compatible…
2026-06-21 Score Docs Quality: (none) → 15
2026-06-21 Score Procurement Friction: (none) → 60
2026-06-21 Score Trust Readiness: (none) → 53
2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
2026-06-21 Score Agent Friendliness: (none) → 20
2026-06-21 Score Pricing Transparency: (none) → 60
2026-06-21 Score Setup Speed: (none) → 80
2026-06-21 Llms Txt Present: (none) → No
2026-06-21 Has Structured Data: (none) → No
2026-06-21 Robots Allows Agents: (none) → Yes
2026-06-21 Status Page URL: (none) → https://status.litellm.ai
2026-06-21 Docs URL: (none) → https://docs.litellm.ai/
2026-06-21 Rendering: (none) → client_rendered
2026-06-21 Pricing Model: set to contact_sales
2026-06-21 Has Published Pricing: set to Yes
2026-06-21 Free Tier Available: set to Yes
2026-06-21 Free Tier Details: set to Open-source self-hosted version is free ($0) with 100+ LLM integrations, load b…
2026-06-21 Self Serve Signup: set to Yes
2026-06-21 Requires Sales Call: set to Yes
2026-06-21 Enterprise Plan Available: set to Yes
2026-06-21 SOC 2: set to type_1
2026-06-21 HIPAA: set to No
2026-06-21 GDPR: set to No
2026-06-21 ISO 27001: set to Yes
2026-06-21 PCI DSS: set to No
2026-06-21 SLA Published: set to Yes
2026-06-21 SLA URL: set to https://docs.litellm.ai/docs/enterprise
2026-06-21 Data Retention Policy URL: set to https://docs.litellm.ai/docs/data_security
2026-06-21 Known Restrictions: set to Enterprise features (SSO, audit logs, SCIM, per-key guardrails, batch cost trac…
2026-06-21 Auth Methods: set to api_key, jwt, oauth2
2026-06-21 Auth Docs URL: set to https://docs.litellm.ai/docs/proxy/virtual_keys
2026-06-21 API Style: set to rest
2026-06-21 Base URL: set to http://0.0.0.0:4000
2026-06-21 Versioning Scheme: set to none
2026-06-21 Stability: set to ga
2026-06-21 Deprecation Policy URL: set to https://docs.litellm.ai/docs/proxy/release_cycle
2026-06-21 MCP URL: set to https://docs.litellm.ai/docs/mcp
2026-06-21 Quickstart URL: set to https://docs.litellm.ai/docs/proxy/quick_start
2026-06-21 Idempotency Supported: set to No
2026-06-21 Slug: set to litellm
2026-06-21 Requires Verification: set to No
2026-06-21 Free Tier Limit: set to open-source self-host (SSO free up to 5 users)
2026-06-21 Launched At: set to 2023-01-01
2026-06-21 Notable Customers: set to Netflix, Lemonade
2026-06-21 Fields Not Found: set to documented_rate_limits, minimum_commitment, supported_regions, pci_dss, api_ver…
2026-06-21 Source Confidence: set to high
2026-06-21 Extractor: set to claude-subagent:sonnet

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/litellm \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/litellm/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →

Best for / Avoid if

Pricing & procurement

Capabilities

Trust & compliance

Developer surface

Integration

Adoption & maturity

Other AI Gateway & LLM Routing APIs

Vercel AI Gateway

Portkey

Bifrost (Maxim AI)

Cloudflare AI Gateway

TrueFoundry AI Gateway

Helicone

References

Change history

Suggest an edit / leave a review