LiteLLM
"LLM Gateway (OpenAI Proxy) to manage authentication, loadbalancing, and spend tracking across 100+ LLMs. All in the OpenAI format." [1]
LiteLLM is an open-source LLM gateway that provides a unified OpenAI-compatible API across 100+ model providers, handling load balancing, automatic failover, semantic caching, rate limiting, and spend tracking in one proxy layer. It is self-hosted rather than offered as a managed SaaS, making it suited for teams that need centralized governance over multiple LLM deployments without vendor lock-in. The free tier covers self-hosting with SSO available up to five users; enterprise licensing is required for features such as audit logs, SCIM, per-key guardrails, and batch cost tracking. LiteLLM holds SOC 2 Type I and ISO 27001 certifications, with SDKs for Python and Node.js and support for API key, JWT, and OAuth2 authentication.
Best for / Avoid if
Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; Teams needing broad API coverage out of the box
Pricing & procurement
- Pricing model
- Contact sales [2]
- Published pricing
- ✓ Yes [3]
- Free tier
- ✓ Yes [4]
- Free tier details
- Open-source self-hosted version is free ($0) with 100+ LLM integrations, load balancing, virtual keys, budgets, teams, and LLM guardrails. Available on GitHub with 51K+ stars and 240M+ Docker pulls. No data or telemetry stored on LiteLLM servers when self-hosted.
- Self-serve signup
- ✓ Yes [5]
- Requires sales call
- ✓ Yes [6]
- Enterprise plan
- ✓ Yes [7]
| Plan | Item | Per | Amount | Source |
|---|---|---|---|---|
| Open Source | Self-hosted open-source gateway | month | $0 | source |
| Enterprise | Enterprise license (cloud or self-hosted) | custom quote | - | source |
Capabilities
- Supported actions
- unified_chat_completions, openai_compatible_api, model_routing, automatic_fallback, load_balancing, semantic_caching, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, tracing, guardrails, pii_redaction, virtual_keys, byo_provider_keys, prompt_management, embeddings_routing, image_generation_routing, audio_transcription_routing, text_to_speech_routing, batch_api, tag_based_routing, latency_based_routing, cost_based_routing, usage_based_routing, context_window_fallback, content_policy_fallback, key_rotation, sso_oidc_saml, audit_logs, secret_manager_integration, prometheus_metrics, opentelemetry_tracing, multi_tenant_organizations, mcp_gateway [8]
- Input types
- chat completions, text completions, embeddings, image generation, audio transcription, text to speech, batch [9]
- Output types
- streaming (SSE), OpenAI-compatible response, JSON
- Webhooks
- ✓ Yes [10]
- Sandbox / test mode
- ✗ No
- SDK languages
- Python, Node.js [11]
- MCP server
- ✗ No [12]
Trust & compliance
- SOC 2
- SOC 2 Type I [13]
- HIPAA
- ✗ No [14]
- GDPR
- ✗ No [15]
- ISO 27001
- ✓ Yes [16]
- PCI DSS
- ✗ No [17]
- Published SLA
- ✓ Yes [18]
- Known restrictions
- Enterprise features (SSO, audit logs, SCIM, per-key guardrails, batch cost tracking) require Enterprise license, SOC 2 Type I and ISO 27001 reports available only on Enterprise plan upon request; SOC 2 Type II and ISO 27001 certifications undergoing recertification via Vanta as of March 2026 after prior Delve-issued certs were questioned, SSO free for up to 5 users; enterprise licensing required beyond that, PostgreSQL required for virtual keys and spend tracking in self-hosted deployments, Production minimum: 4 CPU cores and 8 GB RAM for self-hosted, No cloud-hosted SaaS gateway - product is self-hosted or customer-deployed, Enterprise pricing is contact-sales only; no public per-seat or per-request gateway fees [19]
Developer surface
Integration
Adoption & maturity
- Launched
- 2023-01-01
- Notable customers
- Netflix, Lemonade
Other AI Gateway & LLM Routing APIs
Vercel AI Gateway
"AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."
Portkey
"Production Stack for Gen AI Builders"
Bifrost (Maxim AI)
"The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."
Cloudflare AI Gateway
"Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."
TrueFoundry AI Gateway
"A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."
Helicone
"Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.
References
- ↑Description: litellm.ai
- ↑Pricing model: docs.litellm.ai · litellm.ai
- ↑Published pricing: litellm.ai
- ↑Free tier: litellm.ai · docs.litellm.ai
- ↑Self-serve signup: github.com
- ↑Requires sales call: docs.litellm.ai · litellm.ai
- ↑Enterprise plan: docs.litellm.ai · litellm.ai
- ↑Supported actions: docs.litellm.ai · docs.litellm.ai
- ↑Input types: docs.litellm.ai
- ↑Webhooks: docs.litellm.ai
- ↑SDK languages: github.com
- ↑MCP server: docs.litellm.ai
- ↑SOC 2: docs.litellm.ai · docs.litellm.ai
- ↑HIPAA: docs.litellm.ai
- ↑GDPR: docs.litellm.ai
- ↑ISO 27001: docs.litellm.ai · docs.litellm.ai
- ↑PCI DSS: docs.litellm.ai
- ↑Published SLA: docs.litellm.ai
- ↑Known restrictions: docs.litellm.ai · docs.litellm.ai
Change history
- 2026-06-21 Capabilities: {} → {"guardrails":true,"self_hosted":true,"observability":true,"spend_controls":tru…
- 2026-06-21 Summary Md: (none) → LiteLLM is an open-source LLM gateway that provides a unified OpenAI-compatible…
- 2026-06-21 Score Docs Quality: (none) → 15
- 2026-06-21 Score Procurement Friction: (none) → 60
- 2026-06-21 Score Trust Readiness: (none) → 53
- 2026-06-21 Best For: (none) → Prototypes and side projects - free to start, no sales call, Regulated or enter…
- 2026-06-21 Scoring Methodology: (none) → Scores are computed deterministically from this profile's published, sourced fi…
- 2026-06-21 Score Agent Friendliness: (none) → 20
- 2026-06-21 Score Pricing Transparency: (none) → 60
- 2026-06-21 Score Setup Speed: (none) → 80
- 2026-06-21 Llms Txt Present: (none) → No
- 2026-06-21 Has Structured Data: (none) → No
- 2026-06-21 Robots Allows Agents: (none) → Yes
- 2026-06-21 Status Page URL: (none) → https://status.litellm.ai
- 2026-06-21 Docs URL: (none) → https://docs.litellm.ai/
- 2026-06-21 Rendering: (none) → client_rendered
- 2026-06-21 Pricing Model: set to contact_sales
- 2026-06-21 Has Published Pricing: set to Yes
- 2026-06-21 Free Tier Available: set to Yes
- 2026-06-21 Free Tier Details: set to Open-source self-hosted version is free ($0) with 100+ LLM integrations, load b…
- 2026-06-21 Self Serve Signup: set to Yes
- 2026-06-21 Requires Sales Call: set to Yes
- 2026-06-21 Enterprise Plan Available: set to Yes
- 2026-06-21 SOC 2: set to type_1
- 2026-06-21 HIPAA: set to No
- 2026-06-21 GDPR: set to No
- 2026-06-21 ISO 27001: set to Yes
- 2026-06-21 PCI DSS: set to No
- 2026-06-21 SLA Published: set to Yes
- 2026-06-21 SLA URL: set to https://docs.litellm.ai/docs/enterprise
- 2026-06-21 Data Retention Policy URL: set to https://docs.litellm.ai/docs/data_security
- 2026-06-21 Known Restrictions: set to Enterprise features (SSO, audit logs, SCIM, per-key guardrails, batch cost trac…
- 2026-06-21 Auth Methods: set to api_key, jwt, oauth2
- 2026-06-21 Auth Docs URL: set to https://docs.litellm.ai/docs/proxy/virtual_keys
- 2026-06-21 API Style: set to rest
- 2026-06-21 Base URL: set to http://0.0.0.0:4000
- 2026-06-21 Versioning Scheme: set to none
- 2026-06-21 Stability: set to ga
- 2026-06-21 Deprecation Policy URL: set to https://docs.litellm.ai/docs/proxy/release_cycle
- 2026-06-21 MCP URL: set to https://docs.litellm.ai/docs/mcp
- 2026-06-21 Quickstart URL: set to https://docs.litellm.ai/docs/proxy/quick_start
- 2026-06-21 Idempotency Supported: set to No
- 2026-06-21 Slug: set to litellm
- 2026-06-21 Requires Verification: set to No
- 2026-06-21 Free Tier Limit: set to open-source self-host (SSO free up to 5 users)
- 2026-06-21 Launched At: set to 2023-01-01
- 2026-06-21 Notable Customers: set to Netflix, Lemonade
- 2026-06-21 Fields Not Found: set to documented_rate_limits, minimum_commitment, supported_regions, pci_dss, api_ver…
- 2026-06-21 Source Confidence: set to high
- 2026-06-21 Extractor: set to claude-subagent:sonnet
Suggest an edit / leave a review
Leave a review or comment
curl -X POST https://apio.sh/api/feedback/litellm \
-H 'Content-Type: application/json' \
-d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'Suggest a correction to a field (cite a source)
curl -X POST https://apio.sh/api/suggest/litellm/FIELD \
-H 'Content-Type: application/json' \
-d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'