Portkey

"Production Stack for Gen AI Builders" [1]

portkey.ai · By Portkey · Agent JSON · Suggest an edit · Last verified 2026-06-21 · Source confidence: high

Portkey is a production infrastructure layer for generative AI teams, providing a unified REST API across 1,600+ models with built-in routing, automatic fallback, load balancing, semantic caching, and observability logging. It targets developers and enterprises building LLM-powered applications who need cost controls, prompt versioning, and AI guardrails including PII redaction. Paid plans start at $49 per month with a free tier capped at 10,000 logged requests; an open-source self-host option is available under the MIT license. Portkey holds SOC 2 Type II, HIPAA, GDPR, and ISO 27001 certifications, though compliance certificates and private VPC deployments are restricted to the Enterprise tier.

Best for / Avoid if

Best for: Prototypes and side projects - free to start, no sales call; Regulated or enterprise workloads - compliance attestations and an enterprise plan; AI agents and automation - an agent-ready surface (MCP / llms.txt)

Pricing & procurement

Pricing model
Hybrid (base + usage) [2]
Published pricing
Yes [3]
Free tier
Yes [4]
Free tier details
Developer plan: free forever, 10k recorded logs/month, 3-day log retention, 30-day metrics retention, community support. Also open-source self-hosted gateway available (MIT license, unlimited requests, basic dashboard). [5]
Self-serve signup
Yes
Requires sales call
No
Enterprise plan
Yes [6]
Published prices
PlanItemPerAmountSource
Open SourceSelf-hosted gateway (open source, MIT license)$0source
DeveloperPlatform plan feemonth$0source
DeveloperRecorded logs included10,000 logs/month$0source
ProductionPlatform plan feemonth$49source
ProductionRecorded logs included100,000 logs/month$0source
ProductionLog overage100,000 additional requests$9source
EnterprisePlatform plan feemonth (custom pricing) - source

Capabilities

  • Semantic caching
  • Fallback / routing
  • Spend controls
  • Observability
  • Guardrails
  • Self-hosted
Supported actions
unified_chat_completions, openai_compatible_api, model_routing, automatic_fallback, load_balancing, semantic_caching, simple_caching, prompt_caching, spend_limits, budgets, rate_limiting, observability_logging, tracing, guardrails, pii_redaction, virtual_keys, byo_provider_keys, prompt_management, prompt_versioning, conditional_routing, canary_testing, automatic_retries, circuit_breaker, request_timeout, multimodal_support, embeddings_routing, image_generation_routing, audio_routing, rerank_routing, realtime_api_websocket, grpc_support, opentelemetry_export, audit_logs, rbac, sso_oidc, scim_provisioning, byok_encryption, webhooks_budget_alerts, mcp_gateway [7]
Regions
US, EU, India, AWS, GCP, Azure (customer VPC deployments) [8]
Input types
chat completions, text completions, embeddings, image generation, audio (speech, transcription, translation), rerank, realtime (WebSocket), fine-tuning, batch, assistants, moderations
Output types
streaming (SSE), JSON, OpenAI-compatible response, WebSocket (realtime)
Webhooks
Yes [9]
Sandbox / test mode
No [10]
SDK languages
Python, Node.js, Node.js (gateway self-host), Python (OpenAI drop-in), Node.js (OpenAI drop-in) [11]
MCP server
Yes [12]

Trust & compliance

SOC 2
SOC 2 Type II [13]
HIPAA
Yes [14]
GDPR
Yes [15]
ISO 27001
Yes [16]
PCI DSS
Unknown [17]
Published SLA
No [18]
Known restrictions
Semantic caching available on Production ($49/mo) and Enterprise plans only, Free Developer plan limited to 10k recorded logs/month with no overage, Production plan logs capped at 100k/month; overages at $9 per additional 100k requests (up to 3M), Log retention: 3 days (Developer), 30 days (Production), custom (Enterprise), SOC2/HIPAA/GDPR compliance certificates and custom BAAs available on Enterprise plan only, Private cloud / VPC deployment is Enterprise-only, SCIM provisioning and SSO are Enterprise-only, Prompt templates limited to 3 on free Developer plan (unlimited on Production+), Rate limits on virtual keys available to Enterprise and select Pro customers only [19]

Developer surface

Docs rendering: static · llms.txt present

Integration

API style
rest
Base URL
https://api.portkey.ai/v1
Version
v1
Versioning
url
Stability
ga
Auth methods
api_key, jwt
Error format
openai-compatible

SDKs

  • Python portkey-ai · repo
  • Node.js portkey-ai · repo
  • Node.js (gateway self-host) @portkey-ai/gateway · repo
  • Python (OpenAI drop-in) openai · repo
  • Node.js (OpenAI drop-in) openai · repo

Adoption & maturity

Launched
2023-01-01
Notable customers
Snorkel AI, RVO Health, Haptik, SiteGPT, Snorkel AI, Theories Group, Fontys ICT

Other AI Gateway & LLM Routing APIs

  • Vercel AI Gateway

    "AI Gateway provides a unified API to access hundreds of AI models through a single endpoint, with built-in budgets, usage monitoring, and fallbacks."

    Usage · free tier · public pricing · self-serve

  • Bifrost (Maxim AI)

    "The fastest, most resilient, enterprise-grade LLM, MCP, and agent gateway."

    Sales-led · free tier · public pricing · self-serve

  • Cloudflare AI Gateway

    "Connect to any model, dynamically route requests, and manage usage, billing, and logs from one unified gateway."

    Hybrid · free tier · public pricing · self-serve

  • TrueFoundry AI Gateway

    "A unified AI gateway to securely manage and govern AI across 1600+ models with policy control, real-time monitoring, and up to 30% cost reduction."

    Hybrid · free tier · public pricing · self-serve

  • Helicone

    "Open-source LLM observability and monitoring platform for developers" - routes, debugs, and analyzes AI applications with access to 100+ models through one API with built-in observability, automatic fallbacks, and zero markup pricing.

    Hybrid · free tier · public pricing · self-serve

  • OpenRouter

    "The Unified Interface For LLMs" - OpenRouter scouts for the best prices, the lowest latencies, and the highest throughput across dozens of providers, offering a single OpenAI-compatible API with automatic fallback, model routing, and unified billing.

    Usage · free tier · public pricing · self-serve

Portkey alternatives · Portkey vs Vercel AI Gateway · All AI Gateway & LLM Routing APIs APIs

References

Each field above carries a numbered source - hover for a preview, click to jump here.

  1. Description: portkey.ai
  2. Pricing model: portkey.ai · portkey.ai
  3. Published pricing: portkey.ai
  4. Free tier: portkey.ai · portkey.ai
  5. Free tier details: portkey.ai · portkey.ai
  6. Enterprise plan: portkey.ai
  7. Supported actions: portkey.ai · portkey.ai · portkey.ai
  8. Regions: portkey.ai · portkey.ai
  9. Webhooks: portkey.ai · portkey.ai
  10. Sandbox: portkey.ai
  11. SDK languages: portkey.ai
  12. MCP server: portkey.ai · portkey.ai
  13. SOC 2: portkey.ai · portkey.ai · portkey.ai
  14. HIPAA: portkey.ai · portkey.ai
  15. GDPR: portkey.ai · portkey.ai
  16. ISO 27001: portkey.ai · portkey.ai
  17. PCI DSS: portkey.ai
  18. Published SLA: status.portkey.ai
  19. Known restrictions: portkey.ai · portkey.ai · portkey.ai

Change history

Every field change, who made it, and when - from our audited data pipeline and editors.

  1. 2026-06-21 Capabilities: {}{"guardrails":true,"self_hosted":true,"observability":true,"spend_controls":tru…
  2. 2026-06-21 Summary Md: (none)Portkey is a production infrastructure layer for generative AI teams, providing…
  3. 2026-06-21 Score Setup Speed: (none)85
  4. 2026-06-21 Score Docs Quality: (none)25
  5. 2026-06-21 Score Procurement Friction: (none)100
  6. 2026-06-21 Score Trust Readiness: (none)70
  7. 2026-06-21 Best For: (none)Prototypes and side projects - free to start, no sales call, Regulated or enter…
  8. 2026-06-21 Scoring Methodology: (none)Scores are computed deterministically from this profile's published, sourced fi…
  9. 2026-06-21 Score Agent Friendliness: (none)65
  10. 2026-06-21 Score Pricing Transparency: (none)100
  11. 2026-06-21 Llms Txt Present: (none)Yes
  12. 2026-06-21 Rendering: (none)static
  13. 2026-06-21 Has Structured Data: (none)Yes
  14. 2026-06-21 Robots Allows Agents: (none)Yes
  15. 2026-06-21 Status Page URL: (none)https://status.portkey.ai
  16. 2026-06-21 Docs URL: (none)https://portkey.ai/docs/introduction/what-is-portkey
  17. 2026-06-21 Llms Txt URL: (none)https://portkey.ai/llms.txt
  18. 2026-06-21 Supported Regions: set to US, EU, India, AWS, GCP, Azure (customer VPC deployments)
  19. 2026-06-21 Supported Languages: set to (none)
  20. 2026-06-21 Input Types: set to chat completions, text completions, embeddings, image generation, audio (speech…
  21. 2026-06-21 Output Types: set to streaming (SSE), JSON, OpenAI-compatible response, WebSocket (realtime)
  22. 2026-06-21 Webhooks Supported: set to Yes
  23. 2026-06-21 Sandbox Available: set to No
  24. 2026-06-21 SDK Languages: set to Python, Node.js, Node.js (gateway self-host), Python (OpenAI drop-in), Node.js …
  25. 2026-06-21 Enterprise Plan Available: set to Yes
  26. 2026-06-21 SOC 2: set to type_2
  27. 2026-06-21 HIPAA: set to Yes
  28. 2026-06-21 SLA Published: set to No
  29. 2026-06-21 Data Retention Policy URL: set to https://portkey.ai/privacy-policy
  30. 2026-06-21 Known Restrictions: set to Semantic caching available on Production ($49/mo) and Enterprise plans only, Fr…
  31. 2026-06-21 Auth Methods: set to api_key, jwt
  32. 2026-06-21 Auth Docs URL: set to https://portkey.ai/docs/api-reference/inference-api/authentication
  33. 2026-06-21 API Style: set to rest
  34. 2026-06-21 Base URL: set to https://api.portkey.ai/v1
  35. 2026-06-21 API Version: set to v1
  36. 2026-06-21 Versioning Scheme: set to url
  37. 2026-06-21 Stability: set to ga
  38. 2026-06-21 MCP URL: set to https://mcp.portkey.ai
  39. 2026-06-21 Quickstart URL: set to https://portkey.ai/docs/guides/getting-started/getting-started-with-ai-gateway
  40. 2026-06-21 Error Format: set to openai-compatible
  41. 2026-06-21 Webhook Events URL: set to https://portkey.ai/docs/integrations/guardrails/bring-your-own-guardrails
  42. 2026-06-21 Requires Verification: set to No
  43. 2026-06-21 SDK Packages: set to Python, Node.js, Node.js (gateway self-host), Python (OpenAI drop-in), Node.js …
  44. 2026-06-21 Price Basis: set to month
  45. 2026-06-21 Free Tier Limit: set to 10,000 recorded logs/month (Dev plan); open-source self-host available (MIT)
  46. 2026-06-21 Launched At: set to 2023-01-01
  47. 2026-06-21 Notable Customers: set to Snorkel AI, RVO Health, Haptik, SiteGPT, Snorkel AI, Theories Group, Fontys ICT
  48. 2026-06-21 Fields Not Found: set to pci_dss, documented_rate_limits, sla_published, minimum_commitment, deprecation…
  49. 2026-06-21 Source Confidence: set to high
  50. 2026-06-21 Extractor: set to claude-subagent:sonnet

Suggest an edit / leave a review

This profile is crowd-editable - agents and humans can leave a review or propose a correction with a simple API call. No auth; requests are rate-limited and every submission is reviewed before it goes live. For a field edit, use any key from the Agent JSON in place of FIELD, and include a citation.

Leave a review or comment

curl -X POST https://apio.sh/api/feedback/portkey \
  -H 'Content-Type: application/json' \
  -d '{"kind":"review","rating":5,"body":"Your experience with this API…"}'

Suggest a correction to a field (cite a source)

curl -X POST https://apio.sh/api/suggest/portkey/FIELD \
  -H 'Content-Type: application/json' \
  -d '{"value":"corrected value","citations":[{"url":"https://source.example/page","excerpt":"supporting quote"}],"note":"what changed and why"}'

All the ways to contribute →