Portkey Rate Limits
Portkey is an LLM gateway whose runtime quotas are dominated by the upstream provider being proxied (OpenAI, Anthropic, Bedrock, etc.); Portkey itself exposes plan-bound caps on recorded logs (10k/month on Developer, 100k/month on Production with overage to 3M, 10M+ on Enterprise) rather than per-second request throttling. Enterprise customers can configure granular budget and rate limits per virtual key and workspace. Concrete numeric request-per-second ceilings are not published on the public docs site at the time of writing.
Portkey Rate Limits is the machine-readable rate-limit profile for Portkey on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 5 rate-limit definitions, measuring requests_per_month and varies.
The profile also includes 4 backoff/retry policies defined and response codes documented for throttled, quotaExceeded, and serviceUnavailable.
Tagged areas include AI Gateways, Governance, Observability, and Rate Limiting.