Runpod Rate Limits

Name: Runpod Rate Limits
Creator: RunPod
Keywords: AI, Cloud, Compute, GPU, Inference, Machine Learning, Serverless, Rate Limiting, Quotas, Throttling

Scaffolded rate limit definitions for the RunPod API surface. Captures per-tier quotas, burst behavior, response signaling, and recovery semantics. Defaults are scaffold values to be replaced with published provider limits.

Runpod Rate Limits is the machine-readable rate-limit profile for RunPod on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures two rate-limit definitions covering the free and pro tiers, both measured in requests_per_minute.

The profile also documents response codes for the throttled, quotaExceeded, and serviceUnavailable conditions.

Tagged areas include AI, Cloud, Compute, GPU, and Inference.

2 Limits · Throttle: 429 · Quota: 429
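
Because both the throttled and quotaExceeded conditions are signaled with HTTP 429, a client generally needs to retry with a delay rather than fail immediately. The following is a minimal sketch of that pattern, not RunPod's documented behavior. It assumes that serviceUnavailable maps to HTTP 503 and that the server may send a Retry-After header in whole seconds; neither is stated in this profile. The function name call_with_backoff and its parameters are hypothetical, and the request itself is passed in as a callable so the sketch does not depend on any particular HTTP library.

```python
import random
import time
from typing import Callable, Mapping, Tuple

Response = Tuple[int, Mapping[str, str]]  # (status code, response headers)

# 429 comes from this profile; 503 for serviceUnavailable is an assumption.
RETRYABLE = (429, 503)


def call_with_backoff(
    send: Callable[[], Response],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send() until it returns a non-retryable status or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, headers = send()
        # Return on success, on a non-retryable error, or on the final attempt.
        if status not in RETRYABLE or attempt == max_attempts - 1:
            return status, headers
        retry_after = headers.get("Retry-After")  # assumed header, may be absent
        if retry_after is not None and retry_after.isdigit():
            delay = float(retry_after)
        else:
            # Exponential backoff with jitter when the server gives no hint.
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
        sleep(delay)
```

In use, send would wrap a single API request and return its status code and headers. Injecting sleep keeps the function easy to exercise without real waiting.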

Limits

Free Tier Default api-key

requests_per_minute · minute

Pro Tier Default api-key

requests_per_minute · minute

120
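
A client can also avoid hitting a requests_per_minute limit by pacing itself. The sketch below tracks request timestamps in a sliding one-minute window and reports how long to wait before the next call. It uses the 120 figure listed above, which this profile describes as a scaffold value rather than a published limit. Whether the provider counts requests in a sliding or a fixed window is not stated here, so the sliding window is an assumption, and the class name SlidingWindowLimiter is hypothetical.

```python
import time
from collections import deque
from typing import Callable, Deque


class SlidingWindowLimiter:
    """Client-side pacing for a requests-per-window limit (illustrative only)."""

    def __init__(
        self,
        limit: int = 120,        # scaffold value from this profile
        window: float = 60.0,    # one minute, in seconds
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.limit = limit
        self.window = window
        self.clock = clock
        self.stamps: Deque[float] = deque()

    def wait_time(self) -> float:
        """Return seconds to wait before the next request is within the limit."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.stamps and now - self.stamps[0] >= self.window:
            self.stamps.popleft()
        if len(self.stamps) < self.limit:
            return 0.0
        # The oldest request must leave the window before another is allowed.
        return self.window - (now - self.stamps[0])

    def record(self) -> None:
        """Record that a request was just sent."""
        self.stamps.append(self.clock())
```

A caller would sleep for wait_time() seconds when it is non-zero, send the request, and then call record(). Pacing on the client reduces 429 responses but does not replace handling them, since the server's accounting may differ.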