Runpod Rate Limits

Scaffolded rate limit definitions for the RunPod API surface. Captures per-tier quotas, burst behavior, response signaling, and recovery semantics. Defaults are scaffold values to be replaced with published provider limits.

Runpod Rate Limits is the machine-readable rate-limit profile for RunPod on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures two rate-limit definitions, one for the free tier and one for the pro tier, both measured in requests_per_minute.

The profile also documents response codes for three conditions: throttled, quotaExceeded, and serviceUnavailable.
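A client that hits one of these conditions needs to decide whether and how long to wait before retrying. The following is a minimal Python sketch, not RunPod's documented behavior. The 429 status comes from the profile summary; treating serviceUnavailable as HTTP 503, honoring a Retry-After header, and the helper name retry_delay with its base and cap parameters are all assumptions made for illustration.

```python
import random

# 429 is the throttle/quota status listed in the profile summary.
# 503 is an assumption: the standard HTTP status for "Service Unavailable".
RETRYABLE_STATUS = {429, 503}


def retry_delay(status, headers, attempt, base=1.0, cap=60.0):
    """Return seconds to wait before retrying, or None if not retryable.

    `headers` is a plain dict; the lookup below is case-sensitive.
    `attempt` is the zero-based retry count.
    """
    if status not in RETRYABLE_STATUS:
        return None

    # Prefer a server-supplied Retry-After given in seconds.
    # Whether the API sends this header is an assumption.
    retry_after = headers.get("Retry-After")
    if retry_after is not None:
        try:
            return min(cap, max(0.0, float(retry_after)))
        except ValueError:
            pass  # the HTTP-date form is not handled in this sketch

    # Fall back to capped exponential backoff with full jitter.
    return random.uniform(0.0, min(cap, base * (2 ** attempt)))
```

A caller would sleep for the returned number of seconds and retry, giving up after a fixed number of attempts. The jitter spreads out retries from clients that were throttled at the same moment.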

Tagged areas include AI, Cloud, Compute, GPU, and Inference.

2 Limits · Throttle: 429 · Quota: 429
Tags: AI, Cloud, Compute, GPU, Inference, Machine Learning, Serverless, Rate Limiting, Quotas, Throttling

Limits

Free Tier Default · api-key · requests_per_minute · minute · 10
Pro Tier Default · api-key · requests_per_minute · minute · 120
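To stay under these limits on the client side, requests can be gated before they are sent. The sketch below is one possible approach in Python, using the scaffold values of 10 and 120 requests per minute. The sliding-window strategy, the class name SlidingWindowLimiter, and the TIER_LIMITS mapping are assumptions for illustration; the profile does not state how the server counts its window.

```python
import time
from collections import deque

# Scaffold values from the profile (requests per minute).
# Replace with published provider limits when available.
TIER_LIMITS = {"free": 10, "pro": 120}


class SlidingWindowLimiter:
    """Allow at most `limit` requests in any `window`-second span."""

    def __init__(self, limit, window=60.0, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock      # injectable so the limiter can be tested
        self.sent = deque()     # timestamps of requests still in the window

    def try_acquire(self):
        """Record a request and return True if allowed, else return False."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            self.sent.append(now)
            return True
        return False
```

A caller would create one limiter per API key, for example SlidingWindowLimiter(TIER_LIMITS["free"]), and send a request only when try_acquire() returns True. A sliding window is the conservative choice here: it never allows more than the limit in any 60-second span, regardless of where the server's window boundaries fall.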