Modal · Rate Limits
Modal Rate Limits
Plan-based quota limits for Modal. Modal does not publish per-request token-bucket rate limits in the public pricing page; instead it caps concurrent containers, GPU concurrency, deployed cron jobs, and deployed webhooks per plan. Enterprise customers negotiate custom limits.
Modal Rate Limits is the machine-readable rate-limit profile for Modal on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 3 rate-limit definitions, across the Starter, Team, and Enterprise tiers.
The profile also includes response codes documented for throttled and quotaExceeded.
Tagged areas include Rate Limiting, Quotas, Serverless, and GPU.
3 Limits
Throttle: 429
Quota: 429
Rate LimitingQuotasServerlessGPU