Modal · Rate Limits

Modal Rate Limits

Plan-based quota limits for Modal. Modal does not publish per-request token-bucket rate limits in the public pricing page; instead it caps concurrent containers, GPU concurrency, deployed cron jobs, and deployed webhooks per plan. Enterprise customers negotiate custom limits.

Modal Rate Limits is the machine-readable rate-limit profile for Modal on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures 3 rate-limit definitions, across the Starter, Team, and Enterprise tiers.

The profile also includes response codes documented for throttled and quotaExceeded.

Tagged areas include Rate Limiting, Quotas, Serverless, and GPU.

3 Limits Throttle: 429 Quota: 429
Rate LimitingQuotasServerlessGPU

Limits

Sources