Anthropic · Rate Limits

Anthropic Rate Limits

Reconciled rate limits for the Anthropic Messages, Batches, and Managed Agents APIs. Token-bucket algorithm; only uncached input tokens count toward ITPM on most models.

Anthropic Rate Limits is the machine-readable rate-limit profile for Anthropic on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures 12 rate-limit definitions, across the Tier 1, Tier 2, Tier 3, and Tier 4 tiers.

The profile also includes 3 backoff/retry policies defined and response codes documented for throttled and quotaExceeded.

Tagged areas include AI, Rate Limiting, and Quotas.

12 Limits Throttle: 429 Quota: 429
AIRate LimitingQuotas

Limits

Policies

Cache-aware ITPM
On most models, cache_read_input_tokens do NOT count toward ITPM, making prompt caching an effective way to increase throughput.
Auto tier advancement
Tiers advance automatically based on cumulative credit purchase thresholds.
Acceleration limits
Sharp usage spikes can trigger 429s independent of tier limits — ramp gradually.

Sources