Amazon Q Rate Limits
Amazon Q APIs (qbusiness, qdeveloper) follow standard AWS API throttling per account/region. Q Business agentic-request quotas and Q Developer agentic-request limits are tracked per user/subscription and surface as plan-level quotas rather than per-second throttles. AWS recommends exponential backoff with jitter on ThrottlingException.
Amazon Q Rate Limits is the machine-readable rate-limit profile for Amazon Q on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 5 rate-limit definitions, measuring varies, requests_per_second, requests_per_month, and lines_per_month.
The profile also includes 3 backoff/retry policies defined and response codes documented for throttled, quotaExceeded, and serviceUnavailable.
Tagged areas include Rate Limiting, GenAI, and Amazon Q.