Google Gemini Rate Limits
Gemini API rate limits are scoped per usage tier (Free, Tier 1, Tier 2, Tier 3) and per model. Each tier defines RPM (requests per minute), TPM (tokens per minute), and RPD (requests per day) ceilings. Tier promotion is automatic based on cumulative spend and account age. Specific numerical limits are not statically published per model; they are visible in Google AI Studio per project. The Batch API has separate limits.
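As a rough illustration of how a client might stay under the per-minute ceilings described above, the sketch below tracks requests and tokens over a sliding 60-second window. The class name MinuteWindow, the sliding-window approach, and the numbers in the usage note are assumptions made for illustration; the real per-model limits come from Google AI Studio, and the service's own accounting may differ. RPD tracking is omitted for brevity.

```python
from collections import deque


class MinuteWindow:
    """Hypothetical client-side guard for RPM and TPM over a sliding 60-second window."""

    def __init__(self, rpm, tpm):
        self.rpm = rpm          # requests-per-minute ceiling (assumed value, set by caller)
        self.tpm = tpm          # tokens-per-minute ceiling (assumed value, set by caller)
        self.events = deque()   # (timestamp_seconds, tokens) for each admitted request

    def _expire(self, now):
        # Drop requests that are 60 seconds old or older.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def try_acquire(self, tokens, now):
        """Admit and record the request if it fits under both ceilings, else return False."""
        self._expire(now)
        used_tokens = sum(t for _, t in self.events)
        if len(self.events) + 1 > self.rpm or used_tokens + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        return True
```

With MinuteWindow(rpm=2, tpm=100), two 40-token requests in the same minute are admitted and a third is refused on the request count, even though tokens remain.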
Google Gemini Rate Limits is the machine-readable rate-limit profile for Google Gemini on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 7 rate-limit definitions, with limits expressed in the following units: varies, monthly_spend_cap_USD, concurrent_requests, bytes, and tokens.
The profile also defines 5 backoff/retry policies and documents response codes for the throttled, resourceExhausted, and quotaExceeded conditions.
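The profile's individual backoff/retry policies are not reproduced here, so the following is only a generic sketch of one common approach, exponential backoff with full jitter. ThrottledError is a hypothetical stand-in for whatever throttled response the client surfaces, and the base delay, cap, and retry count are assumed defaults rather than values taken from the profile.

```python
import random
import time


class ThrottledError(Exception):
    """Hypothetical stand-in for a throttled response from the API."""


def backoff_delays(max_retries, base=1.0, cap=32.0, rng=random.random):
    """Yield one delay per retry, scaled by rng() within min(cap, base * 2**attempt)."""
    for attempt in range(max_retries):
        yield rng() * min(cap, base * 2 ** attempt)


def call_with_backoff(fn, max_retries=5, base=1.0, cap=32.0,
                      sleep=time.sleep, rng=random.random):
    """Call fn(); on ThrottledError, wait and retry up to max_retries times."""
    for delay in backoff_delays(max_retries, base, cap, rng):
        try:
            return fn()
        except ThrottledError:
            sleep(delay)
    # Final attempt after the last wait; any error now propagates to the caller.
    return fn()
```

The sleep and rng parameters are injectable so the retry schedule can be checked without real waiting. Randomizing the delay keeps many clients from retrying in lockstep after a shared throttling event.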
Tagged areas include Generative AI, LLM, Google, and Rate Limiting.