Mistral Ai Rate Limits
Mistral AI's la Plateforme exposes a chat-completions API at api.mistral.ai/v1 with per-account, per-model rate limits enforced as requests-per-second and tokens-per-minute. Specific per-tier numbers are not displayed on the public docs / pricing pages we sampled — they are surfaced in-product on the la Plateforme console and can be raised via support. 429 with Retry-After indicates throttling.
Mistral Ai Rate Limits is the machine-readable rate-limit profile for Mistral AI on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 3 rate-limit definitions, measuring requests_per_second, tokens_per_minute, and concurrent_requests.
The profile also includes 4 backoff/retry policies defined and response codes documented for throttled and serviceUnavailable.
Tagged areas include Rate Limiting, AI, and Large Language Models.