MiniMax · Rate Limits

Minimax Ai Rate Limits

Per-modality rate limits for the MiniMax API. Text APIs share a single 500 RPM / 20M TPM pool; speech, video, image, and music APIs each have their own caps.

Minimax Ai Rate Limits is the machine-readable rate-limit profile for MiniMax on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures 8 rate-limit definitions, measuring requests-per-minute and tokens-per-minute.

The profile also includes 2 backoff/retry policies defined and response codes documented for throttled.

Tagged areas include AI, LLM, Multimodal, Rate Limiting, and Quotas.

8 Limits Throttle: 429
AILLMMultimodalRate LimitingQuotas

Limits

Text APIs (all models) account
requests-per-minute
500
Combined with 20,000,000 TPM (input + output).
Text APIs TPM account
tokens-per-minute
20000000
Speech (T2A) account
requests-per-minute
60
20,000 TPM.
Voice Cloning account
requests-per-minute
60
Voice Design account
requests-per-minute
20
Video Generation account
requests-per-minute
5
Image Generation account
requests-per-minute
10
60 TPM.
Music Generation account
requests-per-minute
120
Max 20 concurrent connections.

Policies

Backoff Strategy
Exponential backoff with jitter; honor Retry-After.
Limit Increase
Contact api@minimax.io for higher caps.

Sources