Unify · Rate Limits

Unify Ai Rate Limits

Unify applies rate limits at the API gateway level to ensure fair usage across all consumers and protect the reliability of provider routing. Because Unify aggregates across 100+ upstream LLM providers, effective throughput depends on both Unify's own gateway limits and the underlying limits of the target model and provider. Unify's routing layer manages fallbacks and load balancing to maximize throughput within these constraints. Specific numeric limits are credit-tier dependent and available via the console. The platform itself helps users avoid hitting individual provider rate limits through intelligent routing and load distribution.

Unify Ai Rate Limits is the machine-readable rate-limit profile for Unify on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures 5 rate-limit definitions, across the standard tier.

Tagged areas include LLM, AI, Large Language Models, LLM Routing, and Model Gateway.

5 Limits
LLMAILarge Language ModelsLLM RoutingModel GatewayAI GatewayOpenAIAnthropicMistralBenchmarkingModel ComparisonAI InfrastructureMachine Learning

Limits

API Request Rate account
Exact RPM limits are tier-dependent and visible in the console dashboard. Contact Unify for high-throughput production limits.
Concurrent Requests account
Concurrency limits scale with credit tier. The routing layer may fan out a single request to multiple providers for benchmarking or fallback.
Provider-Level Limits provider
Bringing your own provider API keys (BYOK) is supported for all major providers and recommended for production workloads requiring high throughput.
Message Attachment Size request
10
Credit Balance Throttling account
Monitor credit usage via the console dashboard at console.unify.ai.