Jina Ai Rate Limits
Jina AI applies per-API-key rate limits across three tiers (Free, Paid, Premium). Limits are enforced as RPM (requests per minute), TPM (tokens per minute), and concurrent in-flight requests and apply uniformly across all Search Foundation services (Embeddings, Reranker, Reader, Classifier, Segmenter, DeepSearch). Tier is determined by token balance / billing status on the key.
Jina Ai Rate Limits is the machine-readable rate-limit profile for Jina AI on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 9 rate-limit definitions, measuring requests_per_minute, tokens_per_minute, and concurrent_requests.
The profile also includes 4 backoff/retry policies defined and response codes documented for throttled and quotaExceeded.
Tagged areas include Rate Limiting, AI, Embeddings, and LLM.