Haystack AI Rate Limits
The open-source Haystack framework and self-hosted Hayhooks deployments are not rate limited by deepset; throughput is bounded only by your own infrastructure and by the model and embedding providers your pipelines call. The hosted deepset Cloud REST API applies request timeouts (about 2 minutes for search and 3 minutes for other requests) and standard pagination, and enforces account- and plan-level quotas that are governed by enterprise agreement rather than a published per-minute table. Specific numeric limits are not reconciled in this artifact.
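As a rough illustration of how a client might account for those timeouts, the sketch below picks a client-side timeout slightly longer than the approximate server-side one, so the server's own timeout response arrives before the client gives up. The `SERVER_TIMEOUT_S` table, the `client_timeout` helper, and the 5-second margin are hypothetical names and values chosen for this example; only the 2-minute and 3-minute figures come from the summary above.

```python
# Approximate server-side timeouts quoted in the summary, in seconds.
# The dictionary and helper below are illustrative, not part of any deepset SDK.
SERVER_TIMEOUT_S = {"search": 120.0, "other": 180.0}


def client_timeout(kind: str, margin_s: float = 5.0) -> float:
    """Return a client-side timeout a little above the server-side one.

    Any request kind other than "search" falls back to the "other" bucket.
    """
    server_s = SERVER_TIMEOUT_S.get(kind, SERVER_TIMEOUT_S["other"])
    return server_s + margin_s
```

The returned value could then be passed as the timeout argument of whichever HTTP client you use.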
Haystack AI Rate Limits is the machine-readable rate-limit profile for Haystack / deepset on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 5 rate-limit definitions, measuring duration, items, and requests.
The profile also defines 2 backoff/retry policies and documents the response codes returned for throttled requests.
Tagged areas include AI, LLM, RAG, Open Source, and Rate Limiting.
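The profile mentions backoff/retry policies without spelling them out here, so the following is only a minimal sketch of one common approach: capped exponential backoff with full jitter. It assumes throttled requests are signalled with HTTP 429, which this summary does not confirm, and the names `backoff_delays`, `call_with_retry`, and the `send` callable are hypothetical.

```python
import random
import time
from typing import Callable, Iterator, Tuple


def backoff_delays(max_retries: int = 5, base_s: float = 1.0,
                   cap_s: float = 60.0) -> Iterator[float]:
    """Yield capped exponential delays: base, 2*base, 4*base, ... up to cap."""
    for attempt in range(max_retries):
        yield min(cap_s, base_s * (2 ** attempt))


def call_with_retry(send: Callable[[], Tuple[int, str]],
                    max_retries: int = 5, base_s: float = 1.0,
                    cap_s: float = 60.0,
                    sleep: Callable[[float], None] = time.sleep,
                    throttled_status: int = 429) -> Tuple[int, str]:
    """Call send() and retry while it reports the throttled status.

    send() is any callable returning (status_code, body). The 429 default
    is an assumption, not a value taken from the profile.
    """
    status, body = send()
    for delay in backoff_delays(max_retries, base_s, cap_s):
        if status != throttled_status:
            break
        # Full jitter: wait a random time between 0 and the current delay.
        sleep(random.uniform(0, delay))
        status, body = send()
    return status, body
```

Wrapping the actual HTTP call in `send` keeps the retry logic independent of the client library, and injecting `sleep` makes the policy easy to test without waiting.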