Inkeep Rate Limits
Inkeep's AI / RAG chat completions endpoint throttles requests per IP address and bounds each chat session to roughly 30 messages, with a recommended input of <=100 tokens and output of <=1,000 tokens per request. Inkeep does not publish exact numeric per-account RPM/TPM ceilings; effective limits depend on plan and quoted usage. This profile does not reconcile specific values.
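The session and token guidance above could be enforced client-side before a request is sent. The following is a minimal sketch, not part of any Inkeep SDK: `SessionBudget`, `rough_token_count`, and both constants are hypothetical names, the numbers are the approximate figures quoted above rather than official limits, and whitespace splitting is only a crude stand-in for a real tokenizer.

```python
from dataclasses import dataclass

# Assumed values, copied from the approximate figures in this profile.
MAX_SESSION_MESSAGES = 30
RECOMMENDED_INPUT_TOKENS = 100


@dataclass
class SessionBudget:
    """Counts messages sent in one chat session (hypothetical helper)."""

    sent: int = 0

    def can_send(self) -> bool:
        # True while the session is still under the assumed message bound.
        return self.sent < MAX_SESSION_MESSAGES

    def record(self) -> None:
        # Call once per message actually sent.
        if not self.can_send():
            raise RuntimeError("session message budget exhausted; start a new session")
        self.sent += 1


def rough_token_count(text: str) -> int:
    # Whitespace split: an approximation, not the service's tokenizer.
    return len(text.split())


def within_input_guidance(text: str) -> bool:
    # Checks the prompt against the recommended input size.
    return rough_token_count(text) <= RECOMMENDED_INPUT_TOKENS
```

A caller would check `budget.can_send()` and `within_input_guidance(prompt)` before each request, then call `budget.record()` once the message is sent.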
Inkeep Rate Limits is the machine-readable rate-limit profile for Inkeep on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 5 rate-limit definitions, measuring requests, messages, and tokens.
The profile also defines 2 backoff/retry policies and documents the response codes returned for throttled requests.
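A client might honor a backoff/retry policy along the following lines. This is a sketch under stated assumptions: the profile text here does not spell out the policies or the throttled status code, so HTTP 429, the optional retry-after hint, the delay parameters, and the names `backoff_delay` and `call_with_retry` are all illustrative rather than Inkeep's documented behavior. The technique shown is capped exponential backoff with full jitter.

```python
import random
import time
from typing import Callable, Optional, Tuple

THROTTLED_STATUS = 429  # assumption: conventional HTTP "Too Many Requests"


def backoff_delay(
    attempt: int,
    base: float = 1.0,
    cap: float = 30.0,
    retry_after: Optional[float] = None,
) -> float:
    """Seconds to wait before the next attempt (attempt is 0-based)."""
    if retry_after is not None:
        # Prefer a server-provided hint when one exists, bounded by the cap.
        return min(retry_after, cap)
    # Full jitter: uniform between 0 and the capped exponential ceiling.
    return random.uniform(0, min(cap, base * 2 ** attempt))


def call_with_retry(
    send: Callable[[], Tuple[int, Optional[float]]],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Calls send() until it is not throttled or attempts run out.

    send() is a caller-supplied function returning (status, retry_after).
    """
    for attempt in range(max_attempts):
        status, retry_after = send()
        if status != THROTTLED_STATUS:
            return status
        if attempt < max_attempts - 1:
            sleep(backoff_delay(attempt, retry_after=retry_after))
    return THROTTLED_STATUS
```

Passing `sleep` as a parameter keeps the function testable without real delays; in production the default `time.sleep` applies.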
Tagged areas include AI, Support, RAG, Agents, and Documentation.