Openlayer Rate Limits
The Openlayer REST API authenticates with a Bearer API key and meters usage primarily through plan-level quotas, most notably the monthly cap on production inferences (rows published to inference pipelines) on the Basic plan. Openlayer does not publish explicit per-endpoint request-per-minute or token-per-minute limits in its public API reference; specific throttling thresholds are not reconciled in this artifact.
Openlayer Rate Limits is the machine-readable rate-limit profile for Openlayer on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 3 rate-limit definitions, measuring inferences, requests, and rows.
The profile also includes 3 backoff/retry policies defined and response codes documented for throttled.
Tagged areas include AI, Evaluation, Testing, Observability, and LLM.