Kubernetes Services Rate Limits
The Kubernetes Services API itself does not impose request-per-second limits on data-plane traffic to a Service. Control-plane operations (creating, updating, listing Services through the kube-apiserver) are subject to the API server's API Priority and Fairness (APF) just like any other Kubernetes API resource. Data-plane throughput depends on the chosen Service type and the underlying cloud-provider load balancer.
Kubernetes Services Rate Limits is the machine-readable rate-limit profile for Kubernetes Services on the APIs.io network, conforming to the API Commons Rate Limits specification.
It captures 2 rate-limit definitions, measuring concurrent_requests and varies.
The profile also includes 4 backoff/retry policies defined and response codes documented for throttled and serviceUnavailable.
Tagged areas include Rate Limiting, Kubernetes, Networking, and Open Source.