AI Rate Limiting Advanced Plugin

Token-aware rate limiting tailored for LLM traffic, with per-consumer and per-model budgets rather than just request counts.

API entry from apis.yml

apis.yml Raw ↑
aid: kong-ai-gateway:ai-rate-limiting-advanced-plugin
name: AI Rate Limiting Advanced Plugin
description: Token-aware rate limiting tailored for LLM traffic, with per-consumer and per-model budgets
  rather than just request counts.
humanURL: https://developer.konghq.com/plugins/ai-rate-limiting-advanced/
tags:
- Plugin
- Rate Limiting
- Token Budget
properties:
- type: Documentation
  url: https://developer.konghq.com/plugins/ai-rate-limiting-advanced/