AgentGateway
AgentGateway is an open-source, AI-native proxy and gateway for routing, observing, and governing traffic to and from AI agents, LLM providers, and MCP servers. Built on the A2A and MCP protocols, it provides a unified gateway for LLM consumption, MCP tool federation, agent-to-agent communication, security, and observability. AgentGateway supports multi-provider LLM routing across OpenAI, Anthropic, Google Gemini, AWS Bedrock, and Azure OpenAI with built-in RBAC, JWT authentication, rate limiting, and OpenTelemetry integration.
APIs
AgentGateway
AgentGateway provides AI-native gateway capabilities for routing LLM traffic, federating MCP tools, enabling agent-to-agent communication, and applying security and observability policies.
Features
Routes traffic to OpenAI, Anthropic, Google Gemini, AWS Bedrock, and Azure OpenAI through a unified API with model aliasing, failover, and load balancing.
Connects LLMs to tools via Model Context Protocol with static and dynamic routing, tool federation, and stateful MCP sessions.
Enables secure, governed communication between AI agents using the A2A protocol for multi-agent orchestration.
Intelligently routes requests to self-hosted models based on GPU utilization and request priority.
Provides JWT, OAuth2, API key management, CORS, CSRF protection, MCP authentication, and external authorization support.
Supports request routing and matching, header manipulation, rate limiting, retries, gRPC routing, traffic splitting, and direct responses.
Integrates with OpenTelemetry for metrics, traces, and access logging with a built-in Admin UI and debugging tools.
Applies prompt guards, content filtering, regex filters, moderation policies, and custom webhooks for AI safety.
Tracks budget and spend limits per user, team, or application with RBAC-based controls on LLM consumption.
Supports prompt templates and enrichment for standardizing and augmenting requests before routing to LLM providers.
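The model aliasing and failover behavior above can be sketched in a few lines. This is a hypothetical illustration of the concept, not AgentGateway's actual configuration schema or code: the provider names, alias table, and call signature are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Provider:
    name: str
    call: Callable[[str, str], str]  # (model, prompt) -> completion text

def route_with_failover(alias, prompt, aliases):
    """Resolve a model alias to an ordered list of (provider, model)
    pairs and fail over to the next pair when a call raises."""
    last_error = None
    for provider, model in aliases[alias]:
        try:
            return provider.call(model, prompt)
        except Exception as err:  # backend failed: try the next one
            last_error = err
    raise RuntimeError(f"all providers failed for alias {alias!r}") from last_error

def primary_call(model, prompt):
    raise TimeoutError("backend unavailable")  # simulated outage

def fallback_call(model, prompt):
    return f"[{model}] ok"

# The alias 'fast-chat' maps to a primary and a fallback backend;
# clients only ever see the alias, so failover needs no client changes.
table = {
    "fast-chat": [
        (Provider("openai", primary_call), "gpt-4o-mini"),
        (Provider("anthropic", fallback_call), "claude-3-haiku"),
    ]
}

print(route_with_failover("fast-chat", "hello", table))  # → [claude-3-haiku] ok
```

Because clients address the alias rather than a concrete provider, the gateway can reorder, replace, or load-balance the backends behind it without any client-side change.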
Use Cases
Route requests across multiple LLM providers with a single API, enabling failover, load balancing, and cost optimization without changing client code.
Aggregate tools from multiple MCP servers behind a single gateway endpoint, enabling agents to discover and invoke tools from any connected MCP server.
Apply organization-wide security policies, rate limits, budget controls, and content filters to all AI agent traffic through a centralized gateway.
Convert existing REST APIs into MCP-native tool endpoints that AI agents can discover and invoke through the Model Context Protocol.
Enable secure agent-to-agent communication using the A2A protocol, allowing specialized agents to delegate tasks to each other through the gateway.
Collect unified telemetry across all AI agent and LLM interactions to monitor cost, latency, and behavior at scale.
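The tool-federation use case can be sketched as a simple aggregation step. This is an illustrative sketch only: the server labels and tool names are made up, and real MCP servers exchange tools over JSON-RPC, which is elided here.

```python
def federate(servers):
    """Aggregate tool lists from several MCP servers behind one
    endpoint, prefixing each tool with its server name to avoid
    collisions, and remember where each federated tool lives."""
    catalog = {}
    for server, tools in servers.items():
        for tool in tools:
            catalog[f"{server}.{tool}"] = (server, tool)
    return catalog

catalog = federate({
    "github": ["create_issue", "search_code"],
    "jira": ["create_issue"],
})
# Both servers expose create_issue; prefixing keeps them distinct,
# and the catalog records which server to dispatch each call to.
print(sorted(catalog))
```

An agent querying the gateway sees one flat tool catalog; the gateway uses the stored (server, tool) pair to dispatch each invocation to the right upstream MCP server.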
Integrations
Route to OpenAI GPT models through the AgentGateway LLM backend with model aliasing and budget controls.
Connect to Anthropic Claude models via the unified LLM gateway with failover and load balancing.
Route traffic to Google Gemini models through the AgentGateway multi-provider backend.
Integrate with AWS Bedrock for managed LLM access via the AgentGateway routing layer.
Route requests to Azure-hosted OpenAI models through the unified gateway API.
Connect to locally hosted Ollama models for self-hosted inference routing.
Route to vLLM inference servers with GPU utilization-aware routing for optimal performance.
Export metrics, traces, and logs to any OpenTelemetry-compatible observability backend.
Deploy and configure AgentGateway on Kubernetes using the standard Gateway API for dynamic configuration.
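A Kubernetes deployment via the standard Gateway API might look like the following sketch. The Gateway and HTTPRoute resources are standard Gateway API kinds, but the gatewayClassName, route path, and backend service name here are assumptions for illustration; consult the AgentGateway documentation for the actual values.

```yaml
# Hypothetical sketch: gatewayClassName and backend names are assumed.
apiVersion: gateway.networking.k8s.io/v1
kind: Gateway
metadata:
  name: agentgateway
spec:
  gatewayClassName: agentgateway   # assumed class name
  listeners:
  - name: http
    protocol: HTTP
    port: 8080
---
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: llm-route
spec:
  parentRefs:
  - name: agentgateway
  rules:
  - matches:
    - path:
        type: PathPrefix
        value: /v1/chat          # assumed route prefix
    backendRefs:
    - name: llm-backend          # assumed backend Service
      port: 80
```

Using the Gateway API means routes, listeners, and backends are reconfigured dynamically through standard Kubernetes resources rather than gateway-specific config files.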