Triton Plans Pricing
NVIDIA Triton Inference Server is open-source software (BSD-3-Clause) that customers self-host on their own CPUs / GPUs. There is no per-API call price from NVIDIA for the server itself; commercial entitlements (support, indemnification, cloud images) ship via the separate NVIDIA AI Enterprise subscription.
Triton Plans Pricing is the machine-readable pricing-plan profile for Triton Inference Server on the APIs.io network, conforming to the API Commons Plans specification.
It defines 2 plans, covering freemium and enterprise tiers, with named plans including Open Source, NVIDIA AI Enterprise (Optional Support).
Tagged areas include AI, Inference, Open Source, and Model Serving.
Plans
Self-hosted, freely redistributable Triton Inference Server. Cost to the user is compute (CPU / GPU) and operational overhead, not a license fee.
- BSD-3-Clause License
- Self-Hosted
- Community Support (GitHub Discussions)
- KServe V2 Inference Protocol
Optional commercial support, security patching, and indemnification for Triton delivered through the NVIDIA AI Enterprise subscription. Pricing is tied to the AI Enterprise product, not Triton calls.
- Enterprise Support
- Security Patching
- Indemnification