Triton Inference Server · Pricing Plans

Triton Plans Pricing

NVIDIA Triton Inference Server is open-source software (BSD-3-Clause) that customers self-host on their own CPUs / GPUs. NVIDIA charges no per-API-call price for the server itself; commercial entitlements (support, indemnification, cloud images) ship via the separate NVIDIA AI Enterprise subscription.

Triton Plans Pricing is the machine-readable pricing-plan profile for Triton Inference Server on the APIs.io network, conforming to the API Commons Plans specification.

It defines two plans, covering the freemium and enterprise tiers: Open Source and NVIDIA AI Enterprise (Optional Support).

Tagged areas include AI, Inference, Open Source, and Model Serving.


Plans

Open Source · freemium tier

Self-hosted, freely redistributable Triton Inference Server. Cost to the user is compute (CPU / GPU) and operational overhead, not a license fee.

Software License (monthly): 0.00 USD

NVIDIA AI Enterprise (Optional Support) · enterprise tier

Optional commercial support, security patching, and indemnification for Triton delivered through the NVIDIA AI Enterprise subscription. Pricing is tied to the AI Enterprise product, not Triton calls.

Support Subscription (monthly): see NVIDIA AI Enterprise pricing (USD)
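
Because the profile is meant to be machine-readable, the two plans above could be modeled roughly as follows. This is only an illustrative sketch: the class and field names (`Plan`, `PlanEntry`, `period`, `note`) are hypothetical and are not taken from the API Commons Plans specification, and the values are just those listed on this page.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional


@dataclass(frozen=True)
class PlanEntry:
    """One priced line item in a plan (hypothetical shape, not the official schema)."""
    label: str
    period: str
    currency: str
    price: Optional[Decimal]  # None when the listing only points to external pricing
    note: str = ""


@dataclass(frozen=True)
class Plan:
    """A named plan with its tier and a single priced entry."""
    name: str
    tier: str
    entry: PlanEntry


# Values copied from the listing above; nothing here is an official price quote.
PLANS = [
    Plan(
        name="Open Source",
        tier="freemium",
        entry=PlanEntry("Software License", "month", "USD", Decimal("0.00")),
    ),
    Plan(
        name="NVIDIA AI Enterprise (Optional Support)",
        tier="enterprise",
        entry=PlanEntry(
            "Support Subscription", "month", "USD", None,
            note="see NVIDIA AI Enterprise pricing",
        ),
    ),
]


def describe(plan: Plan) -> str:
    """Render a plan as a one-line summary, falling back to the note when no price is listed."""
    entry = plan.entry
    amount = entry.note if entry.price is None else f"{entry.price} {entry.currency}"
    return f"{plan.name} [{plan.tier}]: {entry.label} per {entry.period}: {amount}"


if __name__ == "__main__":
    for plan in PLANS:
        print(describe(plan))
```

Keeping the price optional reflects that the enterprise plan has no figure on this page; a consumer would need to follow the NVIDIA AI Enterprise pricing reference rather than treat the missing amount as zero.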

Sources