vLLM · Pricing Plans

Vllm Plans Pricing

vLLM is free open-source software (Apache 2.0). The project does not sell hosting. Cost is incurred entirely on your own GPU infrastructure (cloud or on-prem). Several third parties (RunPod, Modal, Anyscale, Baseten, etc.) offer managed vLLM hosting, billed by them — not by the vLLM project.

Vllm Plans Pricing is the machine-readable pricing-plan profile for vLLM on the APIs.io network, conforming to the API Commons Plans specification.

It defines 1 plan, covering free tiers, with named plans including Self-Hosted (Apache 2.0).

Tagged areas include LLM, Inference, Open Source, GPU, and OpenAI Compatible.

1 Plans API Commons Plans
View Source
LLMInferenceOpen SourceGPUOpenAI CompatibleSelf-HostedPlans

Plans

Self-Hosted (Apache 2.0) free

Run vLLM on your own GPU infrastructure under Apache 2.0.

Self-Host (deployment · lifetime) 0 USD

Sources