Lamini · Pricing Plans

Lamini Plans Pricing

Lamini offers a self-service, pay-as-you-go tier (Lamini On-Demand) for running inference and tuning jobs on its hosted GPU cluster, plus a custom Enterprise tier for dedicated, reserved, or on-premises deployments. On-Demand bills a flat per-token inference rate and a per-tuning-step rate; new accounts receive free credits to start. Enterprise pricing is negotiated with sales.

Lamini Plans Pricing is the machine-readable pricing-plan profile for Lamini on the APIs.io network, conforming to the API Commons Plans specification.

It defines 2 plans, covering usage and enterprise tiers, with named plans including Lamini On-Demand, Enterprise.

Tagged areas include AI, LLM, Fine-Tuning, Memory Tuning, and Inference.

2 Plans API Commons Plans
View Source
AILLMFine-TuningMemory TuningInferencePlans

Plans

Lamini On-Demand usage

Self-service, pay-as-you-go inference and tuning on Lamini's hosted GPU cluster, metered per token and per tuning step, with free starting credit.

Inference Tokens (tokens · month) $0.50 per 1M tokens (input, output, and JSON output) USD
Tuning Step (steps · month) ~$1 per tuning step on 1 GPU (linear multiplier for burst/multi-GPU) USD
Free Credit (usd · lifetime) $300 in free credit for new accounts USD
Enterprise enterprise

Dedicated, reserved-capacity, VPC, or on-premises deployments with volume commitments, security and compliance controls, and dedicated support. Contact Lamini sales.

Enterprise Agreement (contract · year) contact sales USD

Sources