Lamini · Pricing Plans

Lamini Plans Pricing

Name: Lamini Plans Pricing
Creator: Lamini
Keywords: AI, LLM, Fine-Tuning, Memory Tuning, Inference, Plans

Lamini offers a self-service, pay-as-you-go tier (Lamini On-Demand) for running inference and tuning jobs on its hosted GPU cluster, plus a custom Enterprise tier for dedicated, reserved, or on-premises deployments. On-Demand bills a flat per-token inference rate and a per-tuning-step rate; new accounts receive free credits to start. Enterprise pricing is negotiated with sales.

Lamini Plans Pricing is the machine-readable pricing-plan profile for Lamini on the APIs.io network, conforming to the API Commons Plans specification.

It defines two plans: Lamini On-Demand, a usage-based tier, and Enterprise, a contract-based tier.

Tagged areas include AI, LLM, Fine-Tuning, Memory Tuning, and Inference.

Plans

Lamini On-Demand (usage)

Self-service, pay-as-you-go inference and tuning on Lamini's hosted GPU cluster, metered per token and per tuning step, with free starting credit.

Inference Tokens (unit: tokens, period: month): $0.50 USD per 1M tokens (input, output, and JSON output)

Tuning Step (unit: steps, period: month): ~$1 USD per tuning step on 1 GPU (linear multiplier for burst/multi-GPU)

Free Credit (unit: USD, period: lifetime): $300 USD in free credit for new accounts

Inference Completions
Memory Tuning
Fine-Tuning
Classify
Embeddings
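
The On-Demand rates listed above lend themselves to a quick cost estimate. The following is a minimal sketch, not an official Lamini calculator: the function names are hypothetical, the tuning rate is listed only as approximate (~$1), and two assumptions are made that the listing does not confirm. First, "linear multiplier" is read as cost scaling directly with GPU count. Second, the free credit is assumed to apply to inference and tuning charges alike.

```python
# Rates copied from the On-Demand plan listing; treat them as a snapshot.
INFERENCE_USD_PER_MILLION_TOKENS = 0.50
TUNING_USD_PER_STEP_PER_GPU = 1.00  # listed as approximate ("~$1")
FREE_CREDIT_USD = 300.00


def inference_cost(tokens: int) -> float:
    """Cost of inference tokens (input, output, and JSON output share one rate)."""
    return tokens / 1_000_000 * INFERENCE_USD_PER_MILLION_TOKENS


def tuning_cost(steps: int, gpus: int = 1) -> float:
    """Cost of tuning steps.

    Assumption: the "linear multiplier" for burst/multi-GPU jobs means
    the per-step rate is multiplied by the number of GPUs.
    """
    return steps * gpus * TUNING_USD_PER_STEP_PER_GPU


def estimate(tokens: int, steps: int, gpus: int = 1,
             credit_remaining: float = FREE_CREDIT_USD) -> dict:
    """Combine both charges and subtract any remaining free credit.

    Assumption: the free credit applies to inference and tuning alike.
    """
    total = inference_cost(tokens) + tuning_cost(steps, gpus)
    credit_used = min(total, credit_remaining)
    return {
        "total": total,
        "credit_used": credit_used,
        "amount_due": total - credit_used,
    }


if __name__ == "__main__":
    # 200M inference tokens plus a 150-step tuning job on 2 GPUs.
    print(estimate(tokens=200_000_000, steps=150, gpus=2))
```

Under these assumptions, 200M tokens cost $100 and a 150-step job on 2 GPUs costs about $300, so a new account with the full $300 credit would owe roughly $100.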

Enterprise (enterprise)

Dedicated, reserved-capacity, VPC, or on-premises deployments with volume commitments, security and compliance controls, and dedicated support. Contact Lamini sales.

Enterprise Agreement (unit: contract, period: year): contact sales for pricing (USD)

Reserved / Dedicated GPU Capacity
On-Premises Deployment
Volume Pricing
Dedicated Support
