Lamini
Lamini is an enterprise LLM platform for fine-tuning, tuning, and serving custom open models. Its Memory Tuning approach embeds factual recall into models to reduce hallucination, and the platform exposes a REST API for inference (completions), fine-tuning jobs, classification, and embeddings over open base models, deployable in Lamini's cloud, on-demand GPU cluster, or on-premises.
APIs
Lamini Inference Completions API
Generate text completions from open base or tuned models via POST /v1/completions, with structured (typed) output via output_type, plus streaming completions at /v3/streaming_co...
Lamini Fine-Tuning & Memory Tuning API
Submit and manage tuning jobs against open base models via POST /v1/train, supporting full fine-tuning and Lamini Memory Tuning (train_type), with job listing, status, cancel, a...
Lamini Classify API
Run LLM-based text classification against a trained classifier model via POST /v1/classifier/{model_id}/classification for scored labels, or /v1/classifier/{model_id}/prediction...
Lamini Embeddings API
Encode one or more text prompts into embedding vectors via POST /v1/embedding for similarity search, retrieval, and Memory RAG indexing workflows.