Predibase Deployments API
Create, read, update, and delete dedicated and private serverless deployments. Each deployment selects a base model and a GPU accelerator (A10 or A100) and can enable LoRA serving for fine-tuned adapters.
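As a rough illustration of the choices named in the description (base model, accelerator, LoRA serving), the sketch below assembles and validates a deployment specification before it would be sent to the API. The class `DeploymentSpec`, its field names, the lowercase accelerator spelling, and the example model name are all hypothetical and are not taken from the Predibase schema; consult the linked documentation for the actual request format.

```python
from dataclasses import dataclass, asdict

# Accelerators named in the description; the lowercase spelling is an assumption.
SUPPORTED_ACCELERATORS = {"a10", "a100"}


@dataclass
class DeploymentSpec:
    """Hypothetical deployment description; not the official Predibase schema."""

    name: str
    base_model: str
    accelerator: str
    lora_enabled: bool = True  # serve fine-tuned LoRA adapters on the base model

    def to_payload(self) -> dict:
        """Validate the spec and return a JSON-serializable dict."""
        if not self.name:
            raise ValueError("deployment name must not be empty")
        accel = self.accelerator.lower()
        if accel not in SUPPORTED_ACCELERATORS:
            raise ValueError(f"unsupported accelerator: {self.accelerator!r}")
        payload = asdict(self)
        payload["accelerator"] = accel  # normalize to the lowercase form
        return payload


# Example values are placeholders chosen for illustration only.
spec = DeploymentSpec(name="my-deployment", base_model="mistral-7b", accelerator="A100")
payload = spec.to_payload()
```

Validating locally before any network call keeps an unsupported accelerator from surfacing only as a server-side error; the payload itself would still need to be mapped onto whatever fields the real API expects.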
Documentation
Dedicated deployments: https://docs.predibase.com/user-guide/inference/dedicated_deployments
Private deployments: https://docs.predibase.com/user-guide/inference/private_deployments