Predibase Fine-Tuning API
Create and manage supervised fine-tuning and reinforcement fine-tuning (GRPO) jobs that train LoRA or Turbo LoRA adapters on top of open-source base models. Each job returns an adapter version that can be served.
Documentation
https://docs.predibase.com/user-guide/fine-tuning/overview
Documentation
https://docs.predibase.com/user-guide/fine-tuning/grpo