UpTrain
UpTrain is an open-source (Apache-2.0) unified platform to evaluate and improve generative AI and LLM applications. It ships a Python framework plus a managed evaluation API that grades responses against 20+ preconfigured checks - context relevance, factual accuracy, response completeness, hallucination, tonality, prompt injection and more - and performs root cause analysis on failure cases.
APIs
UpTrain Evaluations API
Runs evaluations (POST /evaluate) on supplied LLM input/output/context rows against a list of named checks such as context_relevance, factual_accuracy, response_completeness and...
UpTrain Log and Evaluate API
Logs evaluation data under a named project and evaluates it in one call (POST /log_and_evaluate), persisting results so they appear on the managed UpTrain dashboard with real-ti...
UpTrain Root Cause Analysis API
Performs root cause analysis (POST /perform_root_cause_analysis) on failing RAG or LLM responses, classifying why a response was poor - e.g. incomplete context, poor retrieval, ...
UpTrain Runs and Datasets API
Manages evaluation datasets, checksets (reusable bundles of checks), and runs that pair a dataset with a checkset - create a run (POST /run), poll its status (GET /run/{run_id})...