uptrain  by uptrain-ai

Open-source platform to evaluate and improve GenAI apps

created 2 years ago
2,304 stars

Top 20.2% on sourcepulse

GitHubView on GitHub
Project Summary

UpTrain is an open-source platform designed to evaluate and enhance Generative AI applications. It offers over 20 pre-configured checks for language, code, and embeddings, performs root cause analysis on failures, and provides actionable insights for improvement, targeting developers and researchers working with LLMs.

How It Works

UpTrain utilizes an LLM-as-a-judge approach for evaluations, allowing customization of evaluation methods, few-shot examples, and scenario descriptions. It supports various LLM providers (OpenAI, Anthropic, Mistral, Azure, Anyscale) and embedding models, running analyses locally for data privacy. Root cause analysis helps pinpoint issues within the LLM pipeline based on negative feedback or low evaluation scores.

Quick Start & Requirements

  • Dashboard: Clone the repository and run bash run_uptrain.sh. Requires Docker.
  • Package: pip install uptrain.
  • Evaluations: Requires an OpenAI API key for model grading checks.
  • Resources: Local dashboard runs on your machine; no code required for dashboard use.
  • Documentation: How to evaluate your LLM application

Highlighted Details

  • Supports 20+ pre-configured evaluations including factual accuracy, response completeness, and prompt injection detection.
  • Local execution ensures data privacy, with only LLM calls leaving the secure environment.
  • Offers programmatic integration via a Python package and a local, code-free dashboard interface.
  • Enables customization of evaluation methods, few-shot examples, and custom evaluator creation.

Maintenance & Community

  • Active development with a roadmap for features like team collaboration and visualization.
  • Community support available via a Slack channel.
  • Direct contact with maintainers can be booked.

Licensing & Compatibility

  • Published under the Apache 2.0 license.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The UpTrain Dashboard is currently in Beta. Future features include embedding visualization, pattern recognition, and prompt improvement suggestions.

Health Check
Last commit

11 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
45 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.