trulens  by truera

Evaluation and tracking tool for LLM experiments and AI agents

Created 4 years ago
2,785 stars

Top 17.1% on SourcePulse

GitHubView on GitHub
Project Summary

TruLens provides systematic evaluation and tracking for Large Language Model (LLM) applications and AI agents, enabling developers to understand and improve performance. It targets developers building LLM-powered applications, offering fine-grained, stack-agnostic instrumentation and comprehensive evaluations to identify failure modes.

How It Works

TruLens instruments LLM applications to log prompts, models, retrievers, and knowledge sources. It allows users to define custom feedback functions and evaluations that run alongside the application, facilitating systematic iteration and comparison of different app versions through a user interface.

Quick Start & Requirements

  • Primary install: pip install trulens
  • Prerequisites: Python. No specific hardware or GPU requirements are mentioned for basic installation.
  • Links: Contributing Guide, Discourse Community

Highlighted Details

  • Stack-agnostic instrumentation for LLM applications.
  • Supports evaluation of RAG (Retrieval-Augmented Generation) systems.
  • Enables definition of custom feedback functions and evaluations.
  • Provides a user interface for comparing app versions.

Maintenance & Community

The project encourages community contributions and provides a Discourse forum for discussion. A GitHub star is requested as a form of support.

Licensing & Compatibility

The README does not explicitly state the license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The README focuses on core functionality and does not detail limitations, unsupported platforms, or potential caveats regarding stability or advanced features.

Health Check
Last Commit

1 day ago

Responsiveness

1 week

Pull Requests (30d)
49
Issues (30d)
5
Star History
75 stars in the last 30 days

Explore Similar Projects

Starred by Luis Capelo Luis Capelo(Cofounder of Lightning AI), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
6 more.

opik by comet-ml

1.7%
14k
Open-source LLM evaluation framework for RAG, agents, and more
Created 2 years ago
Updated 12 hours ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Yiran Wu Yiran Wu(Coauthor of AutoGen), and
12 more.

openai-agents-python by openai

1.6%
15k
Python SDK for multi-agent workflows
Created 6 months ago
Updated 18 hours ago
Feedback? Help us improve.