trulens  by truera

Evaluation and tracking tool for LLM experiments and AI agents

Created 5 years ago
2,884 stars

Top 16.5% on SourcePulse

GitHubView on GitHub
Project Summary

TruLens provides systematic evaluation and tracking for Large Language Model (LLM) applications and AI agents, enabling developers to understand and improve performance. It targets developers building LLM-powered applications, offering fine-grained, stack-agnostic instrumentation and comprehensive evaluations to identify failure modes.

How It Works

TruLens instruments LLM applications to log prompts, models, retrievers, and knowledge sources. It allows users to define custom feedback functions and evaluations that run alongside the application, facilitating systematic iteration and comparison of different app versions through a user interface.

Quick Start & Requirements

  • Primary install: pip install trulens
  • Prerequisites: Python. No specific hardware or GPU requirements are mentioned for basic installation.
  • Links: Contributing Guide, Discourse Community

Highlighted Details

  • Stack-agnostic instrumentation for LLM applications.
  • Supports evaluation of RAG (Retrieval-Augmented Generation) systems.
  • Enables definition of custom feedback functions and evaluations.
  • Provides a user interface for comparing app versions.

Maintenance & Community

The project encourages community contributions and provides a Discourse forum for discussion. A GitHub star is requested as a form of support.

Licensing & Compatibility

The README does not explicitly state the license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The README focuses on core functionality and does not detail limitations, unsupported platforms, or potential caveats regarding stability or advanced features.

Health Check
Last Commit

4 days ago

Responsiveness

1 week

Pull Requests (30d)
23
Issues (30d)
3
Star History
73 stars in the last 30 days

Explore Similar Projects

Starred by Luis Capelo Luis Capelo(Cofounder of Lightning AI), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
6 more.

opik by comet-ml

1.2%
15k
Open-source LLM evaluation framework for RAG, agents, and more
Created 2 years ago
Updated 4 hours ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Yiran Wu Yiran Wu(Coauthor of AutoGen), and
13 more.

openai-agents-python by openai

1.0%
17k
Python SDK for multi-agent workflows
Created 7 months ago
Updated 5 hours ago
Feedback? Help us improve.