langkit  by whylabs

Open-source toolkit for monitoring LLMs

Created 2 years ago
947 stars

Top 38.7% on SourcePulse

GitHubView on GitHub
Project Summary

LangKit is an open-source toolkit designed for monitoring Large Language Models (LLMs) by extracting key signals from prompts and responses. It targets ML engineers and researchers working with LLMs in production, providing observability into text quality, relevance, security, and sentiment to mitigate risks associated with unpredictable model behavior.

How It Works

LangKit integrates seamlessly with the whylogs data logging library, offering User-Defined Functions (UDFs) that automatically enhance text feature logging. It employs a modular approach, allowing users to select specific metric categories like text quality (readability, complexity), relevance (similarity to themes), security (jailbreaks, prompt injection, hallucinations, refusals), and sentiment/toxicity. This design facilitates granular control over observability and simplifies the integration of LLM-specific metrics into existing ML observability pipelines.

Quick Start & Requirements

Highlighted Details

  • Offers metrics for text quality, relevance, security (jailbreaks, prompt injection, hallucinations, refusals), and sentiment/toxicity.
  • Benchmarks show significantly higher throughput for "LLM metrics" and "All metrics" on GPU instances (g4dn.xlarge) compared to CPU (c5.xlarge).
  • Designed for integration with the whylogs observability library.

Maintenance & Community

  • Developed by WhyLabs.
  • Community links are not explicitly provided in the README.

Licensing & Compatibility

  • License: Apache 2.0.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The README indicates a substantial performance drop when enabling "All metrics" on CPU instances, suggesting a strong dependency on GPU acceleration for comprehensive monitoring. Throughput for "All metrics" on a c5.xlarge instance is as low as 0.28 chats/sec.

Health Check
Last Commit

10 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
10 stars in the last 30 days

Explore Similar Projects

Starred by Han Wang Han Wang(Cofounder of Mintlify), John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), and
6 more.

evidently by evidentlyai

0.3%
7k
Open-source framework for ML/LLM observability
Created 4 years ago
Updated 13 hours ago
Starred by Gregor Zunic Gregor Zunic(Cofounder of Browser Use), Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), and
14 more.

openllmetry by traceloop

0.4%
6k
Open-source observability SDK for LLM applications
Created 2 years ago
Updated 13 hours ago
Starred by Luis Capelo Luis Capelo(Cofounder of Lightning AI), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
6 more.

opik by comet-ml

1.7%
14k
Open-source LLM evaluation framework for RAG, agents, and more
Created 2 years ago
Updated 12 hours ago
Feedback? Help us improve.