mav  by attentionmech

LLM internal activity visualizer

Created 5 months ago
519 stars

Top 60.6% on SourcePulse

GitHubView on GitHub
Project Summary

MAV (Model Activity Visualiser) provides a real-time, interactive visualization of Large Language Model (LLM) internal states during text generation. It is designed for researchers and developers seeking to understand and debug LLM behavior, offering insights into attention mechanisms, token predictions, and generated text.

How It Works

MAV leverages the Hugging Face transformers library to load and run various LLM architectures. It visualizes key internal states such as attention entropy and top token predictions, alongside the generated text itself. The tool is built with a plugin architecture, allowing for custom visualizations and integration into training loops, enhancing interpretability.

Quick Start & Requirements

  • Install: uv run --with openmav mav --model gpt2 --prompt "hello mello" or pip install openmav.
  • Prerequisites: Python, Hugging Face transformers compatible models (e.g., gpt2, meta-llama/Llama-3.2-1B). For gated models, huggingface-cli login is required.
  • Documentation: documentation.md

Highlighted Details

  • Real-time visualization of LLM internal states.
  • Supports various Hugging Face models and sampling parameters.
  • Plugin architecture for custom visualizations and training loop integration.
  • Command-line interface for easy experimentation.

Maintenance & Community

The project is maintained by "attentionmech". Further community or contribution details are not explicitly provided in the README.

Licensing & Compatibility

The project appears to be licensed under the MIT License, allowing for broad use and modification, including commercial applications.

Limitations & Caveats

The README mentions a citation with a future year (2025), suggesting the project may be in early development or pre-publication. Specific performance benchmarks or detailed compatibility matrices are not provided.

Health Check
Last Commit

5 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
3 stars in the last 30 days

Explore Similar Projects

Starred by Jiayi Pan Jiayi Pan(Author of SWE-Gym; MTS at xAI), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
3 more.

unified-io-2 by allenai

0.3%
626
Unified-IO 2 code for training, inference, and demo
Created 1 year ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Chris Van Pelt Chris Van Pelt(Cofounder of Weights & Biases), and
3 more.

lida by microsoft

0.1%
3k
Library for LLM-driven data visualization and infographic generation
Created 2 years ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Pietro Schirano Pietro Schirano(Founder of MagicPath), and
1 more.

SillyTavern by SillyTavern

1.1%
18k
LLM frontend for power users
Created 2 years ago
Updated 1 day ago
Feedback? Help us improve.