neuronpedia by hijohnnylin

Open-source interpretability platform for neural networks

Created 2 years ago

510 stars

Top 61.1% on SourcePulse

Project Summary

Neuronpedia is an open-source platform for interpreting neural network features, targeting researchers and developers. It provides tools for exploring activations, steering models, and automatically generating explanations, enabling deeper understanding of AI behavior.

How It Works

Neuronpedia employs a microservices architecture, separating the webapp, database, inference, and auto-interpretation services. This modular design allows for independent development and extensibility, enabling users to swap components or run services individually. It leverages OpenAPI schemas for typed communication between services and generates clients for seamless integration.

Quick Start & Requirements

Instant Deploy: Deploy a custom instance via Vercel (requires a free Vercel account).
Local Demo:
- Install Docker.
- Run make init-env, make webapp-demo-build, make webapp-demo-run.
- Access at localhost:3000. Connects to public demo data (GPT-2 small, Gemma-2-2b) and inference servers.
Local Development: Requires Node.js. Run make install-nodejs, make webapp-localhost-install, make webapp-localhost-dev.
Inference Server: Requires Poetry. Build with make inference-localhost-build-gpu (CUDA) or make inference-localhost-build (no CUDA). Run with make inference-localhost-dev-gpu or make inference-localhost-dev, specifying MODEL_SOURCESET.
Auto-Interp Server: Requires Poetry. Build with make autointerp-localhost-build-gpu (CUDA) or make autointerp-localhost-build (no CUDA). Run with make autointerp-localhost-dev-gpu or make autointerp-localhost-dev.
Hardware: At least 16GB RAM recommended. CUDA is required for the embedding scorer in the auto-interp server.

Highlighted Details

Supports multiple models including GPT-2, Gemma, and DeepSeek.
Integrates with SAELens and SAEDashboard for custom data generation and visualization.
Features a search explanation capability requiring an OpenAI API key for semantic similarity.
Monorepo structure with clear separation of concerns for webapp, inference, and auto-interp services.

Maintenance & Community

Community support via Slack (#neuronpedia).
Contact: johnny@neuronpedia.org.
Contributions welcome via CONTRIBUTING.md.

Licensing & Compatibility

License not explicitly stated in the provided README text.

Limitations & Caveats

The local demo database is read-only; a local database setup is required for writing data.
The auto-interp server's embedding scorer requires CUDA and does not support Mac MPS or CPU.
Some sections like "high volume autointerp explanations" and "generate your own dashboards/data" are marked as "under construction."
Data import into the local admin panel is finicky and lacks resume functionality.

Health Check

Last Commit

3 days ago

Responsiveness

Inactive

Pull Requests (30d)

2

Issues (30d)

4

Star History

44 stars in the last 30 days

Explore Similar Projects

awesome-open-source-ai by suncloudsmoon

Curated list of open-source AI resources

Created 11 months ago

Updated 10 months ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

interpretability-literature by amarasovic

Curated list of interpretability research papers

Created 6 years ago

Updated 6 years ago

OpenXAI by AI4LIFE-GROUP

Library for transparent evaluation of AI model explanations

Created 3 years ago

Updated 1 year ago

xplique by deel-ai

Explainability toolbox for neural networks

Created 5 years ago

Updated 2 days ago

interpret-text by interpretml

SDK for explaining text-based ML models with a visualization dashboard

Created 6 years ago

Updated 1 year ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect),

Wing Lian

Wing Lian(Founder of Axolotl AI), and

2 more.

ThoughtSource by OpenBioLink

Framework for chain-of-thought reasoning data and tools

Created 3 years ago

Updated 11 months ago

Quantus by understandable-machine-intelligence-lab

XAI toolkit for evaluating neural network explanations

Created 4 years ago

Updated 4 months ago

auto-prompt by AIDotNet

AI prompt engineering platform

Created 5 months ago

Updated 3 weeks ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Chaoyu Yang

Chaoyu Yang(Founder of Bento), and

1 more.

OmniXAI by salesforce

Python library for explainable AI (XAI)

Created 3 years ago

Updated 1 year ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Gabriel Almeida

Gabriel Almeida(Cofounder of Langflow), and

5 more.

lit by PAIR-code

Interactive ML model analysis tool for understanding model behavior

Created 5 years ago

Updated 4 days ago

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind),

Maxime Labonne

Maxime Labonne(Head of Post-Training at Liquid AI), and

9 more.

argilla by argilla-io

Collaboration tool for building high-quality AI datasets

Created 4 years ago

Updated 6 days ago

Starred by

Didier Lopes

Didier Lopes(Founder of OpenBB),

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind), and

3 more.

open-deep-research by nickscamara

Open-source AI agent for web research

Created 10 months ago

Updated 6 months ago

Feedback? Help us improve.