CyberMagician
Interactive 3D RAG memory visualizer
Project Golem provides a 3D interface for visualizing Retrieval-Augmented Generation (RAG) memory structures in real time. It is designed for AI researchers and developers who need to understand and debug the semantic relationships within their RAG systems. It shows how concepts are associated and queried, which makes retrieval behavior easier to interpret.
How It Works
The project visualizes high-dimensional embeddings (768 dimensions) by projecting them into an interactive 3D space with UMAP. It uses Google's embedding-gemma-300m for vectorization and LanceDB or local NumPy arrays for vector storage and fast retrieval. The frontend, built with Three.js and WebGL, renders this "cortex" and dynamically highlights the specific "neural pathways" related to a user query, enabling visual debugging of concept association.
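The retrieval step that decides which points to highlight can be sketched as a brute-force cosine-similarity search over a NumPy array. This is a minimal illustration, not the project's actual code: the function name `top_k_neighbors` is hypothetical, and random vectors stand in for real embeddings (in the project, the array would presumably be loaded from a file such as golem_vectors.npy).

```python
import numpy as np


def top_k_neighbors(vectors: np.ndarray, query: np.ndarray, k: int = 5) -> np.ndarray:
    """Return indices of the k stored vectors most similar to the query (cosine)."""
    # Normalize rows so that a plain dot product equals cosine similarity.
    unit_vectors = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    unit_query = query / np.linalg.norm(query)
    scores = unit_vectors @ unit_query
    # argsort is ascending, so take the last k entries and reverse for best-first order.
    return np.argsort(scores)[-k:][::-1]


# Stand-in data (assumption): 100 random 768-dimensional vectors, not real embeddings.
rng = np.random.default_rng(0)
vectors = rng.normal(size=(100, 768))

# A query that is a slightly perturbed copy of stored item 42.
query = vectors[42] + 0.01 * rng.normal(size=768)

# Indices a frontend could highlight; the nearest one should be item 42.
highlight = top_k_neighbors(vectors, query, k=5)
print(highlight)
```

The returned indices would map one-to-one onto the rows of the 3D layout, so a frontend only needs the index list to know which points to light up. For large collections, a vector store such as LanceDB would replace the brute-force scan.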
Quick Start & Requirements
Install dependencies: pip install -r requirements.txt
Run python ingest.py to scrape Wikipedia, vectorize the data, and generate golem_cortex.json and golem_vectors.npy. This step requires a GPU for reasonable performance.
Run python GolemServer.py to start the backend server.
Open http://localhost:8000 in a browser.
Edit TARGETS in ingest.py to point to custom datasets (PDFs, Obsidian vaults). Integration with external vector DBs like Qdrant/Pinecone is possible by fetching vectors, applying UMAP, and modifying server.py.
Highlighted Details
Vectorization with embedding-gemma-300m via sentence-transformers.
Maintenance & Community
No specific details on maintainers, community channels (like Discord/Slack), sponsorships, or roadmap were found in the provided README.
Licensing & Compatibility
The README does not specify a software license. Anyone planning to adopt the project should clarify the licensing terms first, particularly for commercial use or integration into closed-source projects.
Limitations & Caveats
Project Golem is described as an "experiment." The ingestion process requires a GPU for acceptable speed. The absence of a specified license is a significant caveat for potential adopters.