Research agent for discovering relevant GitHub repositories
Top 45.8% on sourcepulse
DeepGit is an advanced agentic workflow for deep GitHub repository research, targeting developers and researchers seeking to discover relevant, under-the-radar open-source tools. It leverages a hybrid retrieval system with ColBERT v2 embeddings and cross-encoder re-ranking, enhanced by hardware-awareness and activity analysis, to provide nuanced and contextually relevant recommendations.
How It Works
DeepGit employs a Langgraph-based agentic workflow. It begins with query expansion and hardware specification detection. A ColBERT-v2 retriever performs token-level semantic search, followed by a MiniLM-L6-v2 cross-encoder for passage-level re-ranking. A hardware-aware filter then discards incompatible repositories based on detected hardware constraints. Finally, community and code activity metrics are analyzed, and a multi-factor ranking is presented to the user. This approach enables fine-grained similarity matching and ensures discovered tools are practically usable on the user's hardware.
Quick Start & Requirements
pip install -r requirements.txt
after cloning the repository.python app.py
.langgraph dev
.Highlighted Details
Maintenance & Community
The project is open-source and actively developed. Links to community resources or roadmaps are not explicitly provided in the README.
Licensing & Compatibility
The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.
Limitations & Caveats
The "Lite" version running on Hugging Face Spaces may not perform as well as the full version. The README does not specify the license, which could impact commercial adoption.
4 weeks ago
1 day