Discover and explore top open-source AI tools and projects—updated daily.
chunkhoundDeep code and file research engine for AI assistants
Top 95.8% on SourcePulse
This project addresses the challenge of making codebases deeply searchable for AI assistants by transforming them into knowledge bases. It targets engineers, researchers, and power users who need to understand complex code relationships and discover features semantically. ChunkHound offers a local-first, privacy-preserving solution that enhances AI-assisted code research and development.
How It Works
ChunkHound leverages the research-backed cAST (Chunking via Abstract Syntax Trees) algorithm for semantic code chunking, preserving code meaning through structure-aware parsing. It employs Multi-Hop Semantic Search to uncover interconnected code relationships beyond simple keyword matches, enabling natural language queries like "find authentication code" to discover related components. The system operates on a local-first architecture, ensuring code privacy and enabling offline use with local models. It supports structured parsing for 29 languages via Tree-sitter and custom parsers, providing consistent semantic understanding across diverse codebases.
Quick Start & Requirements
uv package manager. Install uv via curl -LsSf https://astral.sh/uv/install.sh | sh, then install ChunkHound with uv tool install chunkhound..chunkhound.json configuration file (e.g., specifying embedding provider and API key). Index your codebase using chunkhound index.chunkhound.github.io.Highlighted Details
Maintenance & Community
No specific details regarding maintainers, community channels (like Discord/Slack), or roadmap were provided in the README excerpt.
Licensing & Compatibility
Limitations & Caveats
Advanced semantic search capabilities require configuration with external API keys or a local Ollama setup. The README details complex exclusion and workspace overlay configurations that may require careful tuning. While benchmarks are cited, real-world performance may vary based on codebase size and complexity.
1 day ago
Inactive
yetone