Discover and explore top open-source AI tools and projects—updated daily.
glommerLocal code indexing and search for AI coding agents
New!
Top 97.6% on SourcePulse
A local, self-contained code indexing library and MCP server for AI coding agents, Codemogger addresses the need for efficient code understanding. It parses source files using tree-sitter, semantically chunks them into logical units, and stores these along with local embeddings in a single SQLite database. This enables AI agents to perform fast, precise keyword searches and nuanced semantic queries without relying on external servers or API keys, streamlining codebase navigation and comprehension.
How It Works
The system scans codebases, respecting .gitignore, and leverages tree-sitter (WASM) to generate Abstract Syntax Trees (ASTs) for semantic chunking of definitions like functions, structs, and classes. These chunks are then encoded using a local embedding model (defaulting to all-MiniLM-L6-v2) and stored in an embedded SQLite database. This database integrates Full-Text Search (FTS) for keyword matching and vector search for semantic similarity. Incremental indexing efficiently updates the database by re-processing only modified files based on SHA-256 hashes.
Quick Start & Requirements
npm install -g codemogger) or via npx.codemogger index ./my-project and search using codemogger search "query". It can be integrated as an MCP server via a JSON configuration.Highlighted Details
ripgrep and yields precise definitions. Semantic search excels at finding relevant code via natural language queries, outperforming keyword-based tools when exact terms are unknown.Maintenance & Community
The provided README does not contain specific details regarding maintainers, community channels (e.g., Discord, Slack), sponsorships, or a public roadmap.
Licensing & Compatibility
Limitations & Caveats
The README does not detail specific limitations, alpha status, or known bugs. Performance benchmarks are based on an Apple M2 (8GB) and may vary across different hardware configurations. The tool's effectiveness is dependent on the quality of tree-sitter grammars for supported languages.
4 days ago
Inactive