Ryandonofrio3
Semantic code search for local development and AI agents
Top 48.2% on SourcePulse
osgrep provides semantic, natural-language search for codebases, functioning like grep but understanding concepts rather than just strings. It targets developers and power users seeking a fast, local, and private solution for code exploration, especially when integrating with AI coding agents. The primary benefit is enhanced code comprehension through intelligent, context-aware search capabilities.
How It Works
osgrep employs local transformer models via transformers.js to generate embeddings for code, enabling semantic search. It utilizes tree-sitter for smart chunking, splitting code by logical boundaries like functions and classes to capture complete concepts. A hybrid search approach combines vector search with keyword search using Reciprocal Rank Fusion (RRF) for improved accuracy. Adaptive throttling monitors system resources (CPU/RAM) to dynamically adjust indexing performance, ensuring it runs efficiently without overheating machines.
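The fusion step can be illustrated with a minimal, language-agnostic sketch of Reciprocal Rank Fusion. This is not osgrep's own code: the function name `reciprocal_rank_fusion`, the chunk IDs, and the constant `k = 60` (a value commonly used for RRF) are assumptions chosen for illustration. Each result list contributes `1 / (k + rank)` to a chunk's score, so chunks ranked well by both the vector search and the keyword search rise to the top.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of chunk IDs into a single ranking.

    rankings: list of lists, each ordered best-first.
    k: smoothing constant (assumed value; osgrep's setting is not documented here).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top hit in a list contributes 1 / (k + 1).
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so the output is deterministic.
    return sorted(scores, key=lambda chunk_id: (-scores[chunk_id], chunk_id))


# Hypothetical result lists from the two retrievers.
vector_hits = ["auth.ts#login", "session.ts#refresh", "db.ts#connect"]
keyword_hits = ["session.ts#refresh", "utils.ts#hash", "auth.ts#login"]

fused = reciprocal_rank_fusion([vector_hits, keyword_hits])
print(fused)
```

In this example `session.ts#refresh` ends up first because it ranks near the top of both lists, while chunks found by only one retriever fall to the bottom. RRF uses only rank positions, so the raw vector-similarity and keyword scores never need to be put on a common scale.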
Quick Start & Requirements
Install: npm install -g osgrep (or pnpm, bun).
Setup: run osgrep setup, or let it run automatically on first use.
Highlighted Details
Maintenance & Community
The project acknowledges mgrep by MixedBread as a foundational influence, with significant rewrites for local-only operation. No specific contributors, sponsorships, or community channels (e.g., Discord, Slack) are detailed in the provided README.
Licensing & Compatibility
Licensed under the Apache License 2.0. This license is generally permissive for commercial use and integration into closed-source projects.
Limitations & Caveats
The tool relies on the Node.js ecosystem. The initial embedding model download requires approximately 150 MB of storage. Although adaptive throttling is implemented, indexing performance may still vary with the complexity of the codebase and the user's hardware.