Discover and explore top open-source AI tools and projects—updated daily.
DCI-AgentRethinking retrieval for agentic search via direct corpus interaction
Top 90.7% on SourcePulse
Summary
DCI-Agent-Lite introduces a Direct Corpus Interaction (DCI) paradigm for agentic search and deep research on personal knowledge bases. It targets users needing to query private data without external services or pre-built indices. The benefit is a simplified, high-resolution retrieval system treating the corpus as an open research environment, yielding improved accuracy.
How It Works
DCI agents search raw files directly using terminal tools (rg, find, sed), bypassing semantic retrievers and index builds. This "zero-index retrieval" enables immediate operation, fine-grained control, and free search primitive composition. Built on Pi with bash tools and lightweight context management, it balances minimal complexity with robust long-horizon research.
Quick Start & Requirements
bash setup.sh (Unix/macOS). Manual setup requires uv, ripgrep, npm, Python deps, API keys (.env), and cloning pi-mono (https://github.com/jdf-prog/pi-mono.git).uv, ripgrep, npm, Python, LLM API keys (OpenAI/Anthropic). Specific models (e.g., gpt-5.4-nano) are needed for benchmark performance.assets/docs/setup.md, Hugging Face datasets.uv run dci-agent-lite --terminal --provider openai --model gpt-5.4-nano --cwd "corpus/wiki_corpus" --extra-arg="--thinking high"Highlighted Details
gpt-5.4-nano, outperforming other LLMs.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
gpt-5.4-nano).2 weeks ago
Inactive
mixedbread-ai
NVIDIA-AI-Blueprints