facebookresearch
Self-evolving search agents without training data
Top 64.5% on SourcePulse
Dr. Zero introduces a framework for self-evolving search agents that operate without requiring pre-existing training data. It targets researchers and engineers developing AI agents, offering a method to bootstrap complex reasoning and search capabilities through an automated, data-free curriculum, thereby matching or surpassing supervised approaches with reduced computational overhead.
How It Works
The core approach involves an iterative self-evolution loop between two agents: a proposer and a solver. The proposer generates diverse, increasingly challenging yet solvable questions, while the solver, initialized from a base model (e.g., Qwen, Llama), learns to answer these questions using a search tool. This process establishes an automated curriculum. To enhance training efficiency, hop-grouped relative policy optimization (HRPO) is employed, clustering structurally similar questions to minimize sampling overhead and reduce compute requirements.
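The summary above does not give the HRPO objective, so the following is only a minimal sketch of the grouping idea under stated assumptions: each proposer question is assumed to carry a hop count, and the solver's reward for it is compared against the other questions with the same hop count rather than against repeated samples of the same question. The function name `hop_grouped_advantages`, the `(hop_count, reward)` tuple layout, and the mean/standard-deviation normalization are illustrative choices, not taken from the project.

```python
from collections import defaultdict
from statistics import mean, pstdev


def hop_grouped_advantages(samples, eps=1e-6):
    """Normalize each reward against the rewards sharing its hop count.

    samples: list of (hop_count, reward) pairs (hypothetical layout).
    Returns one advantage per sample, in the input order.
    """
    # Collect rewards per hop count; each hop count forms one baseline group.
    groups = defaultdict(list)
    for hops, reward in samples:
        groups[hops].append(reward)

    # Per-group mean and population standard deviation.
    stats = {h: (mean(r), pstdev(r)) for h, r in groups.items()}

    # Advantage = (reward - group mean) / (group std + eps).
    return [
        (reward - stats[hops][0]) / (stats[hops][1] + eps)
        for hops, reward in samples
    ]


# Two 1-hop questions (one solved, one failed) and two 2-hop questions (both solved).
example = [(1, 1.0), (1, 0.0), (2, 1.0), (2, 1.0)]
advantages = hop_grouped_advantages(example)
print(advantages)
```

In this sketch a group whose rewards are all identical yields zero advantage for every member, so those questions contribute no learning signal; whether the actual method handles that case the same way is not stated in the summary.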
Quick Start & Requirements
python scripts/download.py --save_path ./corpus
cat $save_path/part_* > $save_path/e5_Flat.index
gzip -d $save_path/wiki-18.jsonl.gz
python process_train.py --local_dir ./data
python process_test.py --local_dir ./data
Highlighted Details
Maintenance & Community
No specific details regarding active contributors, community channels (e.g., Discord, Slack), sponsorships, or a public roadmap were found in the provided README.
Licensing & Compatibility
The code is released under a non-commercial license. This restriction means the project is not suitable for integration into commercial products or services.
Limitations & Caveats
The primary limitation is the non-commercial license, which restricts its use to research and non-profit applications. Setting up the required local search engine, including downloading and indexing a large corpus, represents a significant initial effort.