Discover and explore top open-source AI tools and projects—updated daily.
Framework for large-scale reinforcement learning of search agents
Top 71.5% on SourcePulse
ASearcher is an open-source framework for large-scale online reinforcement learning (RL) training of search agents, aiming to achieve expert-level Search Intelligence. It targets developers and researchers seeking to build high-performance, cost-effective search agents, offering released model weights, training methodologies, and data synthesis pipelines.
How It Works
ASearcher employs a fully asynchronous agentic RL approach, decoupling trajectory collection from model training to eliminate GPU idle time and enable efficient long-horizon RL. It also features a novel prompt-based LLM agent for autonomously generating diverse and challenging QA pairs, enhancing training data quality and complexity. This asynchronous design allows for extended tool calls and token generation per trajectory, leading to more robust agent behavior.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Primary contributors are from the RL Lab at Ant Research and Tsinghua University. Acknowledged assistance from AWorld team and Super Computing Technology (SCT) team at Ant Group. Inspired by Search-o1, Search-R1, and WebAgent.
Licensing & Compatibility
The project is released under a permissive license, allowing for commercial use and integration into closed-source projects.
Limitations & Caveats
Details for fine-tuning a QwQ-32B agent are marked as "coming soon." The single-node training for a 7B model is noted as potentially slow.
18 hours ago
Inactive