Research framework for autonomous web search and report drafting
Top 33.5% on sourcepulse
WebThinker is a framework that empowers Large Reasoning Models (LRMs) to conduct deep web research autonomously, enabling them to search, explore web pages, and draft research reports within their thinking process. It targets researchers and users needing in-depth, automated information gathering and report generation, offering an end-to-end solution that integrates knowledge acquisition directly into the LRM's reasoning.
How It Works
WebThinker utilizes a "Think-Search-and-Draft" strategy, allowing LRMs to interact with the web. A Deep Web Explorer enables models to perform searches, navigate pages by clicking elements, and extract information. The framework supports autonomous follow-up searches and deeper link traversal. For report generation, LRMs are equipped with tools for drafting, checking, and editing report sections, ensuring coherence and adaptability. RL-based training strategies are being developed to optimize performance using preference pairs from complex tasks.
Quick Start & Requirements
conda create -n webthinker python=3.9
), activate it (conda activate webthinker
), and install requirements (pip install -r requirements.txt
).cd demo; streamlit run_demo.py
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is actively developing RL-based training strategies, suggesting ongoing research and potential for future improvements or changes. Specific model serving configurations (vLLM) and API keys (Bing) are required for operation.
4 days ago
1 day