LLM system for automated knowledge curation and article generation
STORM is an LLM-powered system for automated knowledge curation, designed to research a topic and generate comprehensive, citation-backed reports. It targets researchers, content creators, and anyone needing to synthesize information, offering a significant head start in the pre-writing phase of article generation. The system's novelty lies in its multi-perspective question-asking and simulated conversation approaches to improve research depth and breadth.
How It Works
STORM splits article generation into a research stage and a writing stage. During research it uses two techniques. "Perspective-Guided Question Asking" surveys similar topics to find perspectives that steer its questions. "Simulated Conversation" has an LLM play both a writer and an expert; the expert's answers are grounded in retrieved sources, which lets the writer refine its understanding and ask follow-up questions. Co-STORM extends this with a collaborative protocol involving a moderator, question-answering agents, and human users, and maintains a shared mind map to keep the concepts organized.
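The research loop described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the knowledge-storm API: the names Turn, simulate_conversation, ask, search, and answer are hypothetical, the perspectives are passed in rather than discovered from similar topics, and the toy functions stand in for real LLM and retrieval calls.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Turn:
    perspective: str
    question: str
    answer: str
    sources: List[str]


def simulate_conversation(
    topic: str,
    perspectives: List[str],
    ask: Callable[[str, str, List[Turn]], str],
    search: Callable[[str], List[str]],
    answer: Callable[[str, List[str]], str],
    max_turns: int = 2,
) -> List[Turn]:
    """Run one writer/expert dialogue per perspective and collect all turns."""
    transcript: List[Turn] = []
    for perspective in perspectives:
        history: List[Turn] = []
        for _ in range(max_turns):
            # Writer role: pose a question from this perspective, given the history.
            question = ask(topic, perspective, history)
            # Expert role: retrieve sources first, then answer grounded in them.
            sources = search(question)
            reply = answer(question, sources)
            history.append(Turn(perspective, question, reply, sources))
        transcript.extend(history)
    return transcript


# Toy stand-ins (hypothetical) so the sketch runs without an LLM or search backend.
def toy_ask(topic, perspective, history):
    kind = "follow-up" if history else "opening"
    return f"[{perspective}] {kind} question {len(history) + 1} about {topic}"


def toy_search(question):
    return [f"https://example.org/source?q={len(question)}"]


def toy_answer(question, sources):
    return f"Answer to '{question}' citing {len(sources)} source(s)"


turns = simulate_conversation(
    "solar sails", ["engineer", "historian"], toy_ask, toy_search, toy_answer
)
for turn in turns:
    print(turn.question, "->", turn.sources)
```

Passing the three roles in as callables keeps the loop independent of any particular model or search provider; in a real run each would wrap an LLM or retrieval call.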
Quick Start & Requirements
Install: pip install knowledge-storm
API keys are configured in secrets.toml. Example scripts are provided for STORM and Co-STORM.
Highlighted Details
litellm integration.
Maintenance & Community
The project is actively developed, with recent updates including litellm integration and the Co-STORM release. Contributions are welcomed via issues and pull requests. The contact persons are Yijia Shao and Yucheng Jiang.
Licensing & Compatibility
The FreshWiki dataset is licensed under CC BY-SA. The code's license is not explicitly stated in the README, but it is a research preview from Stanford. Commercial use compatibility requires careful review of the underlying model licenses and any explicit project licensing.
Limitations & Caveats
The system produces articles that are helpful in a pre-writing stage but may require significant edits to be publication-ready. The README mentions specific branches for replicating paper results, indicating potential differences between the main branch and historical experimental setups.