AI-powered web crawler for wide information gathering
Wiseflow is an AI-powered information extraction tool designed to help users sift through vast amounts of data from diverse sources to find what's relevant. It targets users who need to monitor industries, gather background information, or collect customer intelligence without the need for specific, deep-dive queries. The primary benefit is saving time and filtering out noise by focusing on "wide search" scenarios.
How It Works
Wiseflow employs a "crawl-and-search integration" strategy, departing from traditional filter-extractor pipelines. Instead of treating each page as a unit, it segments HTML content into "main text blocks" and "external link blocks." Different LLM extraction strategies are applied to each block type to optimize token usage and relevance. Main text blocks are summarized based on user-defined focus points, while external link blocks are analyzed to intelligently decide which links warrant further exploration, eliminating the need for manual configuration of crawl depth or quantity.
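The block split described above can be illustrated with a small sketch. This is not Wiseflow's actual code: the class and function names, the set of block-level tags, and the link-density threshold of 0.5 are all assumptions made for illustration, and the LLM calls that would summarize main text blocks or pick links to follow are left out. It uses only the Python standard library and classifies a block as an external link block when most of its visible text sits inside anchor tags.

```python
from html.parser import HTMLParser

# Assumption: these tags delimit blocks; a real page may need a richer rule set.
BLOCK_TAGS = {"p", "div", "ul", "ol", "section", "article", "nav", "header", "footer"}

SAMPLE_HTML = """
<article><p>Wide search gathers many loosely related pages and keeps only
what matches a focus point. See <a href="/about">about</a>.</p></article>
<nav><ul><li><a href="/a">Industry news</a></li><li><a href="/b">Reports</a></li></ul></nav>
"""


class BlockSegmenter(HTMLParser):
    """Collect text blocks and record how much of each block is link text."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._in_link = False
        self._href = None
        self._reset_block()

    def _reset_block(self):
        self._texts, self._link_chars, self._links = [], 0, []

    def _flush(self):
        # Close the current block if it holds any visible text.
        text = " ".join(self._texts)
        if text:
            self.blocks.append(
                {"text": text, "link_chars": self._link_chars, "links": self._links}
            )
        self._reset_block()

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self._flush()
        elif tag == "a":
            self._in_link = True
            self._href = dict(attrs).get("href")

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self._flush()
        elif tag == "a":
            self._in_link = False
            self._href = None

    def handle_data(self, data):
        data = data.strip()
        if not data:
            return
        self._texts.append(data)
        if self._in_link:
            self._link_chars += len(data)
            if self._href:
                self._links.append((data, self._href))

    def close(self):
        super().close()
        self._flush()


def segment(html, link_ratio_threshold=0.5):
    """Return (main_text_blocks, external_link_blocks) using link density."""
    parser = BlockSegmenter()
    parser.feed(html)
    parser.close()
    main_blocks, link_blocks = [], []
    for block in parser.blocks:
        ratio = block["link_chars"] / len(block["text"])
        (link_blocks if ratio > link_ratio_threshold else main_blocks).append(block)
    return main_blocks, link_blocks


if __name__ == "__main__":
    main_blocks, link_blocks = segment(SAMPLE_HTML)
    print("main text blocks:", [b["text"] for b in main_blocks])
    print("candidate links:", [b["links"] for b in link_blocks])
```

In a pipeline of this shape, the main text blocks would go to a summarization prompt built from the user's focus points, while the link lists would go to a separate, cheaper prompt that decides which URLs to crawl next.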
Quick Start & Requirements
1. Run the install_pocketbase script (Linux/macOS: chmod +x install_pocketbase && ./install_pocketbase; Windows: install_pocketbase.ps1).
2. Configure core/.env.
3. Create a virtual environment: conda create -n wiseflow python=3.12 && conda activate wiseflow
4. Install dependencies: cd wiseflow/core && pip install -r requirements.txt
5. Install Playwright: python -m playwright install --with-deps chromium
6. Run: chmod +x run.sh && ./run.sh (Linux/macOS) or python windows_run.py (Windows)
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The online service has limitations for users outside mainland China and does not support WeChat official accounts. For these scenarios, self-deploying the open-source version is recommended. The 4.x plan aims to introduce an "insight module" for analyzing "hidden information" within fetched data.