AI Agent for deep Chinese web research, by Jesseovo
Top 73.7% on SourcePulse
This project provides an AI Agent skill for deep research on Chinese internet platforms. It automatically gathers and analyzes content from the last 30 days across eight major platforms and produces up-to-date, well-cited research reports. This reduces the need for API keys and manual data collection for users and AI agents focused on the Chinese digital landscape.
How It Works
The core of the project is its integration of the MediaCrawler engine, which uses Playwright for browser automation. This reduces API key dependencies: 7 of the 8 supported platforms work without platform-specific keys. Data acquisition follows a three-tier strategy: configured APIs are tried first, the Playwright-based crawler is the fallback, and public interfaces are the last resort, so a query can still return data when one tier is unavailable.
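The three-tier order described above can be sketched as a simple fallback chain. This is an illustrative sketch only, not the project's actual code: the function name fetch_with_fallback, the tier labels, and the stub fetchers are all hypothetical stand-ins for the API, crawler, and public-interface layers.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical fetcher type: takes a query string, returns a list of result dicts.
Fetcher = Callable[[str], list]


def fetch_with_fallback(
    query: str, tiers: List[Tuple[str, Optional[Fetcher]]]
) -> Tuple[str, list]:
    """Try each tier in order; return (tier_name, results) from the first tier with data."""
    errors = []
    for name, fetcher in tiers:
        if fetcher is None:  # tier not configured, e.g. no API key available
            continue
        try:
            results = fetcher(query)
        except Exception as exc:  # one failing tier should not abort the lookup
            errors.append(f"{name}: {exc}")
            continue
        if results:
            return name, results
    raise RuntimeError("all tiers failed or returned nothing: " + "; ".join(errors))


# Stand-in fetchers (assumptions for illustration, not real platform clients).
def crawler_stub(query: str) -> list:
    return [{"title": f"result for {query}"}]


def public_stub(query: str) -> list:
    return []


tiers = [
    ("api", None),              # no key configured, so this tier is skipped
    ("crawler", crawler_stub),  # browser-automation tier
    ("public", public_stub),    # last resort
]
tier_used, items = fetch_with_fallback("AI agents", tiers)
print(tier_used, len(items))
```

Passing the tiers as an ordered list keeps the priority explicit and makes it easy to see which tier actually served a given query.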
Quick Start & Requirements
Install dependencies: pip install jieba playwright
Install Playwright browser binaries: playwright install chromium
Configuration file: ~/.config/last30days-cn/.env, for enhanced stability or WeChat access
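For readers who want to check what the configuration file at ~/.config/last30days-cn/.env contains, a minimal sketch of reading it follows. It assumes the file uses plain KEY=VALUE lines, which the summary does not confirm; the project may load it differently, and load_env_file is a hypothetical helper rather than part of the tool.

```python
from pathlib import Path
from typing import Dict


def load_env_file(path: Path) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments.

    Returns an empty dict when the file does not exist.
    """
    settings: Dict[str, str] = {}
    if not path.is_file():
        return settings
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings


# Path taken from the Quick Start notes; the keys inside it are not documented here.
env_path = Path.home() / ".config" / "last30days-cn" / ".env"
settings = load_env_file(env_path)
print(f"{len(settings)} setting(s) found in {env_path}")
```

Only the number of settings is printed, so that any keys stored in the file do not end up in terminal logs.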
Maintenance & Community
The project is maintained by Jesse (@Jesseovo). No specific community channels (like Discord/Slack) or sponsorship details are mentioned in the README.
Licensing & Compatibility
This project is released under the MIT License, which is permissive for commercial use and integration into closed-source projects. However, the project's usage terms strictly prohibit commercial use of the scraped data or the tool itself for commercial data services, emphasizing its use for learning and research.
Limitations & Caveats
Users are explicitly warned that the project is for educational and research purposes only, and commercial use is forbidden. Strict adherence to Chinese laws, platform Terms of Service, and robots.txt is mandatory. Platform interfaces are subject to change, potentially impacting functionality, and users are advised to manage request frequency to avoid account bans. WeChat functionality primarily relies on API keys.
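Two of the caveats above, managing request frequency and respecting robots.txt, can be illustrated with a short sketch. The Throttle class and allowed_by_robots helper are hypothetical names chosen for this example, and the 2-second interval is an arbitrary assumption rather than a value recommended by the project; only the standard-library urllib.robotparser module is used.

```python
import time
from urllib.robotparser import RobotFileParser


class Throttle:
    """Enforce a minimum interval between consecutive requests."""

    def __init__(self, min_interval: float, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock  # injectable for testing
        self._sleep = sleep
        self._last = None    # time of the previous request, if any

    def wait(self) -> float:
        """Block until the interval has elapsed; return the seconds slept."""
        delay = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            delay = max(0.0, self.min_interval - elapsed)
            if delay > 0:
                self._sleep(delay)
        self._last = self._clock()
        return delay


def allowed_by_robots(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Check a URL against already-downloaded robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Example with made-up rules and URLs (example.com is a placeholder domain).
rules = "User-agent: *\nDisallow: /private/\n"
throttle = Throttle(min_interval=2.0)  # assumed interval, tune per platform
for url in ["https://example.com/public", "https://example.com/private/x"]:
    if allowed_by_robots(rules, url):
        throttle.wait()
        print("would fetch", url)
    else:
        print("skipping (disallowed)", url)
```

A fixed minimum interval is the simplest policy; it does not by itself guarantee compliance with any platform's Terms of Service, which still need to be checked separately.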