Discover and explore top open-source AI tools and projects—updated daily.
strohneAutomated web data retrieval and extraction
Top 58.3% on SourcePulse
Summary
Facepager is a tool for automated data retrieval from websites and APIs like YouTube, Twitter, and knowledge infrastructure sources. It simplifies complex data collection tasks, including multi-threading, rate limits, and pagination, benefiting researchers and power users by efficiently gathering and exporting public data.
How It Works
Facepager automates online data collection via APIs and web scraping, managing multi-threaded operations, rate limits, pagination, and data extraction. Data is stored in SQLite and exportable to CSV. It offers presets for various sources and allows custom pipelines for targeted data collection, including cloud services.
Quick Start & Requirements
.exe installer), macOS (.pkg, requires security adjustments), Linux (build from source per src/readme.md).Highlighted Details
Maintenance & Community
Help is available via the Facepager Usergroup on Facebook. Updates are announced on the Facepager Facebook Page.
Licensing & Compatibility
Distributed under the permissive MIT License, allowing commercial use, modification, and distribution with attribution.
Limitations & Caveats
Official Facebook, Twitter, and YouTube API support is limited; users must obtain and configure their own API keys. Database files may not be compatible across versions.
1 week ago
Inactive
datalevin
vespa-engine