Discover and explore top open-source AI tools and projects—updated daily.
itsOwenWeb scraper for AI-powered data extraction
Top 23.0% on SourcePulse
CyberScraper 2077 is an AI-powered web scraping tool designed for extracting data from the internet, including .onion sites via Tor. It targets data analysts, netrunners, and researchers, offering intelligent parsing, multi-format exports, and a user-friendly Streamlit interface.
How It Works
This tool leverages Large Language Models (LLMs) from OpenAI, Gemini, and local Ollama deployments to understand and parse web content intelligently. It employs asynchronous operations for speed and includes features like caching (content-based and query-based LRU) to reduce redundant API calls. Stealth mode parameters and current browser instance usage aim to bypass bot detection.
Quick Start & Requirements
pip install -r requirements.txt, and playwright install.pip install ollama and model download).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
6 days ago
1 day
hyperbrowserai
adbar
apify
ScrapeGraphAI