Web scraper for AI-powered data extraction
Top 25.2% on sourcepulse
CyberScraper 2077 is an AI-powered web scraping tool designed for extracting data from the internet, including .onion sites via Tor. It targets data analysts, netrunners, and researchers, offering intelligent parsing, multi-format exports, and a user-friendly Streamlit interface.
How It Works
This tool leverages Large Language Models (LLMs) from OpenAI, Gemini, and local Ollama deployments to understand and parse web content intelligently. It employs asynchronous operations for speed and includes features like caching (content-based and query-based LRU) to reduce redundant API calls. Stealth mode parameters and current browser instance usage aim to bypass bot detection.
Quick Start & Requirements
pip install -r requirements.txt
, and playwright install
.pip install ollama
and model download).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 month ago
1 day