freeproxy  by CharlesPikachu

Python SDK for dynamic free proxy collection and management

Created 7 years ago
271 stars

Top 95.2% on SourcePulse

GitHubView on GitHub
Project Summary

FreeProxy is a Python library designed to automatically discover, collect, and manage a vast pool of free proxy servers from over 20 diverse internet sources. It targets web scraping, data analysis, and any application requiring robust proxy rotation. The project offers a continuously updated, high-quality proxy list, saving users significant effort in manual proxy acquisition and maintenance.

How It Works

The core of FreeProxy involves modular ProxiedSession classes, each responsible for scraping proxies from a specific source. It supports HTTP, HTTPS, SOCKS4, and SOCKS5 protocols, with detailed capabilities listed per source. Users can apply custom filtering rules based on country code, anonymity level, protocol, and latency. A key feature is the ProxiedSessionClient, which acts as a drop-in replacement for requests.Session, managing an internal proxy pool that automatically replenishes by scraping as needed, simplifying proxy integration into applications.

Quick Start & Requirements

Installation is straightforward via pip: pip install pyfreeproxy. Some proxy sources require Playwright, which handles its own browser driver downloads. The project is actively maintained, with recent updates in late 2025 and early 2026.

Highlighted Details

  • Supports scraping from over 20 distinct proxy sources.
  • Offers granular filtering by country, anonymity, protocol, and network latency.
  • Provides a ProxiedSessionClient for seamless integration, mimicking requests.Session with automatic proxy pool management and replenishment.
  • Recent updates indicate ongoing development and feature additions.

Maintenance & Community

The project shows active maintenance with releases in late 2025 and early 2026. The author maintains a WeChat official account ("Charles的皮卡丘") for updates and engagement. Links to documentation and a live proxy list are provided.

Licensing & Compatibility

The repository's README does not explicitly state a software license. This omission requires clarification for commercial use or integration into proprietary projects, as the default copyright applies.

Limitations & Caveats

Reliance on free, third-party proxy sources means availability and quality can fluctuate unpredictably. Some scraping methods depend on Playwright, introducing a heavier dependency and potential browser driver management. The effectiveness of latency-based filtering (max_tcp_ms, max_http_ms) may significantly slow down the initial proxy scraping process.

Health Check
Last Commit

2 hours ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
35 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.