ai-ublock-blacklist  by alvi-se

Filtering AI content farms

Created 1 year ago
465 stars

Top 65.2% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides a curated uBlock Origin filter list specifically targeting AI-generated content farms. It aims to help users avoid low-quality, ad-filled, and potentially misleading websites, ensuring search results prioritize human-generated content. The benefit is a cleaner, more reliable browsing experience for users who prefer human-authored information.

How It Works

The project maintains a manual blacklist of AI-generated websites identified through browsing. The core approach is to block content farms that offer no unique value and are primarily driven by advertising or SEO, distinguishing itself from lists that block all AI-related search results. The curator manually identifies sites based on patterns like generic introductions, lack of sources, excessive referral links, and AI-like writing styles, aiming for precision over broad automation.

Quick Start & Requirements

  • Install: Subscribe directly via the uBlock Origin extension using this link: https://raw.githubusercontent.com/alvi-se/ai-ublock-blacklist/master/list.txt. Alternatively, import the URL as a 3rd-party list within uBlock Origin settings.
  • Prerequisites: uBlock Origin browser extension.
  • Contribution: Suspected AI content farms can be reported via GitHub Issues, or users can submit entries directly via Pull Requests by adding domain or specific path entries to the list.txt file using uBlock Origin's filter syntax (e.g., ||example.com^$doc).

Highlighted Details

  • Manual Curation: Emphasizes human judgment to identify and block low-value AI content farms, avoiding the accidental blocking of legitimate AI tools.
  • Detection Criteria: Utilizes a detailed set of heuristics, including article structure, sourcing, monetization tactics, content quality, and publication patterns, to recognize AI-generated sites.
  • Targeted Blocking: Focuses exclusively on content farms, differentiating from broader AI-result-blocking lists.
  • Effectiveness: Claims effectiveness even with a small list size due to the prevalence of SEO-optimized content farms.

Maintenance & Community

The project relies on community contributions via GitHub Issues for reporting and Pull Requests for adding new entries to the blacklist. There are no formal community channels or dedicated maintainers mentioned.

Licensing & Compatibility

  • License: The repository README does not specify a software license.
  • Compatibility: Designed for use with the uBlock Origin browser extension. Users should be aware of potential implications due to the lack of a specified license.

Limitations & Caveats

The list is manually curated and may exhibit bias, particularly towards Italian websites due to the author's location. The effectiveness relies on ongoing manual effort, and the list's size is currently limited. AI content detection is inherently challenging, and the criteria used are guidelines rather than strict rules. No software license is provided, which may impact commercial use or redistribution.

Health Check
Last Commit

4 days ago

Responsiveness

Inactive

Pull Requests (30d)
11
Issues (30d)
5
Star History
460 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.