TextAttack by QData

Python framework for NLP adversarial attacks, data augmentation, and model training

created 5 years ago
3,229 stars

Top 15.3% on sourcepulse

View on GitHub
Project Summary

TextAttack is a comprehensive Python framework designed for researchers and practitioners in Natural Language Processing (NLP) to generate adversarial examples, augment datasets, and train NLP models. It provides a unified interface for understanding, developing, and benchmarking various adversarial attack methods against NLP models, enhancing model robustness and interpretability.

How It Works

TextAttack modularizes adversarial attacks into four key components: Goal Functions (defining attack success), Constraints (validating perturbations), Transformations (generating modifications), and Search Methods (navigating the perturbation space). This design allows for the assembly of existing attacks from literature and the creation of novel ones by combining these components, enabling model-agnostic analysis of any NLP model that can process string inputs.
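
A rough sketch of how those four components can be assembled through TextAttack's Python API follows. Class and module names match the project's documented API, but exact paths, defaults, and the model name used here are illustrative and may differ between versions:

```python
import transformers
import textattack

# Wrap any model that maps strings to predictions; here, a Hugging Face classifier.
model = transformers.AutoModelForSequenceClassification.from_pretrained(
    "textattack/bert-base-uncased-imdb")
tokenizer = transformers.AutoTokenizer.from_pretrained(
    "textattack/bert-base-uncased-imdb")
model_wrapper = textattack.models.wrappers.HuggingFaceModelWrapper(model, tokenizer)

# Goal function: the attack succeeds when the predicted label flips.
goal_function = textattack.goal_functions.UntargetedClassification(model_wrapper)

# Constraints: disallow repeated edits and stopword substitutions.
constraints = [
    textattack.constraints.pre_transformation.RepeatModification(),
    textattack.constraints.pre_transformation.StopwordModification(),
]

# Transformation: swap words for nearest neighbors in embedding space.
transformation = textattack.transformations.WordSwapEmbedding(max_candidates=20)

# Search method: greedy search ordered by word importance ranking.
search_method = textattack.search_methods.GreedyWordSwapWIR()

attack = textattack.Attack(goal_function, constraints, transformation, search_method)

# Attack a single (text, ground-truth label) example and print the result.
result = attack.attack("The movie was surprisingly enjoyable.", 1)
print(result)
```

Because the model is only accessed through the wrapper's string-to-prediction interface, the same attack can be pointed at models from any framework by supplying a different wrapper.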

Quick Start & Requirements
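
The README's quick start boils down to installing the package from PyPI and running an attack from the command line. The snippet below is a minimal sketch based on TextAttack's documented CLI; the recipe and model shortcut are illustrative:

```bash
# Install the library and the `textattack` CLI from PyPI (recent Python 3 assumed).
pip install textattack

# Run the TextFooler recipe against a pre-fine-tuned model on 10 examples.
textattack attack --recipe textfooler --model bert-base-uncased-mr --num-examples 10
```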

Highlighted Details

  • Supports 16+ adversarial attack recipes from academic literature.
  • Model-agnostic, compatible with models from any deep learning framework.
  • Includes built-in support for Hugging Face Transformers models and datasets.
  • Offers command-line and Python interfaces for attacks, data augmentation, and model training (a small augmentation sketch follows this list).
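
As a sketch of the Python augmentation interface mentioned above (the class name follows TextAttack's documentation; the keyword arguments and example sentence are illustrative):

```python
from textattack.augmentation import EmbeddingAugmenter

# Augments text by swapping words for neighbors in counter-fitted embedding space.
augmenter = EmbeddingAugmenter(pct_words_to_swap=0.1, transformations_per_example=4)
print(augmenter.augment("What I cannot create, I do not understand."))
```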

Maintenance & Community

  • Active development, currently in an "alpha" stage.
  • Join the TextAttack Slack channel for updates and help.
  • Contribution guidelines are available in CONTRIBUTING.md.

Licensing & Compatibility

  • The README does not explicitly state a license; verify licensing before commercial use or closed-source linking.

Limitations & Caveats

  • The project is in an "alpha" stage, indicating potential for ongoing changes and instability.
  • The README cautions that directly comparing attack recipes is misleading unless the constraint space is held constant, and it stresses careful evaluation of adversarial example quality, particularly semantic preservation and grammaticality.

Health Check

  • Last commit: 3 weeks ago
  • Responsiveness: 1 week
  • Pull Requests (30d): 1
  • Issues (30d): 0
  • Star history: 80 stars in the last 90 days

Explore Similar Projects

Starred by Elie Bursztein (Cybersecurity Lead at Google DeepMind), Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), and 3 more.

llm-attacks by llm-attacks

Attack framework for aligned LLMs, based on a research paper

Top 0.4% · 4k stars · created 2 years ago · updated 1 year ago