OpenAttack by thunlp

Text attack toolkit for evaluating & improving NLP model robustness

Created 5 years ago

764 stars

Top 45.7% on SourcePulse

Project Summary

OpenAttack is a Python toolkit for generating textual adversarial attacks against NLP models and evaluating their results. It is aimed at researchers and practitioners who want to assess model robustness, develop new attack strategies, or run adversarial training. The package covers the full attack pipeline, from text preprocessing through victim model interaction to result evaluation.

How It Works

OpenAttack uses a modular architecture that separates concerns into seven components: TextProcessor, Victim, Attacker, AttackAssist, Metric, AttackEval, and DataManager. This makes the toolkit extensible: users can plug in custom datasets, victim models, or attack algorithms. Attacks are supported at the sentence, word, and character level, and under gradient-based, score-based, decision-based, and blind access to the victim model. Parallel processing is available to speed up attack runs. The toolkit integrates with Hugging Face's Transformers and Datasets libraries, which simplifies the use of pre-trained models and datasets.
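The separation between victim, attacker, and evaluation can be illustrated with a small pure-Python sketch. This is not OpenAttack's API: ToyVictim, ToySynonymAttacker, and evaluate are hypothetical stand-ins, the keyword classifier and synonym table are invented for illustration, and the attack shown is a simple word-level, decision-based one (it only sees the victim's predicted label).

```python
from typing import Dict, List, Optional


class ToyVictim:
    """Hypothetical stand-in for a Victim: a keyword-based sentiment classifier."""

    NEGATIVE_WORDS = {"bad", "awful", "boring"}

    def predict(self, text: str) -> int:
        # Return 0 (negative) if any negative keyword appears, else 1 (positive).
        tokens = text.lower().split()
        return 0 if any(t in self.NEGATIVE_WORDS for t in tokens) else 1


class ToySynonymAttacker:
    """Hypothetical stand-in for an Attacker: word-level synonym substitution."""

    SYNONYMS: Dict[str, List[str]] = {
        "bad": ["poor"],
        "awful": ["dreadful"],
        "boring": ["dull"],
    }

    def attack(self, victim: ToyVictim, text: str) -> Optional[str]:
        original_label = victim.predict(text)
        tokens = text.split()
        # Greedily keep each substitution and stop as soon as the label flips.
        for i, token in enumerate(tokens):
            for candidate in self.SYNONYMS.get(token.lower(), []):
                tokens[i] = candidate
                adversarial = " ".join(tokens)
                if victim.predict(adversarial) != original_label:
                    return adversarial
        return None  # no label-flipping substitution was found


def evaluate(attacker: ToySynonymAttacker, victim: ToyVictim, texts: List[str]) -> dict:
    """Hypothetical stand-in for an evaluation step: run attacks, report success rate."""
    results = [attacker.attack(victim, t) for t in texts]
    successes = sum(r is not None for r in results)
    return {"results": results, "success_rate": successes / len(texts)}


report = evaluate(
    ToySynonymAttacker(),
    ToyVictim(),
    ["a bad movie", "a great movie", "bad and boring"],
)
print(report)
```

Because the attacker only calls victim.predict, either side can be swapped independently, which is the kind of extensibility the component split is meant to provide.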

Quick Start & Requirements

Install via pip: pip install OpenAttack
Clone repo and install: git clone https://github.com/thunlp/OpenAttack.git && cd OpenAttack && python setup.py install
Requires Python.
Demo available: python demo.py
Examples and documentation: README

Highlighted Details

Supports 15 attack models, covering sentence, word, and character-level perturbations.
Offers multilinguality (English and Chinese) with an extensible design for more languages.
Fully compatible with 🤗 Hugging Face Transformers and Datasets.
Includes built-in victim models (e.g., BERT, RoBERTa) and supports custom victim models and datasets.
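To make the perturbation levels concrete, the sketch below contrasts a character-level edit with a word-level edit on the same sentence and measures how much of the input changed. The helper names (swap_adjacent_chars, substitute_word, word_modification_rate) are hypothetical and written for illustration only; they are not taken from OpenAttack.

```python
from typing import List


def swap_adjacent_chars(word: str, index: int) -> str:
    """Character-level perturbation: swap the characters at index and index + 1."""
    if index < 0 or index + 1 >= len(word):
        return word  # nothing to swap at this position
    chars = list(word)
    chars[index], chars[index + 1] = chars[index + 1], chars[index]
    return "".join(chars)


def substitute_word(tokens: List[str], position: int, replacement: str) -> List[str]:
    """Word-level perturbation: return a copy of tokens with one token replaced."""
    return tokens[:position] + [replacement] + tokens[position + 1:]


def word_modification_rate(original: List[str], perturbed: List[str]) -> float:
    """Fraction of token positions that differ; assumes equal-length token lists."""
    if len(original) != len(perturbed):
        raise ValueError("token lists must have the same length")
    changed = sum(a != b for a, b in zip(original, perturbed))
    return changed / len(original)


tokens = "the film was terrible".split()

# Character-level: introduce a typo inside one word.
char_level = substitute_word(tokens, 3, swap_adjacent_chars(tokens[3], 1))

# Word-level: replace the whole word with a (hand-picked) near-synonym.
word_level = substitute_word(tokens, 3, "dreadful")

print(" ".join(char_level), word_modification_rate(tokens, char_level))
print(" ".join(word_level), word_modification_rate(tokens, word_level))
```

Both edits touch one token out of four, so a token-level rate cannot tell them apart; a character-level distance would be needed to capture that the typo is the smaller change.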

Maintenance & Community

Developed by THUNLP.
Contributions are welcomed.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Because the README states no license, commercial adoption or use in closed-source projects may be blocked until the licensing terms are confirmed.

Health Check

Last Commit: 2 years ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 1

Star History

6 stars in the last 30 days

Explore Similar Projects

Awesome-LVLM-Attack by liudaizong

Curated list of attacks on large vision-language models (LVLMs)

Created 1 year ago

Updated 1 week ago

Visual-Adversarial-Examples-Jailbreak-Large-Language-Models by Unispac

Visual adversarial examples bypass LLM safety alignments

Created 2 years ago

Updated 1 year ago

awesome-trustworthy-deep-learning by MinghuiChen43

Curated list of trustworthy deep learning papers

Created 5 years ago

Updated 1 month ago

langtest by Pacific-AI-Corp

NLP testing SDK for model safety and effectiveness

Created 3 years ago

Updated 2 weeks ago

Starred by Elvis Saravia (Founder of DAIR.AI), Yaowei Zheng (Author of LLaMA-Factory), and 2 more.

universal-triggers by Eric-Wallace

NLP attack/analysis research paper (EMNLP 2019)

Created 6 years ago

Updated 1 year ago

Starred by Dan Hendrycks (Author of MMLU; Executive Director at Center for AI Safety).

nanoGCG by GraySwanAI

PyTorch implementation of the Greedy Coordinate Gradient (GCG) algorithm

Created 1 year ago

Updated 8 months ago

Starred by Yaowei Zheng (Author of LLaMA-Factory).

robustbench by RobustBench

Standardized benchmark for adversarial robustness research

Created 5 years ago

Updated 9 months ago

offensive-ai-compilation by jiep

Curated list of Offensive AI resources

Created 2 years ago

Updated 6 days ago

Starred by Yaowei Zheng (Author of LLaMA-Factory).

TAADpapers by thunlp

Curated list of must-read papers on textual adversarial attack and defense

Created 6 years ago

Updated 7 months ago

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), Yaowei Zheng (Author of LLaMA-Factory), and 7 more.

TextAttack by QData

Python framework for NLP adversarial attacks, data augmentation, and model training

Created 6 years ago

Updated 6 months ago

Starred by Elie Bursztein (Cybersecurity Lead at Google DeepMind), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 6 more.

llm-attacks by llm-attacks

Attack framework for aligned LLMs, based on a research paper

Created 2 years ago

Updated 1 year ago

Starred by Lilian Weng (Cofounder of Thinking Machines Lab), Binyuan Hui (Research Scientist at Alibaba Qwen), and 10 more.

cleverhans by cleverhans-lab

Adversarial example library for benchmarking ML model robustness

Created 9 years ago

Updated 1 year ago
