ml_privacy_meter by privacytrustlab

Privacy auditing library for assessing data privacy risks in ML models

Created 5 years ago
675 stars

Top 50.0% on SourcePulse

View on GitHub
Project Summary

Privacy Meter is an open-source library designed to audit data privacy in statistical and machine learning algorithms, targeting researchers and practitioners in sensitive domains like healthcare and finance. It provides quantitative assessments of privacy risks using state-of-the-art membership inference attacks, helping organizations comply with data protection regulations like GDPR.
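
Privacy Meter's audits are built on membership inference. As a rough illustration of the core idea (this is a generic loss-threshold attack sketch, not the library's API), an attacker predicts "member" when a sample's loss is suspiciously low:

```python
# Minimal sketch of a loss-threshold membership inference attack.
# This illustrates the general technique only; it is NOT Privacy
# Meter's implementation or API.
import numpy as np

def loss_threshold_attack(member_losses, nonmember_losses, threshold):
    """Predict 'member' when a sample's loss falls below the threshold.

    Returns the true-positive rate (on real members) and false-positive
    rate (on non-members) at this threshold; sweeping the threshold
    traces out an ROC curve.
    """
    tpr = float(np.mean(np.asarray(member_losses) < threshold))
    fpr = float(np.mean(np.asarray(nonmember_losses) < threshold))
    return tpr, fpr

# Toy data: training members tend to have lower loss than non-members,
# which is exactly the signal membership inference exploits.
rng = np.random.default_rng(0)
member_losses = rng.normal(loc=0.5, scale=0.2, size=1000)
nonmember_losses = rng.normal(loc=1.0, scale=0.3, size=1000)
tpr, fpr = loss_threshold_attack(member_losses, nonmember_losses, 0.7)
print(f"TPR={tpr:.2f}, FPR={fpr:.2f}")
```

The gap between TPR and FPR is a quantitative measure of leakage: a model that memorizes its training data is easy to attack, while a well-generalizing or DP-trained model pushes the two rates together.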

How It Works

Privacy Meter employs a configuration-driven approach, using YAML files to specify models, datasets, and privacy games. It supports multiple auditing methodologies, including membership inference, range membership inference, and dataset usage cardinality inference, to detect leakage of individual training points, of points in the vicinity of training data, and of the fraction of a dataset used in training. The library can also audit lower bounds on a model's differential privacy (DP) guarantees.
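
As a rough illustration of that configuration-driven flow, an audit config might look like the sketch below. Every key name here is hypothetical; consult the repository's sample configurations for the actual schema.

```yaml
# Hypothetical audit configuration -- field names are invented for
# illustration and do NOT reflect Privacy Meter's real schema.
run:
  random_seed: 1234
  output_dir: results/demo
data:
  dataset: cifar10
model:
  architecture: cnn
audit:
  game: membership_inference   # which privacy game to play
  num_reference_models: 4
```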

Quick Start & Requirements

  • Install via pip install -r requirements.txt or conda env create -f env.yaml.
  • Supports various datasets (CIFAR10, AG News, etc.) and models (CNN, MLP, GPT-2, etc.).
  • Integration with HuggingFace datasets and transformers is supported via custom dataset and model files.
  • Custom training scripts can be integrated; an example using a fast training library that reaches high accuracy quickly is included.
  • For auditing pre-trained models, a specific directory structure and models_metadata.json file are required.
  • Official documentation and sample configurations are available.
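
For the pre-trained-model workflow, the required metadata file presumably maps each saved model to its training details. A purely hypothetical sketch follows; the field names are invented, so refer to the official documentation for the real format of models_metadata.json.

```json
{
  "model_metadata": {
    "0": {
      "model_path": "models/model_0.pt",
      "dataset": "cifar10",
      "num_train_samples": 25000
    }
  }
}
```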

Highlighted Details

  • Audits a wide range of ML algorithms including classification, regression, computer vision, and NLP.
  • Implements advanced auditing strategies beyond basic membership inference.
  • Integrates a fast training library achieving state-of-the-art training speed and accuracy.
  • Audit results include detailed attack outcomes, ROC curves, and timing logs.

Maintenance & Community

  • Developed at NUS Data Privacy and Trustworthy Machine Learning Lab.
  • Welcomes community contributions.
  • Discussion channel available via Slack.
  • Key research papers underpinning the library are cited.

Licensing & Compatibility

  • The README does not explicitly state the license.

Limitations & Caveats

  • The library's license is not specified in the README, which may impact commercial use or closed-source integration.

Health Check

  • Last Commit: 4 months ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

  • 9 stars in the last 30 days
