Library for generating/evaluating synthetic tabular data for privacy, fairness, augmentation
Top 56.9% on sourcepulse
Synthcity is a comprehensive Python library for generating and evaluating synthetic tabular, time-series, survival, and image data. It targets researchers and practitioners needing to augment datasets, ensure privacy, or test fairness, offering a wide array of state-of-the-art generative models and robust evaluation metrics.
How It Works
Synthcity employs a plugin-based architecture, allowing easy integration of diverse generative models including GANs (AdsGAN, CTGAN, PATEGAN), VAEs (TVAE, RTVAE), Normalizing Flows, Bayesian Networks, and LLM-based models (GReaT). It supports specialized generators for time-series and survival data, alongside privacy-focused methods like DECAF and DP-GAN. The library also provides a rich suite of evaluation metrics for data quality, privacy, and fairness.
Quick Start & Requirements
pip install synthcity
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 month ago
1 week