neosync  by nucleuscloud

Data security platform for anonymizing PII and syncing data across environments

created 1 year ago
3,897 stars

Top 12.8% on sourcepulse

GitHubView on GitHub
Project Summary

Neosync is an open-source platform designed to help developers safely test code against production data by anonymizing Personally Identifiable Information (PII) and orchestrating synthetic data synchronization across environments. It addresses the challenges of reproducing production bugs locally, ensuring data quality in lower environments, and meeting compliance requirements like GDPR and HIPAA.

How It Works

Neosync employs an asynchronous, event-sourcing model for robust job management, including automatic retries and playback. It supports generating synthetic data based on schemas, anonymizing existing production data, and subsetting databases using SQL queries while maintaining referential integrity. Configurations are declarative and GitOps-friendly, allowing integration into CI pipelines for automated database hydration.

Quick Start & Requirements

  • Install/Run: Clone the repository and run make compose/up using docker compose.
  • Prerequisites: Docker installed and running.
  • Setup: Minimal setup time, primarily dependent on Docker installation.
  • Resources: Docs, Discord.

Highlighted Details

  • Supports custom data transformers via JavaScript or LLMs.
  • Offers pre-built integrations with Postgres, MySQL, and S3.
  • Handles referential integrity automatically for data synchronization.
  • Declarative, GitOps-based configurations for CI integration.

Maintenance & Community

The project is actively maintained with community contributions encouraged via Discord and GitHub issues for feature requests and bug reports.

Licensing & Compatibility

Released under the MIT expat license, permitting commercial use and integration with closed-source projects.

Limitations & Caveats

The README does not detail specific limitations or known issues. The project appears to be in active development, and users should consult the roadmap and community channels for the latest status.

Health Check
Last commit

20 hours ago

Responsiveness

1 day

Pull Requests (30d)
22
Issues (30d)
4
Star History
65 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Daniel Han Daniel Han(Cofounder of Unsloth), and
1 more.

airweave by airweave-ai

0.6%
3k
Semantic MCP server for AI agents
created 7 months ago
updated 1 day ago
Feedback? Help us improve.