Open source data onboarding platform for businesses using CSV files
Top 41.3% on sourcepulse
YoBulk is an open-source, AI-driven data onboarding platform designed to streamline CSV data import for businesses. It offers a no-code solution for creating import buttons, smart column matching, custom validation rules, and a review interface, aiming to be a free alternative to commercial solutions like Flatfile.com.
How It Works
YoBulk is a full-stack Next.js application utilizing MongoDB for data storage. It processes CSV files, offering features like smart auto-matching between CSV columns and template columns, custom validation rules (including regex), and streaming capabilities for large files up to 1GB. Its AI integration, powered by OpenAI, provides auto-suggestions for error correction and aims to build a knowledge graph for data mapping decisions.
Quick Start & Requirements
git clone https://github.com/yobulkdev/yobulkdev.git && cd yobulkdev && docker-compose up -d
(Requires OpenAI API key for AI features).docker run --rm -it -p 5050:5050/tcp --env="OPENAI_SECRET_KEY=****" yobulk/yobulk
(Requires local MongoDB instance).git clone https://github.com/yobulkdev/yobulkdev && cd yobulkdev && yarn install && yarn run dev
(Requires local MongoDB instance and OpenAI API key in .env
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is actively developing, with features like custom LLM models and data mapping knowledge graphs listed as "Coming Soon." The README explicitly states it does not claim to outperform Flatfile.com in functionality or design at present.
2 years ago
1 day