Private data assistant for conversational data manipulation
Top 75.3% on sourcepulse
DataDM offers a private, conversational interface for data analysis, enabling users to load, clean, transform, and visualize data without writing code. It targets users who prioritize data privacy and seek an intuitive, AI-powered data manipulation tool.
How It Works
DataDM leverages a persistent Jupyter kernel backend for executing data manipulation code generated by a large language model (LLM). Users interact via natural language, and the LLM translates these requests into Python code executed within the kernel. This approach allows for complex data operations and visualizations through a conversational interface while maintaining a stateful session for iterative analysis.
Quick Start & Requirements
docker run -p 7860:7860 -it ghcr.io/approximatelabs/datadm:latest
(for OpenAI models) or docker run --gpus all -p 7860:7860 -it ghcr.io/approximatelabs/datadm:latest-cuda
(for local StarChat models).pip install datadm
or pip install "datadm[cuda]"
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
LLMs are prone to hallucination, requiring users to verify generated results. GGML-based CPU-only mode and rollback functionality are listed as "Work in Progress." Support for additional data sources like SQL and S3 is planned but not yet implemented.
10 months ago
Inactive