BambooAI by pgalko

Python library for conversational data analysis using LLMs

Created 2 years ago

757 stars

Top 46.1% on SourcePulse

2 Experts Love This Project

ishaan-jaff

Cofounder of LiteLLM

AntonOsika

Cofounder of Lovable

Project Summary

BambooAI is a Python library designed for conversational data discovery and analysis, empowering users to interact with datasets using natural language. It targets data analysts, researchers, and power users seeking to streamline workflows and derive insights without extensive coding, offering both web UI and Jupyter notebook interfaces.

How It Works

BambooAI employs a multi-agent system that breaks down data analysis into six sequential steps: initiation, task routing, user feedback, dynamic prompt building, debugging and execution, and results presentation. It leverages LLMs for tasks like code generation, error correction, and planning, with options for internet search and knowledge base integration via vector databases.

Quick Start & Requirements

Install via pip: pip install bambooai
Configure environment variables (.env) and LLM settings (LLM_CONFIG.json).

Example usage:

import pandas as pd
from bambooai import BambooAI
df = pd.read_csv('titanic.csv')
bamboo = BambooAI(df=df, planning=True, vector_db=False, search_tool=True)
bamboo.pd_agent_converse()

Supports API-based models (OpenAI, Gemini, Anthropic, etc.) and local models (Ollama, VLLM).
Requires API keys for chosen LLM providers.

Highlighted Details

Supports custom data frame ontologies (RDF/OWL) for enhanced understanding.
Offers a planning agent for complex task decomposition.
Includes self-healing/error correction capabilities for generated code.
Provides a web application interface and Docker deployment option.

Maintenance & Community

Contributions are welcome via pull requests.
Development is ongoing with planned future improvements.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README.

Limitations & Caveats

The project is described as experimental.
API keys are required for most model providers, and their setup is crucial for functionality.
The README mentions that if no LLM configuration is provided, execution will fail.

Health Check

Last Commit

1 week ago

Responsiveness

1 week

Pull Requests (30d)

1

Issues (30d)

0

Star History

10 stars in the last 30 days

Explore Similar Projects

datasetGPT by radi-cho

CLI tool for generating textual/conversational datasets using LLMs

Created 2 years ago

Updated 2 years ago

DataHorse by DeDolphins

Data science tool for conversational data analysis using LLMs

Created 1 year ago

Updated 1 year ago

chatbi by chatbi

Chat interface for BI analysis using LLMs

Created 2 years ago

Updated 1 day ago

langchain-GLM_Agent by jayli

Agentic tool for local knowledge base QA using custom LLM

Created 2 years ago

Updated 2 years ago

Auto-Analyst by FireBird-Technologies

AI data science platform automating complex workflows

Created 11 months ago

Updated 3 months ago

Starred by

Rodrigo Nader

Rodrigo Nader(Cofounder of Langflow) and

Jesse Clark

Jesse Clark(Cofounder of Marqo).

griptape by griptape-ai

Python framework for AI agents and workflows

Created 3 years ago

Updated 2 days ago

Advanced-QA-and-RAG-Series by Farzad-R

LLM chatbots for Q\&A using agents and RAG

Created 1 year ago

Updated 2 weeks ago

airda by hitsz-ids

Multi-agent system for data analysis

Created 2 years ago

Updated 1 year ago

DeepBI by DeepInsight-AI

AI-native data analysis platform using LLMs

Created 2 years ago

Updated 9 months ago

BotSharp by SciSharp

.NET framework for AI agent application development

Created 8 years ago

Updated 2 days ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI), and

2 more.

open_deep_research by langchain-ai

Open-source research assistant for automated deep research, generating comprehensive reports

Created 1 year ago

Updated 4 months ago

Starred by

Andrej Karpathy

Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n),

Anton Troynikov

Anton Troynikov(Cofounder of Chroma), and

47 more.

llama_index by run-llama

Data framework for building LLM-powered agents

Created 3 years ago

Updated 3 days ago

Feedback? Help us improve.