pandas-ai  by sinaptik-ai

Python SDK for conversational data analysis using LLMs and RAG

Created 2 years ago
22,100 stars

Top 1.9% on SourcePulse

GitHubView on GitHub
Project Summary

PandasAI is a Python library designed to make data analysis conversational, enabling users to query databases, CSVs, and datalakes using natural language. It targets both technical and non-technical users, aiming to simplify data interaction and accelerate analysis workflows through Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG).

How It Works

PandasAI integrates with LLMs to interpret natural language queries and translate them into executable code (e.g., Python/Pandas or SQL). It supports RAG for enhanced context and accuracy, allowing it to query various data sources. The library can also generate visualizations based on user prompts, and it supports querying across multiple DataFrames. A key feature is its Docker sandbox for secure code execution.

Quick Start & Requirements

Highlighted Details

  • Supports querying SQL databases, CSVs, and Parquet files.
  • Enables natural language querying across multiple DataFrames.
  • Offers a Docker sandbox for secure code execution.
  • Can generate data visualizations based on natural language prompts.

Maintenance & Community

  • Beta Notice: Release v3 is currently in beta.
  • Community: Discord server available for discussions.
  • Contributing: Guidelines are provided for contributions.

Licensing & Compatibility

  • License: MIT Expat license for the core library. The pandasai/ee directory has a separate license.
  • Commercial Use: Compatibility for commercial use is implied by the MIT license for the core library, but specific terms for enterprise offerings should be confirmed.

Limitations & Caveats

The project is currently in beta (v3), meaning features and functionality are in progress and subject to change. The default LLM (BambooLLM) requires an API key, and while other LLMs can be configured, this is not explicitly detailed in the README.

Health Check
Last Commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
11
Star History
293 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.