Library for LLM-driven data visualization and infographic generation
Top 15.8% on sourcepulse
LIDA is a Python library designed for the automatic generation of data visualizations and infographics using large language models (LLMs). It targets data scientists, researchers, and developers who need to quickly create, edit, explain, and evaluate visualizations from data, offering a grammar-agnostic approach compatible with various visualization libraries and LLM providers.
How It Works
LIDA treats visualizations as code, enabling programmatic generation, execution, editing, explanation, and repair. It first summarizes a given dataset, then generates potential visualization goals based on the summary (optionally with a persona), and finally creates visualization code (e.g., for Matplotlib, Seaborn, Altair) that can be executed. This approach allows for iterative refinement and analysis of visualizations through natural language commands.
Quick Start & Requirements
pip install -U lida
export OPENAI_API_KEY=<your key>
).pip install lida[infographics]
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
LIDA currently works best with datasets having a small number of columns (<= 10) due to LLM context limitations. It assumes datasets are preprocessed and suitable for loading into pandas DataFrames. Smaller LLMs may have limited instruction-following capabilities, with LIDA performing best with larger models like GPT-3.5/4. The infographic generation is experimental.
11 months ago
Inactive