AutoViz  by AutoViML

Automatic data visualizer for any dataset

Created 6 years ago
1,843 stars

Top 23.4% on SourcePulse

GitHubView on GitHub
Project Summary

AutoViz is an open-source Python library designed for automated exploratory data analysis (EDA) and visualization. It enables users to generate insightful visualizations from any dataset, regardless of size, with a single line of code, making it accessible for both beginners and experts. The library also includes functionality for assessing and fixing data quality issues.

How It Works

AutoViz automates the process of creating multiple visualizations by analyzing data patterns, trends, and relationships. It supports various chart formats including static (PNG, SVG, JPG) via Matplotlib, interactive Bokeh charts, and server-based dashboards. For large datasets, it employs statistically valid sampling to maintain performance. The library also offers data quality assessment and correction capabilities through its FixDQ() function.

Quick Start & Requirements

  • Install via pip: pip install autoviz
  • For specific Python versions, use requirements.txt (Python < 3.10), requirements-py310.txt (Python 3.10), or requirements-py311.txt (Python 3.11+).
  • Usage example:
    from autoviz import AutoViz_Class
    AV = AutoViz_Class()
    AV.AutoViz(filename="your_file.csv", depVar="your_target_variable", chart_format="bokeh")
    
  • Official documentation and examples are available on GitHub.

Highlighted Details

  • Visualizes any dataset size using statistically valid sampling.
  • Supports static, interactive (Bokeh), and server-based chart formats.
  • Includes a FixDQ() function for data quality assessment and correction.
  • Customizable through arguments like chart_format, verbose, max_rows_analyzed, and max_cols_analyzed.

Maintenance & Community

The project is actively maintained by @AutoViML and collaborators. Contributions are welcomed via pull requests on GitHub.

Licensing & Compatibility

AutoViz is released under the Apache License, Version 2.0. This license permits commercial use and linking with closed-source projects.

Limitations & Caveats

The project is not an official Google project and is provided without warranty. Users are advised to select the appropriate requirements file based on their Python version for optimal compatibility.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
10 stars in the last 30 days

Explore Similar Projects

Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), and
9 more.

lilac by databricks

0.1%
1k
Data exploration tool for LLM dataset curation and quality control
Created 2 years ago
Updated 1 year ago
Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Dominik Moritz Dominik Moritz(Research Scientist at Apple; Professor at CMU), and
8 more.

pygwalker by Kanaries

0.1%
15k
Interactive UI for Pandas dataframes in Jupyter
Created 2 years ago
Updated 2 months ago
Feedback? Help us improve.