pygraphistry  by graphistry

GPU-accelerated Python library for visual graph analysis and AI

Created 10 years ago
2,421 stars

Top 18.7% on SourcePulse

GitHubView on GitHub
Project Summary

PyGraphistry is an open-source Python library designed for data scientists and developers to visualize, analyze, and scale large graph datasets. It offers GPU-accelerated processing, dataframe-native data ingestion, and a novel graph query language (GFQL), enabling rapid exploration and AI-driven insights from complex relationships.

How It Works

The library processes graph data directly from common Python dataframes (Pandas, Spark, RAPIDS cuDF) and leverages Apache Arrow for efficient columnar operations. It integrates optional, significant speedups (100X+) via RAPIDS for GPU acceleration. A key innovation is GFQL, a dataframe-native query language for relationship analysis without traditional databases, complemented by streamlined graph ML/AI workflows.

Quick Start & Requirements

  • Installation: Core functionality: pip install graphistry. With AI features: pip install graphistry[ai].
  • Prerequisites: Optional GPU acceleration requires RAPIDS and/or a GPU-ready Graphistry server. Python 3.x is assumed.
  • Resources: Links to interactive demos, visualization quickstarts, and comprehensive documentation are available.

Highlighted Details

  • Performance: Achieves 100X+ speedups with optional RAPIDS-based GPU acceleration.
  • GFQL: Introduces a dataframe-native graph query language for complex relationship analysis.
  • AI/ML Integration: Streamlined methods for clustering, UMAP embeddings, GNNs, and automatic feature engineering.
  • Connectors: Supports integration with various data platforms, SQL databases, graph databases, and Python libraries.
  • LLM Assistance: graphistry-skills package enhances AI coding assistants for PyGraphistry tasks.

Maintenance & Community

The project encourages community involvement via a Slack channel and provides contribution guidelines. Support is available through GitHub Issues, with dedicated enterprise support via Graphistry ZenDesk. Updates are shared on Twitter and LinkedIn.

Licensing & Compatibility

The provided README does not explicitly state the software license. This omission requires clarification for assessing commercial use or closed-source linking compatibility.

Limitations & Caveats

The README focuses on features and does not detail known limitations, alpha status, or specific bugs. The primary adoption consideration is the dependency on optional GPU hardware or a Graphistry server for optimal performance, and the unstated licensing terms.

Health Check
Last Commit

18 hours ago

Responsiveness

Inactive

Pull Requests (30d)
16
Issues (30d)
2
Star History
13 stars in the last 30 days

Explore Similar Projects

Starred by Tobi Lutke Tobi Lutke(Cofounder of Shopify), Dominik Moritz Dominik Moritz(Research Scientist at Apple; Professor at CMU), and
8 more.

pygwalker by Kanaries

0.1%
16k
Interactive UI for Pandas dataframes in Jupyter
Created 3 years ago
Updated 1 week ago
Feedback? Help us improve.