pixeltable by pixeltable

AI data infrastructure for multimodal apps using declarative, incremental approach

Created 2 years ago

1,581 stars

Top 26.1% on SourcePulse

View on GitHub

3 Experts Love This Project

Travis Fischer

Founder of Agentic

Eric Zhu

Coauthor of AutoGen; Research Scientist at Microsoft Research

Wes McKinney

Author of Pandas

Project Summary

Pixeltable provides a declarative data infrastructure for multimodal AI applications, addressing the complexity of stitching together disparate tools for data ingestion, transformation, indexing, and orchestration. It targets AI engineers and researchers building production-ready multimodal applications, offering a unified framework to simplify data plumbing and accelerate development.

How It Works

Pixeltable operates as a database, storing metadata and computed results persistently. Users define data processing and AI workflows declaratively using computed columns on tables. The engine automatically handles data ingestion (referencing files in place), transformation via Python UDFs or built-in operations, AI model integration for inference, and vector index creation for semantic search. Its core advantage lies in incremental computation, ensuring only necessary recomputations occur when data or code changes, alongside automatic versioning and lineage tracking.

Quick Start & Requirements

Install via pip: pip install pixeltable
Requires Python 3.8+
Supports Linux, macOS, and Windows.
See Installation and Quick Start.

Highlighted Details

Unified multimodal interface for images, video, audio, and documents.
Declarative computed columns for automatic processing and AI model integration.
Built-in vector search and similarity indexing.
Supports Python UDFs and agentic workflows with LLM tool calling.
Persistent storage with automatic versioning and lineage tracking.

Maintenance & Community

Active development with a public roadmap for cloud infrastructure and deployment.
Community support available via Discord.
Contributions are welcomed via their Contributing Guide.

Licensing & Compatibility

Licensed under the Apache 2.0 License.
Permissive license suitable for commercial use and integration into closed-source projects.

Limitations & Caveats

The project is actively under development, with a roadmap indicating future cloud features. While it supports various AI integrations, specific model compatibility or performance tuning for niche use cases may require custom UDFs.

Health Check

Last Commit

23 hours ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

37 stars in the last 30 days