bytewax  by bytewax

Python framework for stateful stream processing

created 3 years ago
1,787 stars

Top 24.6% on sourcepulse

GitHubView on GitHub
Project Summary

Bytewax is a Python-first framework for stateful stream processing, designed to simplify complex event-driven applications and online machine learning. It targets Python developers seeking to leverage familiar tools for scalable, distributed dataflow pipelines, offering an alternative to Java-centric frameworks like Flink and Spark.

How It Works

Bytewax employs a dataflow computational model, allowing users to define pipelines using Python operators and connectors. It distinguishes itself with a Pythonic interface, enabling seamless integration with the Python ecosystem. The framework manages distributed state, provides fault tolerance, and supports event-time windowing for advanced analytics. A Rust-based engine underpins its performance, while the waxctl CLI tool facilitates deployment and management across various infrastructures, including Kubernetes.

Quick Start & Requirements

  • Install Bytewax via pip: pip install bytewax
  • Install waxctl for deployment management.
  • Requires Python.
  • Official documentation and examples are available.

Highlighted Details

  • Python-first API for leveraging existing libraries and tooling.
  • Stateful stream processing with automatic state recovery and fault tolerance.
  • Scalable from local development to multi-node, distributed deployments.
  • Rich connector ecosystem for various data sources and sinks.
  • Flexible dataflow API with stateless, stateful, windowing, and join operators.

Maintenance & Community

  • Active community on Slack for support and discussion.
  • Contributions are welcomed via GitHub issues and a contribution guide.
  • Follows a Code of Conduct.

Licensing & Compatibility

  • Licensed under the Apache-2.0 license.
  • Permissive license suitable for commercial use and integration with closed-source applications.

Limitations & Caveats

The README mentions a "commercially licensed Platform" for scaling, implying potential limitations or additional costs for advanced enterprise features beyond the open-source offering. Specific details on these commercial offerings are not elaborated within the provided text.

Health Check
Last commit

4 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
73 stars in the last 90 days

Explore Similar Projects

Starred by Adam Wolff Adam Wolff(Claude Code Core; MTS at Anthropic), Samuel Colvin Samuel Colvin(Author of Pydantic, Pydantic Logfire, PydanticAI), and
3 more.

anthropic-sdk-python by anthropics

0.7%
2k
Python SDK for Anthropic's REST API
created 2 years ago
updated 23 hours ago
Starred by Nat Friedman Nat Friedman(Former CEO of GitHub), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
7 more.

langflow by langflow-ai

2.9%
93k
Visual tool for AI agent and workflow creation/deployment
created 2 years ago
updated 1 day ago
Feedback? Help us improve.