risingwave  by risingwavelabs

Stream processing and serving for AI agents and real-time data applications

Created 4 years ago
7,710 stars

Top 6.7% on SourcePulse

GitHubView on GitHub
Project Summary

RisingWave is an event streaming platform designed for agentic AI and real-time applications. It addresses the complexity and latency of traditional event streaming stacks (like Debezium, Kafka, Flink, and a serving DB) by offering a unified system that ingests, processes, and serves data incrementally with sub-100ms freshness and low-latency query responses.

How It Works

RisingWave performs incremental computation, recomputing only affected results when upstream data changes, enabling always-up-to-date materialized views with sub-100ms end-to-end freshness. It unifies ingestion from diverse sources like webhooks, databases (via CDC), event streams (Kafka, Pulsar, Kinesis), and historical storage under a SQL interface. Processed data is served from an internal row store at 10-20ms p99 latency, while long-term retention and analytical access are managed via native integration with Apache Iceberg™, including automated table maintenance. This architecture replaces complex, multi-system stacks with a single, efficient system.

Quick Start & Requirements

A quick start can be initiated with curl -L https://risingwave.com/sh | sh. Further deployment options, including Docker Compose and Kubernetes (Helm/Operator), are detailed in the quick start guide. No specific hardware or software prerequisites beyond standard system requirements are detailed in the README for basic setup.

Highlighted Details

  • Unified ingestion from webhooks, CDC (PostgreSQL, MySQL), Kafka, Pulsar, Kinesis, and S3.
  • Incremental computation for materialized views with end-to-end freshness under 100ms.
  • Low-latency serving (10-20ms p99) via an internal row store, accessible via PostgreSQL wire protocol.
  • Native integration with Apache Iceberg™ for durable storage and analytical queries, with automated table maintenance.
  • Cost-efficient architecture leveraging object storage for internal state and tables, with optional elastic disk cache for low-latency serving.

Maintenance & Community

Community support and discussions are available via Slack. Development guidelines are provided in the RisingWave Developer Guide.

Licensing & Compatibility

The project is licensed under the Apache License 2.0. This permissive license generally allows for commercial use and integration into closed-source applications without requiring the derived work to be open-sourced.

Limitations & Caveats

The provided README does not explicitly detail limitations, alpha/beta status, known bugs, or unsupported platforms. Users should consult the project's issue tracker or community channels for potential caveats.

Health Check
Last Commit

1 day ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 30 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Maxime Beauchemin Maxime Beauchemin(Author of Apache Airflow, Superset; Founder of Preset), and
3 more.

bytewax by bytewax

0.3%
2k
Python framework for stateful stream processing
Created 4 years ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Chaoyu Yang Chaoyu Yang(Founder of Bento).

seatunnel by apache

0.2%
9k
High-performance multimodal data integration
Created 8 years ago
Updated 1 day ago
Starred by Clement Delangue Clement Delangue(Cofounder of Hugging Face), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
26 more.

datasets by huggingface

0.1%
21k
Access and process large AI datasets efficiently
Created 6 years ago
Updated 1 day ago
Feedback? Help us improve.