risingwave  by risingwavelabs

Stream processing and serving for AI agents and real-time data applications

Created 4 years ago
9,047 stars

Top 5.7% on SourcePulse

GitHubView on GitHub
Project Summary

RisingWave is an event streaming platform designed for agentic AI and real-time applications. It addresses the complexity and latency of traditional event streaming stacks (like Debezium, Kafka, Flink, and a serving DB) by offering a unified system that ingests, processes, and serves data incrementally with sub-100ms freshness and low-latency query responses.

How It Works

RisingWave performs incremental computation, recomputing only affected results when upstream data changes, enabling always-up-to-date materialized views with sub-100ms end-to-end freshness. It unifies ingestion from diverse sources like webhooks, databases (via CDC), event streams (Kafka, Pulsar, Kinesis), and historical storage under a SQL interface. Processed data is served from an internal row store at 10-20ms p99 latency, while long-term retention and analytical access are managed via native integration with Apache Iceberg™, including automated table maintenance. This architecture replaces complex, multi-system stacks with a single, efficient system.

Quick Start & Requirements

A quick start can be initiated with curl -L https://risingwave.com/sh | sh. Further deployment options, including Docker Compose and Kubernetes (Helm/Operator), are detailed in the quick start guide. No specific hardware or software prerequisites beyond standard system requirements are detailed in the README for basic setup.

Highlighted Details

  • Unified ingestion from webhooks, CDC (PostgreSQL, MySQL), Kafka, Pulsar, Kinesis, and S3.
  • Incremental computation for materialized views with end-to-end freshness under 100ms.
  • Low-latency serving (10-20ms p99) via an internal row store, accessible via PostgreSQL wire protocol.
  • Native integration with Apache Iceberg™ for durable storage and analytical queries, with automated table maintenance.
  • Cost-efficient architecture leveraging object storage for internal state and tables, with optional elastic disk cache for low-latency serving.

Maintenance & Community

Community support and discussions are available via Slack. Development guidelines are provided in the RisingWave Developer Guide.

Licensing & Compatibility

The project is licensed under the Apache License 2.0. This permissive license generally allows for commercial use and integration into closed-source applications without requiring the derived work to be open-sourced.

Limitations & Caveats

The provided README does not explicitly detail limitations, alpha/beta status, known bugs, or unsupported platforms. Users should consult the project's issue tracker or community channels for potential caveats.

Health Check
Last Commit

14 hours ago

Responsiveness

Inactive

Pull Requests (30d)
261
Issues (30d)
66
Star History
103 stars in the last 30 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), Maxime Beauchemin Maxime Beauchemin(Author of Apache Airflow, Superset; Founder of Preset), and
3 more.

bytewax by bytewax

0.4%
2k
Python framework for stateful stream processing
Created 4 years ago
Updated 1 week ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Chaoyu Yang Chaoyu Yang(Founder of Bento).

seatunnel by apache

0.2%
9k
High-performance multimodal data integration
Created 8 years ago
Updated 1 day ago
Starred by Clement Delangue Clement Delangue(Cofounder of Hugging Face), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
26 more.

datasets by huggingface

0.1%
22k
Access and process large AI datasets efficiently
Created 6 years ago
Updated 1 day ago
Feedback? Help us improve.