RisingWave

Viewing Revision #27 from 2023-09-20 02:07 View Current

RisingWave is an open-source distributed streaming database targeting real-time analytics and event-driven applications. It uses the incremental computation model to process streaming data with low latency. RisingWave implements a traditional change propagation framework to keep user-defined materialized views up-to-date. An incremental checkpoint mechanism is used to ensure data consistency. It also has an elastic multi-node architecture with separate data and compute nodes. RisingWave is built from scratch with Rust and is wire compatible with PostgreSQL.[05][06][07]

Logo Versions

Website: https://www.risingwave.com[01]
Source Code: https://github.com/risingwavelabs/risingwave[02] Accessed: Jul 21, 2026 Last Commit: Jul 21, 2026
Tech Docs: https://docs.risingwave.com/get-started/intro[03]
Developer: RisingWave Labs Inc.
Country of Origin: US
Start Year: 2021
Project Types: Commercial, Open Source
Written in: Rust
Supported Languages: Go, Java, JavaScript, Python
Compatible With: PostgreSQL
Operating System: Linux
License: Apache v2
Twitter: @RisingWaveLabs[04]

Database Entry

RisingWave

Viewing Revision #27 from 2023-09-20 02:07 View Current

Streaming

History[08][09][10]

RisingWave Database is built by RisingWave Labs (formerly known as Singularity Data), a database systems startup founded in 2021 by former IBM researcher and Amazon Redshift engineer Yingjun Wu.

While working at Amazon Redshift, Wu noticed that existing database systems cannot process streaming data efficiently and existing streaming systems were too complicated for most companies to use. These observations motivated Wu to found RisingWave Labs with a mission to “democratize stream processing”.

Checkpoints[11][12][13]

Non-Blocking Consistent

RisingWave uses the Chandy–Lamport algorithm to create consistent checkpoints.

To ensure that data is correct and consistent, read queries always fetch data from the most recent checkpoint. This means RisingWave does not ensure read-after-write consistency.

A local shared buffer is used to stage uncommitted write batches submitted by operators. The storage manager will notify all operators to commit their buffered writes into the shared storage when the checkpoint trigger message has reached all operators.

Compression[14][15]

Naïve (Page-Level) Prefix Compression

RisingWave applies both naive compression and prefix compression at the block-level. It uses LZ4 and Zstd for naive compression.

Concurrency Control[16]

Not Supported

RisingWave does not support concurrency control in its compute engine. However, it does employ Multi-version Concurrency Control (MVCC) in its storage engine.

Data Model[17]

Relational

RisingWave uses a relational data model. Its relational tables are composed of a list of strongly-typed columns. All columns are implicitly nullable. The supported primitive data types are boolean, integer, fixed-point and floating-point numbers, strings, and temporals. Composite data types of struct and list are also supported.

Foreign Keys

Not Supported

Indexes[16][18]

Not Supported

RisingWave does not support traditional index data structures. Instead, indexes are implemented as specialized materialized views. RisingWave stores materialized views as key-value pairs in log-structured merge trees.

Isolation Levels

Not Supported

Joins[19][20][21]

Nested Loop Join Hash Join Index Nested Loop Join

RisingWave supports hash join, nested loop join, and index nested loop join (also called lookup join). RisingWave has two execution modes: the batch-query mode and the streaming mode. All three join strategies are used in the batch-query mode, while only hash join and lookup join are used in the streaming mode. The supported join types are inner join, left outer join, right outer join, and full outer join. Window joins are also available as part of RisingWave's support for time window functions.

Logging[12]

Not Supported

Parallel Execution[22][23][24][25]

Bushy

RisingWave supports bushy parallelism in both its batch-query and streaming modes. Consistent hashing is employed to partition data for parallel execution.

Query Compilation

Not Supported

Query Execution[17][26]

Vectorized Model

RisingWave adopts the top-to-bottom vectorized processing model. Operators emit a Data Chunk in batch-query mode and a Stream Chunk in streaming mode. A Data Chunk consists of multiple columns and a visibility array (represents each row's visibility status for row filtering purposes). A Stream Chunk consists of multiple columns, a visibility array, and an additional ops column marking an operation on each row. Each entry in the ops column can be one of Delete, Insert, UpdateDelete, or UpdateInsert.

Query Interface[27][28]

SQL

The RisingWave SQL query interface is mostly compatible with PostgreSQL. It has client libraries in Java, Node.js, Python, and Go. RisingWave is wire compatible with PostgreSQL and can be accessed with PostgreSQL terminal psql. As a cloud-native database, RisingWave can be integrated with cloud services such as Confluent Cloud, DataStax, and Grafana Cloud. To support stream processing, RisingWave can be integrated with the following message brokers or streaming services: Kafka, Redpanda, Apache Pulsar, DataStax Astra Streaming, StreamNative Cloud, and Kinesis Data Streams.

Storage Architecture[13][16]

Disk-oriented

RisingWave has a disk-oriented storage architecture. Files are directly written to an S3-compatible shared storage service by default. RisingWave also supports local drives, Google Cloud Storage, and HDFS/WebHDFS as shared storage destinations.

Storage Model[16][17][29]

N-ary Storage Model (Row/Record)

RisingWave uses the row store format. Each row is encoded into key-value entries.

Storage Organization[16]

Log-structured

RisingWave uses a LSM-Tree based key-value storage engine that provides MVCC read and write capabilities. All key-value pairs are stored in block-based sorted strings tables (SST). Each SST consists of two files:

The .data file is composed of 64KB blocks containing the actual key-value pairs.
The .meta file contains metadata such as min-max index, Bloom filter, and per-block metadata.

Stored Procedures[30]

Supported

RisingWave offers UDF implemented as external functions in Python.

System Architecture[22][31]

Shared-Everything

RisingWave adopts a disaggregated architecture to support elastically scaling the compute nodes without migrating the storage. A RisingWave database is composed of four key layers:

A serving layer that parses, plans, and optimizes SQL queries.
A processing layer that performs all the data computations and updates.
A persistence layer that stores and retrieves data on an object storage service (such as S3).
A meta server that coordinates the operations between the serving layer and the processing layer.

The serving, processing, and persistent layers each consist of multiple nodes. Compute nodes in the processing layer execute the optimized query plan (generated by the serving layer) in parallel. Compactor nodes in the persistent layer compact data with consistent hashing and upload the resulting SST files to shared storage.

The meta server is responsible for many tasks, including but not limited to:

Managing the stream graph of RisingWave
Managing barrier injection and collection for initiating checkpoint capture
Manages the cluster information, such as the address and status of nodes
Detecting failure by periodically sending heartbeats to serving-layer and processing-layer nodes

Views[32][33][34]

Virtual Views Materialized Views

RisingWave supports both views and materialized views. New materialized views can be created based on existing materialized views. RisingWave updates materialized views incrementally as new data is fed into the database.

Citations

34 sources

RisingWave | Streaming Infrastructure for Agentic AI risingwave.com Modified: 2026-07-10 Accessed: 2026-07-18
GitHub - risingwavelabs/risingwave: Event streaming platform for agentic AI. Continuously ingest, transform, and serve event streams in real time, at scale. · GitHub github.com Accessed: 2026-06-05
What is RisingWave? - RisingWave risingwave.com Modified: 2026-06-05 Accessed: 2026-06-05
https://twitter.com/RisingWaveLabs twitter.com
https://www.risingwave.com/products/RisingWaveDatabase/ risingwave.com Dead — Check Archive Modified: 2026-06-01 Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/intro risingwave.com Dead — Check Archive Accessed: 2026-06-02
risingwave/docs at main · risingwavelabs/risingwave · GitHub github.com Accessed: 2026-06-02
https://www.risingwave.com/company/ risingwave.com Dead — Check Archive Modified: 2026-06-01 Accessed: 2026-06-02
Streaming data processing platform RisingWave lands $36M to launch a cloud service | TechCrunch techcrunch.com Accessed: 2026-06-02
https://www.risingwave.com/blog/risingwave-A-Cloud-Native-Streaming-Database/ risingwave.com Dead — Check Archive Modified: 2026-06-01 Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/checkpoint.md github.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/fault-tolerance risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/data-persistence risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/state-store-overview.md#compaction github.com Dead — Check Archive Accessed: 2026-06-02
risingwave/src/storage/src/hummock/sstable/block.rs at main · risingwavelabs/risingwave · GitHub github.com Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/state-store-overview.md github.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/data-model-and-encoding.md github.com Dead — Check Archive Accessed: 2026-06-02
Shared Indexes and Joins in Streaming Databases | RisingWave risingwave.com Modified: 2026-06-01 Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/tree/main/src/batch/src/executor/join github.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/sql-select#parameters risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/sql-function-time-window#window-joins risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/architecture-design.md github.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/consistent-hash.md github.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/batch-local-execution-mode.md github.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/streaming-overview.md github.com Dead — Check Archive Accessed: 2026-06-02
risingwave/src at main · risingwavelabs/risingwave · GitHub github.com Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/rw-integration-summary risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/risingwave-flink-comparison risingwave.com Dead — Check Archive Accessed: 2026-06-02
risingwave/integration_tests/iceberg-sink/README.md at main · risingwavelabs/risingwave · GitHub github.com Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/user-defined-functions#5-use-your-functions-in-risingwave risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/upcoming/architecture risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://github.com/risingwavelabs/risingwave/blob/main/docs/mv-on-mv.md github.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/current/key-concepts risingwave.com Dead — Check Archive Accessed: 2026-06-02
https://docs.risingwave.com/docs/upcoming/intro#real-time-results-via-materialized-views risingwave.com Dead — Check Archive Accessed: 2026-06-02

Revision #27 Last Updated: 2023-09-19 22:07