Hyrise

Viewing Revision #8 from 2019-10-16 13:00 View Current

Hyrise is a platform for research and education in the area of relational in-memory databases. The goal is to have a code base that is easy to understand and extend. This should make evaluations of new research concepts easier than it would be in full-featured DBMS products.[04]

Logo Versions

Website: https://hpi.de/plattner/projects/hyrise.html[01]
Source Code: https://github.com/hyrise/hyrise[02] Accessed: Jul 29, 2026 Last Commit: Jul 13, 2026
Tech Docs: https://github.com/hyrise/hyrise/wiki[03]
Developer: Hasso Plattner Institute
Country of Origin: DE
Start Year: 2009
Project Types: Academic, Open Source
Written in: C++
Supported Languages: SQL
Compatible With: PostgreSQL
Operating Systems: Linux, macOS
License: MIT License

Logo Versions

Website: https://hpi.de/plattner/projects/hyrise.html[01]
Source Code: https://github.com/hyrise/hyrise[02] Accessed: Jul 29, 2026 Last Commit: Jul 13, 2026
Tech Docs: https://github.com/hyrise/hyrise/wiki[03]
Developer: Hasso Plattner Institute
Country of Origin: DE
Start Year: 2009
Project Types: Academic, Open Source
Written in: C++
Supported Languages: SQL
Compatible With: PostgreSQL
Operating Systems: Linux, macOS
License: MIT License

Derivative Systems

Skyrise

Hyrise

Viewing Revision #8 from 2019-10-16 13:00 View Current

History[04]

The initial version of Hyrise -- presented in PVLDB 2011 -- focussed on optimizing the table layout in order to optimize CPU caching for a given workload. It used a workload-aware highly flexible partitioning approach to cluster data. This flexible data layout came with a high number of virtual function calls/several indirections for data accesses. Due to the grown code base and a shifted research focus, a new version of Hyrise has been built from scratch starting in 2016. The new version sets its focus more on topics like NUMA support (cf. chunked partitioning), NVRAM, SQL optimization (cf. query optimizer), and Self-Driving.

Compression

Dictionary Encoding Delta Encoding Run-Length Encoding Bit Packing / Mostly Encoding Null Suppression

Hyrise includes a compression framework based on C++ iterators and zero-cost abstractions. Such zero-cost abstractions avoid the runtime overheads of dynamic dispatching for increased compile times. As a heavy-weight compression algorithm, LZ4 is also supported.

Concurrency Control[05][06]

Multi-version Concurrency Control (MVCC)

For each row, three pieces of information are stored: The commit id of the transaction that successfully inserted the row (begin cid), that of the transaction that fully deleted it (end cid), and the transaction id of the transaction that currently modifies it. The linked Wiki page describes how these are used to calculate the row's visibility.

Data Model

Relational

Indexes[07][08]

Adaptive Radix Tree (ART)

Adaptive Radix Trees (ARTs) and Group Key Indexes are used. These are created on a per-chunk basis and different chunks can have different types of indexes. Additionally, probabilistic filters are used for access avoidance in cases where no index exists.

Isolation Levels

Snapshot Isolation

Joins

Nested Loop Join Hash Join Sort-Merge Join Semi Join Index Nested Loop Join

Query Compilation

Not Supported

A JIT-compiled execution engine was partially implemented. The underlying approach to transferring data from and to the JIT engine turned out to be a bottleneck, which was why it was ultimately removed.

Query Execution

Materialized Model

Most operators produce position lists that contain chunk ids and chunk offsets. Together with a pointer to the original table, these can be used to reference data without fully materializing it. The JIT engine (see above) uses a Tuple-at-a-Time Model.

Query Interface[09]

SQL Stored Procedures

The SQLPipeline is an interface that takes an SQL string and returns the result table(s). Hyrise uses its own SQL-Parser to translate SQL queries into an Abstract Syntax Tree (AST). These are converted into a logical query plan (LQP), which is optimized, translated into a physical query plan (PQP) and finally executed. The pipeline handles both query plan caching and stored procedures.

Storage Architecture

In-Memory

Storage Model[10]

Decomposition Storage Model (Columnar)

Hyrise started as one of the first databases with a hybrid memory layout. With the rewrite, only columnar storage is implemented. Hybrid layouts are planned to come back, but are not a high priority.

Stored Procedures

Supported

Stored Procedures are stored as named, static entries in the otherwise automatic query cache.

System Architecture

Shared-Everything

The current version of Hyrise runs on a single node. Replication, which was part of Hyrise1, is planned.

Views

Virtual Views

Views are stored as logical query plans (LQPs). For a query selecting from a view, the LQP is inserted at the position at which usually a stored table would appear before the optimizer is called. This makes it possible for the optimizer to optimize across view boundaries - for example pushing down additional predicates into the view.

Derivative Systems

Skyrise

Citations

10 sources

Hyrise hpi.de Accessed: 2026-07-16
GitHub - hyrise/hyrise: Hyrise is a research in-memory database. · GitHub github.com Accessed: 2026-06-04
Home · hyrise/hyrise Wiki · GitHub github.com Accessed: 2026-06-05
https://openproceedings.org/2019/conf/edbt/EDBT19_paper_152.pdf openproceedings.org Dead — Check Archive Modified: 2019-05-13 Accessed: 2026-06-07
https://imdm.ws/2014/papers/schwalb.pdf imdm.ws Dead — Check Archive Accessed: 2026-05-22
MVCC · hyrise/hyrise Wiki · GitHub github.com Accessed: 2026-05-22
http://www.adms-conf.org/faust_adms12.pdf adms-conf.org Dead — Check Archive Modified: 2020-09-24 Accessed: 2026-06-07
IndexesAndFilters · hyrise/hyrise Wiki · GitHub github.com Accessed: 2026-05-22
SQL · hyrise/hyrise Wiki · GitHub github.com Accessed: 2026-05-22
ComparisonToHyrise1 · hyrise/hyrise Wiki · GitHub github.com Accessed: 2026-05-22

Revision #8 Last Updated: 2019-10-16 09:00