AresDB

Viewing Revision #10 from 2019-11-28 22:02 View Current

AresDB is a GPU-based real-time analytics storage and query engine with low memory overhead, real-time upserts with primary key deduplication, and time series aggregations on both streaming and finite dimensional data.[01]

Logo Versions

Website: https://eng.uber.com/aresdb[01]
Source Code: https://github.com/uber/aresdb[02] Accessed: Jun 3, 2026 Last Commit: Apr 23, 2020
Tech Docs: https://github.com/uber/aresdb/wiki[03]
Developer: Uber Technologies, Inc.
Country of Origin: US
Start Year: 2018 [10]
Project Type: Open Source
Written in: C, C++, Go
Inspired By: Elasticsearch, HeavyDB, Kinetica, Ocelot, Pinot
Operating System: Linux
License: Apache v2

Logo Versions

Website: https://eng.uber.com/aresdb[01]
Source Code: https://github.com/uber/aresdb[02] Accessed: Jun 3, 2026 Last Commit: Apr 23, 2020
Tech Docs: https://github.com/uber/aresdb/wiki[03]
Developer: Uber Technologies, Inc.
Country of Origin: US
Start Year: 2018 [10]
Project Type: Open Source
Written in: C, C++, Go
Inspired By: Elasticsearch, HeavyDB, Kinetica, Ocelot, Pinot
Operating System: Linux
License: Apache v2

AresDB

Viewing Revision #10 from 2019-11-28 22:02 View Current

History[01]

Developed by Uber to meet their specific need "to make similar queries over relatively small, yet highly valuable, subsets of data (with maximum data freshness) at high QPS and low latency," with queries such as time series aggregations over geofences.

Checkpoints[04]

Fuzzy

Snapshots are triggered by either a certain number of mutations or a certain time frame specific to each table.

Compression[01]

Run-Length Encoding

Only compresses data with user defined sort orders that are low cardinality.

Data Model

Relational

Foreign Keys[05]

Supported

AresDB supports foreign key joins.

Hardware Acceleration[01]

GPU

Uses GPUs for query execution

Indexes[01]

Hash Table

Used mainly for primary key deduplication.

Joins[05][01]

Hash Join

Supports hash joins from fact tables (finite set data such as cities) to dimension tables (infinite streaming data such as rides). Also supports geospatial joins (i.e. geographically bounded area overlap) and normal foreign key joins. Note that AresDB uses late materialization for its joins, meaning the join may not be executed until a foreign key is accessed.

Logging[06]

Logical Logging

Log files contain description of database upserts which must be replayed to rebuild the database after a crash.

Parallel Execution[07]

Intra-Operator (Horizontal)

Executes queries with the one operation per kernel (OOPK) model.

Query Compilation

Not Supported

Query Execution

Vectorized Model

AresDB works with vector batches that are efficiently processed in parallel using the Thrust library.

Query Interface[05]

Custom API

Uses a proprietary execution language called Ares Query Language (AQL) which is based in the JSON format, making it compatible with any language that can handle files and/or JSON.

Storage Architecture

Hybrid

Data within the archival delay of a table is kept uncompressed in live batches, while everything else is stored in compressed archival batches. If new data is ingested that is outside the archival array, it's added to an archival backfill queue which will be inserted into the archived batches asynchronously.

Storage Model

Decomposition Storage Model (Columnar)

Storage Organization[08][09]

Sorted Files

Archived data is sorted in a user specified column order, and files are organized by UTC day and Unix time cutoffs.

Stored Procedures

Not Supported

System Architecture[01]

Shared-Disk

The CPU is only used to load information from storage into CPU memory and to distribute this data to GPU memory. The database system delegates each operator in a query to some GPU, so it's able to handle multiple GPUs by delegating different operations to different GPUs, each of which have completely separate memory. There are plans to implement proper distributed designs, but currently we're limited to a single system with multiple GPUs.

Views[05]

Materialized Views

AresDB uses late materialization for its joins, meaning that it may only physically execute the join once a foreign key is accessed.

Citations

10 sources

https://eng.uber.com/aresdb uber.com Dead — Check Archive Accessed: 2026-06-03
GitHub - uber/aresdb: A GPU-powered real-time analytics storage and query engine. · GitHub github.com Accessed: 2026-06-03
Home · uber/aresdb Wiki · GitHub github.com Accessed: 2026-06-05
Data Snapshot · uber/aresdb Wiki · GitHub github.com Accessed: 2026-05-30
Ares Query Language · uber/aresdb Wiki · GitHub github.com Accessed: 2026-05-30
Redo Logs · uber/aresdb Wiki · GitHub github.com Accessed: 2026-05-30
Query Execution · uber/aresdb Wiki · GitHub github.com Accessed: 2026-05-30
Data Archiving · uber/aresdb Wiki · GitHub github.com Accessed: 2026-05-30
Data Layout On Disk · uber/aresdb Wiki · GitHub github.com Accessed: 2026-05-30
travis yml setup github.com Modified: 2018-10-24 Accessed: 2026-05-26

Revision #10 Last Updated: 2019-11-28 17:02