Venice

Venice is a derived data storage platform that supports asynchronous ingestion from batch and streaming sources.

History

The Venice project started in 2014 at LinkedIn. It was originally a closed-source, internal system. Then LinkedIn released Venice's source code on Github in 2022.

Data Model

Document / XML

Fault Tolerance

Synchronous Replication

Query Interface

Custom API

Storage Format

Apache Parquet Apache Avro

System Architecture

Shared-Disk