TrailDB

Viewing Revision #14 from 2018-12-11 14:32

TrailDB is a portable, easy-to-use C library for storing and querying series of related events. It groups related events into time-ordered series and produces an immutable database with a high compression rate.[04]

Website: https://traildb.io/[01]
Source Code: https://github.com/traildb/traildb[02] Accessed: Jun 4, 2026 Last Commit: Nov 1, 2019
Tech Docs: https://traildb.io/docs[03]
Developer: AdRoll Inc.
Country of Origin: US
Start Year: 2014 [10]
Project Types: Commercial, Open Source
Written in: C
Supported Languages: C, D, Go, Haskell, Python, R
License: MIT License

TrailDB is designed to complement existing relational databases and key-value stores, and it targets OLAP workloads such as analyzing usage patterns, predicting user behavior, and detecting anomalies. One key design feature is that a database is immutable once produced. Immutability in turn enables the second key feature, data compression: TrailDB exploits the similarity among time-ordered events to achieve high compression. Together, these two features allow TrailDB to perform well on OLAP workloads.

Checkpoints

Not Supported

TrailDB does not support checkpoints as each database is immutable once produced.

Compression[05]

Delta Encoding, Run-Length Encoding, Prefix Compression

First, events within a trail are always sorted by time, so TrailDB uses Delta Encoding to compress the 64-bit timestamps.
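As a minimal sketch of the idea, assuming nothing about TrailDB's actual on-disk format, delta encoding stores the first timestamp and then only the gap to each following one. The function names and sample timestamps below are hypothetical and chosen for illustration.

```python
from typing import List


def delta_encode(timestamps: List[int]) -> List[int]:
    """Store the first timestamp, then each difference from its predecessor."""
    if not timestamps:
        return []
    deltas = [timestamps[0]]
    for prev, cur in zip(timestamps, timestamps[1:]):
        deltas.append(cur - prev)
    return deltas


def delta_decode(deltas: List[int]) -> List[int]:
    """Rebuild the original timestamps with a running sum."""
    timestamps = []
    total = 0
    for d in deltas:
        total += d
        timestamps.append(total)
    return timestamps


# Hypothetical trail: Unix timestamps, already sorted by time.
trail_timestamps = [1544538720, 1544538725, 1544538725, 1544538790]
encoded = delta_encode(trail_timestamps)
decoded = delta_decode(encoded)
```

Because the input is sorted, every delta is a small non-negative number, which needs far fewer bits than a full 64-bit timestamp.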

Second, events are grouped by UUID, which usually represents a logical entity such as an online shopping customer. Events within a trail therefore tend to be predictable, and TrailDB encodes only the changes in behavior from one event to the next. This is similar to, but not exactly the same as, Run-Length Encoding.
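The following sketch illustrates change-only encoding on a toy trail. It is an assumption-laden illustration rather than TrailDB's real encoder: events are modeled as plain dictionaries that all carry the same fields, and the helper names are hypothetical.

```python
from typing import Dict, List

Event = Dict[str, str]


def encode_changes(events: List[Event]) -> List[Event]:
    """For each event, keep only the fields whose value differs from the previous event."""
    encoded = []
    previous: Event = {}
    for event in events:
        changed = {k: v for k, v in event.items() if previous.get(k) != v}
        encoded.append(changed)
        previous = event
    return encoded


def decode_changes(encoded: List[Event]) -> List[Event]:
    """Replay the recorded changes on top of the last known state."""
    events = []
    current: Event = {}
    for changed in encoded:
        current = {**current, **changed}
        events.append(current)
    return events


# Hypothetical trail of one shopper: the browser never changes, the page rarely does.
events = [
    {"browser": "firefox", "page": "/home"},
    {"browser": "firefox", "page": "/cart"},
    {"browser": "firefox", "page": "/cart"},
]
encoded_events = encode_changes(events)
```

A repeated event costs nothing beyond its timestamp, which is why predictable trails compress well.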

Third, Huffman Coding, which is a kind of Prefix Compression method, is used to encode the skewed, low-entropy distributions of values.
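To show why a skewed distribution helps, here is a small textbook Huffman coder. It is a generic sketch, not TrailDB's implementation; the function name and the sample values are hypothetical.

```python
import heapq
from collections import Counter
from typing import Dict, List


def huffman_code(values: List[str]) -> Dict[str, str]:
    """Build a prefix-free bit code in which frequent values get shorter codes."""
    freq = Counter(values)
    if not freq:
        return {}
    if len(freq) == 1:
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, tie_breaker, {value: code_so_far}).
    heap = [(w, i, {v: ""}) for i, (v, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        # Merge the two lightest subtrees, prefixing their codes with 0 and 1.
        merged = {v: "0" + c for v, c in left.items()}
        merged.update({v: "1" + c for v, c in right.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]


# Hypothetical skewed field: most events are page views.
actions = ["view"] * 8 + ["click"] * 3 + ["purchase"] * 1
codes = huffman_code(actions)
total_bits = sum(len(codes[a]) for a in actions)
```

In this sample the most common value receives a one-bit code and the two rarer values receive two-bit codes, so the twelve values fit in 16 bits.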

Concurrency Control

Not Supported

Because each TrailDB is an immutable file, modifications are not allowed. A single process produces the database, and no one can issue read operations before creation is finalized. Thus, there are no concurrent writes for TrailDB to control.

Data Model[06]

Relational

TrailDB adopts a specific relational data model.

Each database is a collection of trails.

Each trail is identified by a 128-bit user-defined ID and an automatically assigned trail ID. Each trail consists of a sequence of events ordered by time.

Each event consists of a 64-bit timestamp and several user-pre-defined fields.

Each field takes its value from a set of possible values.
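The hierarchy above can be sketched with a few Python dataclasses. This is an in-memory illustration only: the class names, the `add` method, and the first-seen assignment of trail IDs are assumptions made for the example, not the TrailDB API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence
from uuid import UUID


@dataclass
class Event:
    timestamp: int            # a 64-bit timestamp in the real format
    values: Dict[str, str]    # one value per pre-defined field


@dataclass
class Trail:
    uuid: UUID                # 128-bit user-defined ID
    trail_id: int             # automatically assigned ID
    events: List[Event] = field(default_factory=list)


class ToyTrailCollection:
    """Hypothetical in-memory stand-in for a database of trails."""

    def __init__(self, field_names: Sequence[str]) -> None:
        self.field_names = list(field_names)
        self.trails: Dict[UUID, Trail] = {}

    def add(self, uuid: UUID, timestamp: int, values: Sequence[str]) -> None:
        if len(values) != len(self.field_names):
            raise ValueError("one value is required per pre-defined field")
        trail = self.trails.get(uuid)
        if trail is None:
            # Assumption for this sketch: IDs are handed out in first-seen order.
            trail = Trail(uuid=uuid, trail_id=len(self.trails))
            self.trails[uuid] = trail
        trail.events.append(Event(timestamp, dict(zip(self.field_names, values))))
        trail.events.sort(key=lambda e: e.timestamp)  # keep the trail time-ordered


db = ToyTrailCollection(["action", "page"])
user = UUID(int=1)
db.add(user, 1544538790, ["click", "/cart"])
db.add(user, 1544538720, ["view", "/home"])
```

Events may arrive in any order, but each trail always exposes them sorted by timestamp.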

Foreign Keys

Not Supported

In TrailDB, each database consists of a collection of trails, each identified by a UUID. A database does not contain multiple tables, and there are no constraints across databases. Thus, it does not support foreign keys.

Indexes[07]

Hash Table

This feature was introduced in TrailDB 0.6. [TODO]

Isolation Levels

Serializable

While a database is being created, a single process handles it and no other process can access it. Once produced, the database is a read-only immutable file: anyone can issue read requests, but no one can issue write operations. In this sense, its behavior is equivalent to the serializable isolation level.

Logging[08]

Not Supported

TrailDB does not support logging. A single process creates the database, and there is no recovery handler if that process crashes during creation. In that case, users must restart the creation process from the very beginning.

However, TrailDB allows merging existing TrailDBs into a new immutable database. This approach is suggested when there is a huge number of input events.
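Conceptually, merging combines the trails of several databases by UUID while preserving time order. The sketch below models each database as a plain dictionary; the type aliases and the `merge_databases` helper are hypothetical and do not correspond to TrailDB's own merge facility.

```python
import heapq
from typing import Dict, List, Tuple

Event = Tuple[int, str]              # (timestamp, payload)
Database = Dict[str, List[Event]]    # UUID string -> time-ordered events


def merge_databases(*databases: Database) -> Database:
    """Combine databases trail by trail, keeping each merged trail time-ordered."""
    merged: Database = {}
    for uuid in sorted({u for db in databases for u in db}):
        trails = [db[uuid] for db in databases if uuid in db]
        # heapq.merge assumes every input trail is already sorted.
        merged[uuid] = list(heapq.merge(*trails))
    return merged


# Hypothetical inputs, e.g. two batches built independently.
batch_1 = {"a": [(1, "x"), (5, "y")]}
batch_2 = {"a": [(3, "z")], "b": [(2, "w")]}
merged_db = merge_databases(batch_1, batch_2)
```

Building several smaller databases and merging them limits how much work is lost if one creation process crashes.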

Query Interface[09]

Custom API

TrailDB does not support a standard SQL query interface. Instead, it offers query APIs in several programming languages: C, Go, Python, R, Haskell, and D. TrailDB also provides a query engine called trck, which uses a domain-specific language to aggregate metrics over events that share the same UUID.
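The kind of per-UUID aggregation described here can be sketched in plain Python. The code below uses neither the real language bindings nor trck syntax; the data layout and the `count_matching` helper are assumptions made purely for illustration.

```python
from collections import Counter
from typing import Dict, List, Tuple

Event = Tuple[int, Dict[str, str]]    # (timestamp, field values)
Trails = Dict[str, List[Event]]       # UUID string -> time-ordered events


def count_matching(trails: Trails, field_name: str, value: str) -> Counter:
    """Count, for every trail, the events whose field equals the given value."""
    counts: Counter = Counter()
    for uuid, events in trails.items():
        counts[uuid] = sum(1 for _, values in events if values.get(field_name) == value)
    return counts


# Hypothetical data: two users and their actions.
trails = {
    "user-a": [(1, {"action": "view"}), (2, {"action": "click"}), (3, {"action": "click"})],
    "user-b": [(1, {"action": "view"})],
}
clicks = count_matching(trails, "action", "click")
```

The query walks one trail at a time, which matches how events are grouped by UUID in the data model.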

Stored Procedures

Not Supported

Because each TrailDB is a read-only immutable file, it does not support stored procedures.

System Architecture

Embedded

Views

Not Supported

TrailDB does not support views. However, because each database is an immutable file, users can create "views" by extracting data from existing TrailDBs into another immutable database.
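A rough sketch of this workaround, assuming a database can be modeled as a mapping from UUID to a sequence of events: the extracted "view" is a new read-only structure rather than a live query. The `make_view` helper and the sample data are hypothetical.

```python
from types import MappingProxyType
from typing import Callable, Dict, Mapping, Sequence, Tuple

Event = Tuple[int, str]    # (timestamp, action)


def make_view(
    db: Mapping[str, Sequence[Event]],
    predicate: Callable[[Event], bool],
) -> Mapping[str, Tuple[Event, ...]]:
    """Copy the matching events into a new read-only mapping."""
    view: Dict[str, Tuple[Event, ...]] = {}
    for uuid, events in db.items():
        kept = tuple(e for e in events if predicate(e))
        if kept:
            view[uuid] = kept
    return MappingProxyType(view)


# Hypothetical source database and a "view" that keeps only purchases.
source = {
    "user-a": [(1, "view"), (2, "purchase")],
    "user-b": [(1, "view")],
}
purchases = make_view(source, lambda event: event[1] == "purchase")
```

Unlike a relational view, the result is a snapshot: it does not change if a newer source database is produced later.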

Citations

10 sources

[01] TrailDB. traildb.io. Modified: 2021-05-11. Accessed: 2026-07-19.
[02] GitHub - traildb/traildb: TrailDB is an efficient tool for storing and querying series of events. github.com. Accessed: 2026-06-05.
[03] TrailDB Docs. traildb.io. Modified: 2021-04-21. Accessed: 2026-06-05.
[04] Introduction to TrailDB. slides.com. Accessed: 2026-06-07.
[05] http://traildb.io/docs/technical_overview/. traildb.io. Dead link (check archive). Accessed: 2026-05-23.
[06] Technical Overview - TrailDB Docs. traildb.io. Modified: 2021-04-21. Accessed: 2026-06-07.
[07] traildb/tdbcli/tdb_index.c at master, traildb/traildb. github.com. Accessed: 2026-05-23.
[08] traildb/doc/docs/api.md at master, traildb/traildb. github.com. Accessed: 2026-05-24.
[09] GitHub - traildb/trck: Query engine for TrailDB. github.com. Accessed: 2026-05-24.
[10] implement basic support for reading lines and fields. github.com. Modified: 2014-06-10. Accessed: 2026-05-21.

Revision #14 Last Updated: 2018-12-11 09:32