OpenTSDB

OpenTSDB is a distributed, scalable Time Series Database (TSDB) written on top of HBase. OpenTSDB was written to address a common need: store, index, and serve metrics collected from computer systems (network gear, operating systems, applications) at a large scale, and make this data easily accessible and graphable. OpenTSDB is the first open-source monitoring system built on an open-source distributed database.

OpenTSDB is written in Java because HBase, its underlying storage layer, is written in Java.

History

OpenTSDB was originally written by Benoit Sigoure in 2010 to monitor metrics of the StumbleUpon search engine, which required storing over 1 billion data points per day. StumbleUpon led the initial development and the open-source release. Yahoo! currently maintains OpenTSDB together with the open-source community.

Storage Organization

Sorted Files

Foreign Keys

Not Supported

System Architecture

Shared-Disk

Data Model

Key/Value

Stored Procedures

Not Supported

Query Compilation

Not Supported

Parallel Execution

Inter-Operator (Vertical)

Concurrency Control

Timestamp Ordering

OpenTSDB makes writes idempotent so that a restarted writer does not create duplicate rows. Each row covers a fixed timestamp boundary, so when a writer reconnects to HBase, it always writes to the row determined by the data point's timestamp instead of creating a new row.
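
The idempotent write path described above can be illustrated with a minimal Java sketch. It is not OpenTSDB's actual code: the class `RowBucketSketch`, its methods, and the in-memory map standing in for an HBase table are all hypothetical, and the one-hour row width is an assumption, since the text only says the boundary is fixed. Real row keys also encode tags, which are left out here.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: names are illustrative, not OpenTSDB's API.
final class RowBucketSketch {
    // Assumed row width in seconds (one hour); the text only says it is fixed.
    static final long ROW_SPAN_SECONDS = 3600L;

    // Stand-in for an HBase table: row key -> (offset within row -> value).
    private final Map<String, Map<Long, Double>> rows = new TreeMap<>();

    // Round a timestamp down to the start of the row that contains it.
    static long baseTime(long timestampSeconds) {
        return timestampSeconds - Math.floorMod(timestampSeconds, ROW_SPAN_SECONDS);
    }

    // The row key depends only on the metric and the row's base time,
    // never on which writer sends the point or when it is sent.
    static String rowKey(String metric, long timestampSeconds) {
        return metric + ":" + baseTime(timestampSeconds);
    }

    // Writing the same point twice overwrites the same cell, so replays are harmless.
    void put(String metric, long timestampSeconds, double value) {
        long offset = timestampSeconds - baseTime(timestampSeconds);
        rows.computeIfAbsent(rowKey(metric, timestampSeconds), k -> new TreeMap<>())
            .put(offset, value);
    }

    int rowCount() {
        return rows.size();
    }

    int pointCount() {
        int total = 0;
        for (Map<Long, Double> row : rows.values()) {
            total += row.size();
        }
        return total;
    }
}
```

Because the row key is a pure function of the metric and the timestamp, a writer that restarts and resends a data point lands on the same row and the same cell. Under these assumptions, replaying a point changes neither `rowCount()` nor `pointCount()`.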

Storage Architecture

Disk-oriented