Prometheus

Viewing Revision #33 from 2018-12-09 07:09 View Current

Prometheus is an open-source time series database developed by SoundCloud, and serves as the storage layer for the Prometheus monitoring system. Inspired by the Gorilla system at Facebook, Prometheus is specially designed for monitoring and metric collection. Prometheus contains a user-defined multi-dimensional data model and a query language on multi-dimensional data called PromQL. Apart from local disk storage, Prometheus also has remote storage integrations via Protocol Buffer.

Logo Versions

Website: https://prometheus.io[01]
Source Code: https://github.com/prometheus/prometheus[02] Accessed: Jul 21, 2026 Last Commit: Jul 21, 2026
Tech Docs: https://prometheus.io/docs/introduction/overview[03]
Developer: SoundCloud
Country of Origin: DE
Start Year: 2012
Project Type: Open Source
Written in: Go
Supported Languages: Go
License: Apache v2

Prometheus is written in Go and supports Go/Java/Ruby/Python clients. Prometheus also has unofficial client bindings for many other languages.

Logo Versions

Website: https://prometheus.io[01]
Source Code: https://github.com/prometheus/prometheus[02] Accessed: Jul 21, 2026 Last Commit: Jul 21, 2026
Tech Docs: https://prometheus.io/docs/introduction/overview[03]
Developer: SoundCloud
Country of Origin: DE
Start Year: 2012
Project Type: Open Source
Written in: Go
Supported Languages: Go
License: Apache v2

Compatible Systems

VictoriaMetrics

Timbala

QuestDB

View All (9)

Prometheus

Viewing Revision #33 from 2018-12-09 07:09 View Current

Prometheus is written in Go and supports Go/Java/Ruby/Python clients. Prometheus also has unofficial client bindings for many other languages.

History

Prometheus was started in 2012 in SoundCloud as an open-source project for system monitoring, therefore the system requires an efficient and fault-tolerant storage layer for incoming metrics as well as metadata for these metrics. Thus they built the Prometheus time series database as the backend for the whole monitoring platform.

The Prometheus time series database has gone through three major versions. Prometheus v1 is a basic implementation, where all time series data and label metadata are stored in LevelDB. V2 addressed several shortcomings of v1 by storing time series data on a per time series basis and adoption of delta-of-delta compression. V3 made further improvements by implementing write ahead logging and better data block compaction.

Checkpoints[04]

Non-Blocking Consistent

Prometheus supports periodic checkpoints, which is done every two hours by default. Checkpoints in Prometheus is done by compacting write ahead logs in the most recent time range and include the latest checkpoint if it exists. All the checkpoints are stored in the same directory with the name checkpoint.xxx, where the xxx suffix is a number monotonically increasing. Therefore when Prometheus recovers from crash, it can restore the checkpoints in the checkpoint directory with the order as the suffixes.

Compression[05]

Delta Encoding

Since each sample in Prometheus can be viewed as a tuple of a timestamp and a numerical value, therefore Prometheus has different compression techniques for timestamps and value respectively.

For the compression of timestamps, the algorithm that Prometheus uses is similar to that of Facebook's Gorilla time-series database, called delta-of-delta compression algorithm. For example, given a series of timestamp 1496163646, 1496163676, 1496163706, 1496163735, 1496163765, storing this timestamps in raw bytes are not efficient since these values only change very little over time. A better approach is to encode timestamp with deltas, i.e. 1496163646, +30, +30, +29, +30. Due to the fact that metrics usually come in a constant rate, Prometheus adopts the delta-of-delta encoding, i.e. 1496163646 +30 +0 -1 +1. If metrics come in a constant rate, then most of these delta-of-deltas will become 0.

In addition to timestamp compression, Prometheus also compresses numerical values. Its approach is similar to existing floating point compression algorithms. The idea is that the XOR value of neighboring floating point data in a time series often has clustered 0s. Therefore the compression algorithm leverages this fact to compress numerical values.

With regard to integration with remote storage engines, Prometheus uses a snappy-compressed protocol buffer encoding over HTTP for both read and write protocols.

Concurrency Control

Not Supported

Data Model[06]

Key/Value

Prometheus stores data as time series. A time series is defined by a metric and a set of key-value labels. A data sample is a data point at a given timestamp, including a float64 value and a unix timestamp. Therefore a time series can be formally defined as <metric>{<label_1>=<value1>, <label_2>=<value2>...}.

Prometheus supports the following metric types:

Counter: monotonically increasing/decreasing data, e.g. the number of requests. It supports Inc() and Add(float) operations.
Gauge: numeric data point, e.g. CPU usage. It supports Set(float), Inc()/Dec(), Add(float)/Sub(float), and SetToCurrentTime() operations.
Histogram: samples in a given time range, e.g. request latencies. It supports Observe(float) operation.
Summary: similar to histogram, but only stores quantile data. Its supports Observe(float) operation.

Indexes[07]

Inverted Index (Full Text)

In Prometheus, a series can have labels and the total number of series can be very large. Therefore it is necessary to build an index that returns a list of series given a label. Prometheus uses inverted indexes for this purpose. For example, if a query will fetch all the series with the label "backend", it does not have to iterate through all the series. On the other hand, it can directly get all the series with the label "backend" by a single query to the inverted index.

With respect to the index on the timestamp dimension, Prometheus partitions horizontally, i.e. the timestamp dimension, into non-overlapping blocks. Therefore, each block contains the data for series in that time window. For a timestamp query over a series, Prometheus does a sequential scan over the blocks of the series.

Joins

Not Supported

Logging[08]

Physical Logging

Prometheus ensures data durability by write ahead logging (WAL). The format of how logs are stored on disk in Prometheus is largely inspired by LevelDB/RocksDB. A data sample's log record is a triple (series_id, timestamp, value), therefore Prometheus does physical logging.

Query Compilation

Not Supported

Query Execution[09]

Tuple-at-a-Time Model

Prometheus internally use a SeriesIterator interface to fetch series samples from disk.

Query Interface[10]

Custom API HTTP / REST

Prometheus has a custom query language called PromQL, which is specially designed to query time-series data. Prometheus query interface also implements math/datetime related functions as well as aggregation. Prometheus also provides a RESTful interface over HTTP.

Storage Architecture[11]

Disk-oriented

Prometheus supports two storage architectures. The first is local disk storage: data are compressed and stored on local disk. The second is remote storage: Prometheus supports third-party storage (e.g. Kafka, PostgreSQL, Amazon S3) via ProtoBuffer adaptor.

Storage Model

N-ary Storage Model (Row/Record)

Since the underlying data representation of each series in Prometheus is a list of key-value pair, the storage model of Prometheus is similar to normal key-value databases, e.g. LevelDB and RocksDB.

Storage Organization

Sorted Files

Prometheus maintains database files under the trunk directory. Each file contains metadata of the min/max timestamps of the data samples in the file. Timestamps in all the files jointly comprises the whole time series.

Stored Procedures

Not Supported

System Architecture[03]

Shared-Disk

Prometheus supports flexible configuration to choose backend storage service. Prometheus itself maintains a on-disk checkpoint of series data and also supports remote read/write to other storage systems, making Prometheus's integration with other systems much easier.

Views

Not Supported

Compatible Systems

VictoriaMetrics

Timbala

QuestDB

View All (9)

Citations

11 sources

Prometheus - Monitoring system & time series database prometheus.io Accessed: 2026-07-17
GitHub - prometheus/prometheus: The Prometheus monitoring system and time series database. · GitHub github.com Accessed: 2026-05-22
https://prometheus.io/docs/introduction/overview prometheus.io Accessed: 2026-06-05
tsdb/checkpoint.go at master · prometheus-junkyard/tsdb · GitHub github.com Accessed: 2026-05-22
http://www.vldb.org/pvldb/vol8/p1816-teller.pdf vldb.org Dead — Check Archive Modified: 2015-06-20 Accessed: 2026-06-07
https://prometheus.io/docs/concepts/data_model prometheus.io Accessed: 2026-06-05
https://github.com/prometheus/tsdb/blob/master/docs/format/index.md github.com Dead — Check Archive Accessed: 2026-05-22
tsdb/docs/format/wal.md at master · prometheus-junkyard/tsdb · GitHub github.com Accessed: 2026-05-22
tsdb/querier.go at master · prometheus-junkyard/tsdb · GitHub github.com Accessed: 2026-05-22
https://prometheus.io/docs/prometheus/latest/querying/basics prometheus.io Accessed: 2026-06-05
https://prometheus.io/docs/prometheus/latest/storage prometheus.io Accessed: 2026-06-05

Revision #33 Last Updated: 2018-12-09 02:09