IoTDB

View Current Viewing Revision #13 from 02/14/2024 8:32 a.m.

IoTDB is a specialized database management system for time series data generated by a network of IoT devices with low computational power. It targets a workload that has high-frequency data write, large-volume data storage, and complex analytical queries. IoTDB supports queries that are common in monitoring and collecting metrics in IoT devices, namely filtering by predicates, query by time range, group aggregation, and data sample. Data in IoTDB is stored in TsFile, a file format designed for accessing, compressing, and storing time series data. Its storage is organized in LSM based structure catering to write throughput.

IoTDB supports a Java and Python APIs, as well as a command-line interface. IoTDB provides supports data analysis systems such as Spark, Hadoop, Hive, and Grafana.

History

IoTDB is a project started in 2017 by Prof. Jianmin Wang’s group in the School of Software of Tsinghua University and China’s National Engineering Laboratory for Big Data Software. The project entered incubation in Apache Foundation in November 2018.

The project evolves from a prior project of the same group called TsFile. TsFile is a columnar storage format optimized for storing time series data. IoTDB uses TsFile as its underlying storage format.

Compression

Delta Encoding Run-Length Encoding Naïve (Record-Level)

Encoding

IoTDB uses different encoding methods for different data types.

RLE (Run-Length Encoding): INT32, INT64, FLOAT, DOUBLE, BOOLEAN

Suitable for the sequence of integer values and low-precision floating-point values that appear monotonic.

TS_2DIFF (Second-order Differential Encoding): INT32, INT64

Default encoding for time series data.

Regular Data Encoding: INT32, INT64

Suitable for fixed interval increasing sequence like time series.

GORILLA Encoding: FLOAT, DOUBLE

Suitable for floating-point values with small variance.

Compression

After encoding, data is cast to a binary stream; the binary stream is then compressed with SNAPPY.

Concurrency Control

Not Supported

As IoTDB does not support transaction, it has a bare-bone concurrency control implementation with read locks and write locks. Their implementation does not follow a Two-Phase Locking protocol, as there are cases where a lock is acquired after another lock is released previously in the same function, and example is included in citation 1 of this section. IoTDB uses Java's native ReentrantReadWriteLock in the implementation.

To avoid access conflict when concurrently reading or writing to user or role, IoTDB has HashLock implemented for user manager and lock manager. A HashLock lock is a wrapper around a fixed number of ReentrantReadWriteLock locks. By default, it initializes with an array of 100 ReentrantReadWriteLock locks. Each applicable database object corresponds to one lock in the array, according to hash value of the object. This avoids conflicts resulted from concurrent access of same database object, user or role in this case, while in the same time limit the amount of resource needed to managing those locks.

Data Model

Hierarchical

Storage groups and time series in IoTDB can be created with any number of prefixes, which decides the depth of the node in the hierarchy tree and the path to its storage location. With similar prefixes, time series from one storage group can be continuously written to same file. Uses of prefixes in time series name in IoTDB also allows users to issue a coarser-grained query on data from certain level of hierachy.

Foreign Keys

Not Supported

Indexes

Log-Structured Merge Tree

KV-Match Index for pattern matching queries and PISA for aggregation queries.

Isolation Levels

Serializable

Joins

Not Supported

Join does not fit in the workload which IoTDB is designed for. Rather, it supports filtering and aggregation, which are common use scenarios in device monitoring and performance metric collection.

Logging

Logical Logging

Physical query plans are serialized and stored as logs. Write-Ahead-Logging with only REDO records.

SQL-like customized query language. It has different naming conventions of database objects. Storage Group can be assimilated to a table but can be expressed in a tree hierarchy with prefixes in the path. A time series can be assimilated to a column; in IoTDB, it is usually an attribute of a device, containing a sequence of pairs of timestamp and corresponding values.

IoTDB uses Antlr 4 to translate query statements to logical plan operators and then to physical plan operators.

Storage Architecture

Disk-oriented

Storage Model

Decomposition Storage Model (Columnar)

IoTDB's underlying TsFile is a columnar storage file format. It is similar to CarbonData and Parquet but designed for time series data.

Storage Organization

Log-structured

Storage based on LSM (Log-Structured Merge Tree) to provide better write throughput.

Stored Procedures

Supported

SQL-like PREPARE statement is supported.

System Architecture

Embedded

The overall IoTDB follows a client-server architecture. IoTDB client resides in the sensors(IoT devices) of the system, handling data collection and sending data to IoTDB server. Client can sync its data collected every user-configured interval with the server using Sync Tool; this allows data collected by the sensor to constantly being persisted in server, where the data can then be used for native query or shipped to other open-source platform for data analysis. Currently support single node server deployment. The group is working in progress to support shared-nothing cluster. IoTDB currently supports writing to HDFS.

Views

Not Supported

Checkpoints

Non-Blocking Fuzzy

There is not an explicit checkpointing operation in IoTDB; whereas, crash recovery is handled with continuous recycle of WAL. IoTDB makes assumptions on the workload and access pattern of queries: it assumes that writes are mostly single record insert, updates on device metrics are rare, and there is no multi-query transaction.

Every write operation to the database occurs in an in-memory buffer data structure named memtable. Each memtable corresponds to a WAL file. Once a memtable persists on disk, the WAL corresponds to it will be deleted. This is essentially a checkpoint, as all the log exists are all logs that the database needs to replay when recovering from a crash, and all the logs that are deleted belong to queries that have already been committed.

Revision #13 | Updated 02/14/2024 8:32 a.m.