TileDB uses a multi-dimensional array format that handles both sparse and dense data. An array is composed of fragments, where each fragment is a snapshot of the array containing only the cells written by a single write operation. Fragments are either sparse or dense. Sparse fragments store their cells in a global order. Dense fragments store their cells in regular chunks of the index space.
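The fragment model can be illustrated with a small sketch. This is not the TileDB API: `Fragment`, `ToyArray`, and their methods are hypothetical names invented for illustration, and the rule that the most recent write wins where fragments overlap is an assumption of this sketch.

```python
from dataclasses import dataclass, field


@dataclass
class Fragment:
    """Hypothetical stand-in for the snapshot produced by one write."""
    timestamp: int
    cells: dict  # maps (row, col) coordinates to a value


@dataclass
class ToyArray:
    """Toy array made of an append-only list of fragments."""
    fragments: list = field(default_factory=list)

    def write(self, cells):
        # Each write creates a new fragment; earlier fragments are never modified.
        self.fragments.append(Fragment(len(self.fragments) + 1, dict(cells)))

    def read(self):
        # Assumption: where fragments overlap, the most recent write wins.
        merged = {}
        for fragment in sorted(self.fragments, key=lambda f: f.timestamp):
            merged.update(fragment.cells)
        return merged


array = ToyArray()
array.write({(0, 0): 1, (0, 1): 2})   # first fragment
array.write({(0, 1): 20, (3, 3): 5})  # second fragment overlaps cell (0, 1)
result = array.read()
```

Here a read merges the two fragments, so cell `(0, 1)` takes its value from the second write while the other cells are returned unchanged.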
Decomposition Storage Model (Columnar)
TileDB uses a columnar format: the values of each attribute in an array are stored separately from those of the other attributes.
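As a rough illustration of what a columnar layout means, the sketch below regroups row-oriented records by attribute. The attribute names `temperature` and `humidity` and the helper `to_columnar` are hypothetical, and the code says nothing about TileDB's on-disk format.

```python
# Row-oriented: each cell keeps all of its attribute values together.
rows = [
    {"temperature": 21.5, "humidity": 40},
    {"temperature": 22.0, "humidity": 42},
    {"temperature": 19.8, "humidity": 55},
]


def to_columnar(records):
    """Regroup records so that each attribute's values are contiguous."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columnar(rows)

# A query that needs only one attribute can now skip the others entirely.
temperatures = columns["temperature"]
```

Keeping each attribute contiguous also tends to help compression, because neighbouring values share a type and often a similar range.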
Dictionary Encoding, Run-Length Encoding
TileDB supports the following compressors: GZIP, Zstandard, LZ4, RLE, Bzip2, and Double-delta. Double-delta was created for TileDB and is similar to the compression scheme used in Facebook's Gorilla system.
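The general idea behind double-delta (delta-of-delta) encoding can be sketched as follows. This is a minimal illustration of the technique on a list of integers, not TileDB's implementation: the function names are hypothetical, and the bit-packing of the resulting small values is omitted.

```python
def double_delta_encode(values):
    """Store the first value, the first delta, then differences between deltas."""
    if len(values) < 2:
        return list(values)
    first_delta = values[1] - values[0]
    encoded = [values[0], first_delta]
    prev_delta = first_delta
    for prev, cur in zip(values[1:], values[2:]):
        delta = cur - prev
        encoded.append(delta - prev_delta)  # delta of deltas
        prev_delta = delta
    return encoded


def double_delta_decode(encoded):
    """Invert double_delta_encode by accumulating deltas, then values."""
    if len(encoded) < 2:
        return list(encoded)
    values = [encoded[0], encoded[0] + encoded[1]]
    delta = encoded[1]
    for delta_of_delta in encoded[2:]:
        delta += delta_of_delta
        values.append(values[-1] + delta)
    return values


timestamps = [1000, 1010, 1020, 1030, 1041]
encoded = double_delta_encode(timestamps)  # [1000, 10, 0, 0, 1]
decoded = double_delta_decode(encoded)
```

For regularly spaced values such as timestamps, most of the encoded entries are zero or close to it, which is what makes them cheap to store in a few bits each.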
https://github.com/TileDB-Inc/TileDB
https://people.csail.mit.edu/stavrosp/papers/vldb2017/VLDB17_TileDB.pdf
TileDB Inc, Intel Labs
2017
TileDB Inc