Dictionary Encoding Delta Encoding Run-Length Encoding Bit Packing / Mostly Encoding
BlazingSQL supports compressing and decompressing directly on the GPU. It accepts a variety of input formats such as Apache Parquet, BlazingDB Simpatico (GPU-compressed distributed files), and GDF (GPU dataframes built on Apache Arrow). Data is then sent to the GPU compressed.
Decomposition Storage Model (Columnar)
BlazingSQL is a column-store. To execute a query, it compresses and transmits relevant columns to the GPU. On the GPU, data is represented as a GPU DataFrame (GDF). GDFs are built on top of Apache Arrow, which is a columnar in-memory format.