DataFusion

DataFusion is an extensible framework for planning and executing SQL queries.

Data Model

Relational

Parallel Execution

Intra-Operator (Horizontal)

Query Interface

SQL

Storage Format

Apache Parquet Apache Arrow

System Architecture

Shared-Everything

Embeddings