DataFusion is an extensible framework for planning and executing SQL queries.
DataFusion became a top-level Apache project in 2024.
Relational
Intra-Operator (Horizontal)
SQL
Apache Parquet Apache Arrow
Decomposition Storage Model (Columnar)
Shared-Everything
https://arrow.apache.org/datafusion/
https://github.com/apache/arrow-datafusion
https://docs.rs/datafusion/latest/datafusion/
@ApacheDataFusio
US
2016
Open Source
Rust
Apache v2