APACHE

Apache DataFusion

@apache_datafusion

US
https://datafusion.apache.org
Software Development

Overview

About Apache DataFusion

Apache DataFusion is a fast, feature rich and extensible query engine built on the Apache Arrow memory model.

“Out of the box,” DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community. Python Bindings are also available.

DataFusion features a full query planner, a columnar, streaming, multi-threaded, vectorized execution engine, and partitioned data sources. You can customize DataFusion at almost all points including additional data sources, query languages, functions, custom operators and more. See the Architecture section for more details.

Headquarters

-

Website

https://datafusion.apache.org

Company Size

51-200 employees

Industry

Software Development

Company Type

Nonprofit

Founded

2025

Specialties

-

Posts