Skip to content

Comparing DataFusion with DuckDB based on ClickBench, H2O, and TPC-H

Notifications You must be signed in to change notification settings

JayjeetAtGithub/datafusion-duckdb-benchmark

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DataFusion / DuckDB Benchmarking Scripts

Compare DataFusion and DuckDB with

Versions

  • DataFusion 32.0.0
  • DuckDB 0.9.1

Results

All results are checked in to results

The scripts in this repository run queries via python bindings for both DataFusion and DuckDB

Setting up the Environment

# Setup Python virtual environment and databases
python3 -m venv venv
source venv/bin/activate
pip install pyarrow pandas matplotlib seaborn prettytable

# install DuckDB
pip install duckdb==0.9.1 psutil

# install DataFusion
pip install datafusion==32.0.0

Credits:

About

Comparing DataFusion with DuckDB based on ClickBench, H2O, and TPC-H

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 63.6%
  • Shell 20.1%
  • TeX 16.3%