[Question] TPC-DS 1TB Benchmarking results for Non-Partitioned Delta tables with Velox Backend #11463

shadowmmu · 2026-01-22T07:12:50Z

Hi Gluten Community,

I am currently exploring the performance of Apache Gluten with the Velox backend specifically for Delta Lake workloads.

While there are several TPC-DS benchmark reports available for Parquet/ORC, I am looking for insights or existing benchmarking results for the following specific setup:

Scale Factor: 1TB (TPC-DS)
Data Format: Delta Lake (non-partitioned)
Backend: Velox
Storage: (e.g., S3 / HDFS / Local NVMe)

Context:
We are evaluating the overhead of the Delta Log reading process versus the native acceleration provided by Velox. Specifically, we are interested in:

How non-partitioned Delta tables perform compared to standard Parquet in a Gluten environment.
Whether anyone has observed specific bottlenecks in metadata handling or scan performance with this configuration.
Recommended Spark/Gluten configurations to optimize the Delta-Velox scan path for large-scale non-partitioned data.
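
To make the comparison in the list above concrete, here is a minimal timing sketch, not an official Gluten benchmark tool. The helper names `time_queries` and `slowdown` are hypothetical, and `run_query` is whatever callable executes one SQL string in your environment. The idea is to run the same query set once against the Delta tables and once against the Parquet tables, then compare per-query medians.

```python
import statistics
import time
from typing import Callable, Dict


def time_queries(run_query: Callable[[str], object],
                 queries: Dict[str, str],
                 repeats: int = 3) -> Dict[str, float]:
    """Return the median wall-clock seconds per query name.

    run_query is an assumption: any callable that executes one SQL string
    to completion (for example a wrapper around a Spark session).
    """
    results = {}
    for name, sql in queries.items():
        samples = []
        for _ in range(repeats):
            start = time.perf_counter()
            run_query(sql)
            samples.append(time.perf_counter() - start)
        results[name] = statistics.median(samples)
    return results


def slowdown(delta: Dict[str, float],
             parquet: Dict[str, float]) -> Dict[str, float]:
    """Ratio of Delta time to Parquet time per query; above 1.0 means Delta was slower."""
    return {q: delta[q] / parquet[q]
            for q in delta
            if q in parquet and parquet[q] > 0}
```

With PySpark, `run_query` could be something like `lambda sql: spark.sql(sql).collect()`, assuming `spark` is a session that already has Gluten and Delta configured. Running the same harness with the Gluten plugin disabled would give the vanilla Spark baseline.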

If anyone has run these benchmarks or has a performance comparison (Native Spark vs. Gluten+Velox) for this setup, I would greatly appreciate it if you could share your findings or any tuning tips!

Thanks!

FelixYBW · 2026-01-27T02:37:11Z

Collaborator

Delta tables have slightly lower performance than pure Hive tables. Delta uses SQL queries to read its metadata during query processing, but some operators in those metadata queries are not supported natively. This causes frequent C2R (columnar-to-row) and R2C (row-to-columnar) transitions, which in some cases makes it perform worse than vanilla Spark. Contributions to fix this are welcome.
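
A rough way to see how often these transitions occur is to count them in the text of a physical plan. The sketch below is only an illustration: the helper `count_transitions` is hypothetical, and the node names it matches (`ColumnarToRow`, `RowToColumnar`, and prefixed variants such as `VeloxColumnarToRow` or `RowToVeloxColumnar`) are assumptions that may differ between Spark and Gluten versions.

```python
import re
from collections import Counter

# Assumed transition node names as they might appear in plan text:
#   ColumnarToRow / <Prefix>ColumnarToRow   (columnar -> row, "C2R")
#   RowToColumnar / RowTo<Prefix>Columnar   (row -> columnar, "R2C")
# Adjust the pattern if your plan output uses different names.
TRANSITION_PATTERN = re.compile(r"\b(\w*ColumnarToRow|RowTo\w*Columnar)\b")


def count_transitions(plan_text: str) -> Counter:
    """Count occurrences of each row/columnar transition node in plan text."""
    return Counter(m.group(1) for m in TRANSITION_PATTERN.finditer(plan_text))
```

The plan text could come from the output of `df.explain()` or from `df._jdf.queryExecution().executedPlan().toString()` in PySpark. With adaptive query execution the plan can change at runtime, so counts taken before execution are only indicative. A query whose count is much higher on Delta than on Parquet would be a candidate for the fallback issue described above.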

0 replies

shadowmmu · 2026-01-27T06:59:50Z

Author

Thanks @FelixYBW for your detailed response.
I am up for any kind of contribution. Please guide me on how to proceed.

