Skip to content

Comments

ORC-2081: Support ORC LZ4 in bench module#2519

Closed
dongjoon-hyun wants to merge 1 commit intoapache:mainfrom
dongjoon-hyun:ORC-2081
Closed

ORC-2081: Support ORC LZ4 in bench module#2519
dongjoon-hyun wants to merge 1 commit intoapache:mainfrom
dongjoon-hyun:ORC-2081

Conversation

@dongjoon-hyun
Copy link
Member

@dongjoon-hyun dongjoon-hyun commented Feb 6, 2026

What changes were proposed in this pull request?

This PR aims to support ORC LZ4 in bench module.

Why are the changes needed?

To benchmark LZ4 like the other codecs.

How was this patch tested?

Manually run the following.

BUILD

$ cd java

$ mvn package -DskipTests -Pbenchmark

WRITE

$ java -jar core/target/orc-benchmarks-core-*-uber.jar generate data -d sales -c lz4 -f orc
Processing sales [orc]

FILE NAME

$ ls -alR data/generated/sales
total 6029680
drwxr-xr-x@ 3 dongjoon  staff          96 Feb  6 15:10 .
drwxr-xr-x@ 3 dongjoon  staff          96 Feb  6 14:50 ..
-rw-r--r--@ 1 dongjoon  staff  3083885325 Feb  6 15:07 orc.lz4

READ

$ java -jar core/target/orc-benchmarks-core-*-uber.jar scan data -d sales -c lz4 -f orc
data/generated/sales/orc.lz4 rows: 25000000 batches: 24415

ORC-TOOLS

$ orc-tools meta data/generated/sales/orc.lz4 | head -n4
Processing data file data/generated/sales/orc.lz4 [length: 3083885325]
Structure for data/generated/sales/orc.lz4
File Version: 0.12 with ORC_14 by ORC Java 2.3.0-SNAPSHOT
Rows: 25000000
Compression: LZ4

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Opus 4.5 on Claude Code

@github-actions github-actions bot added the JAVA label Feb 6, 2026
@dongjoon-hyun dongjoon-hyun deleted the ORC-2081 branch February 6, 2026 23:36
@dongjoon-hyun dongjoon-hyun added this to the 2.3.0 milestone Feb 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant