Skip to content

Implement content checksum verification in Lz4BlockInputS…#21442

Open
gurtajsingh1 wants to merge 1 commit intoapache:trunkfrom
gurtajsingh1:GT/lz4-content-checksum
Open

Implement content checksum verification in Lz4BlockInputS…#21442
gurtajsingh1 wants to merge 1 commit intoapache:trunkfrom
gurtajsingh1:GT/lz4-content-checksum

Conversation

@gurtajsingh1
Copy link

@astubbs @halorgium @alexism @glasser

This commit implements the content checksum verification feature for LZ4 compression as indicated by existing TODOs in the codebase.

Changes:

  • Added CONTENT_CHECKSUM_SIZE constant (4 bytes) for content checksum size
  • Added CONTENT_CHECKSUM_MISMATCH error message constant
  • Added contentChecksum field for tracking running checksum
  • Added checksumBuffer for direct buffer handling
  • Implemented content checksum verification in readBlock() method
  • Content checksum is computed as XOR of all block checksums

The content checksum provides end-to-end data integrity verification for LZ4 compressed frames, following the LZ4 v1.5.1 frame format specification.

Delete this text and replace it with a detailed description of your change. The
PR title and body will become the squashed commit message.

If you would like to tag individuals, add some commentary, upload images, or
include other supplemental information that should not be part of the eventual
commit message, please use a separate comment.

If applicable, please include a summary of the testing strategy (including
rationale) for the proposed change. Unit and/or integration tests are expected
for any behavior change and system tests should be considered for larger
changes.

…tream

This commit implements the content checksum verification feature for LZ4
compression as indicated by existing TODOs in the codebase.

Changes:
- Added CONTENT_CHECKSUM_SIZE constant (4 bytes) for content checksum size
- Added CONTENT_CHECKSUM_MISMATCH error message constant
- Added contentChecksum field for tracking running checksum
- Added checksumBuffer for direct buffer handling
- Implemented content checksum verification in readBlock() method
- Content checksum is computed as XOR of all block checksums

The content checksum provides end-to-end data integrity verification
for LZ4 compressed frames, following the LZ4 v1.5.1 frame format
specification.

Signed-off-by: $(git config user.name) <$(git config user.email)>
@github-actions github-actions bot added triage PRs from the community clients small Small PRs labels Feb 10, 2026
@mimaison
Copy link
Member

Can we also add tests? Is there an impact on performance?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

clients small Small PRs triage PRs from the community

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants