Skip to content

Conversation

PedroMDuarte
Copy link

@PedroMDuarte PedroMDuarte commented Sep 3, 2025

This PR addresses the issue described in this spark dev mailing list thread: https://lists.apache.org/thread/27bg3gsd4czpd4o0ybvc2pxv6qhmpbfb

What changes were proposed in this pull request?

Include timestamps of all retained .zip files in the timestamp threshold used to detect orphaned files. This helps protect against state store corruption when multiple file managers are simultaneously running maintenance for the same partition.

Why are the changes needed?

To avoid state store corruption.

Does this PR introduce any user-facing change?

No

How was this patch tested?

A new test was added that goes over the problem. The test is added in the first commit in this PR.

Was this patch authored or co-authored using generative AI tooling?

No

@PedroMDuarte PedroMDuarte changed the base branch from master to branch-3.5 September 3, 2025 02:05
@PedroMDuarte PedroMDuarte changed the title [WIP][SPARK-XXXX] Rocks db state store orphan removal protection [SPARK-XXXX] Rocks db state store orphan removal protection Sep 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant