You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[SPARK-49687][SQL] Delay sorting in validateAndMaybeEvolveStateSchema
### What changes were proposed in this pull request?
In `validateAndMaybeEvolveStateSchema`, existing schema and new schema are sorted by column family name.
The sorting can be delayed until `createSchemaFile` is called.
When computing `colFamiliesAddedOrRemoved`, we can use `toSet` to compare column families.
### Why are the changes needed?
This would make `validateAndMaybeEvolveStateSchema` faster.
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
Existing tests.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closesapache#48116 from tedyu/ty-comp-chk.
Authored-by: Zhihong Yu <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
Copy file name to clipboardexpand all lines: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/StateSchemaCompatibilityChecker.scala
+5-5
Original file line number
Diff line number
Diff line change
@@ -168,12 +168,12 @@ class StateSchemaCompatibilityChecker(
0 commit comments