Regression: Invalid comparison operation: Utf8 == Utf8View
error during LEFT ANTI JOIN
#13510
Labels
bug
Something isn't working
Describe the bug
Between
42.2.0
and43.0.0
there appears to have been a regression that introduced an error:Note that the error happens at the plan execution phase, i.e. plan validation passes successfully.
As Arrow / DF errors don't have any trace info I have no visibility on which operator this error originates from. I will proceed to patch Arrow to try to dump the traceback and produce a minimal reproduce steps.
To Reproduce
I'm still struggling to isolate a minimal repro case. Can only share the full optimized physical plan for now:
The error happens deep in our system when merging two dataframes, one read from Parquet (and having
Utf8View
columns) and another from CSV (havingUtf8
columns). The key operation is to de-duplicate records in CSV with those in Parquet usingLeftAnti
JOIN.Expected behavior
No error / error during planning if some operation is invalid
Additional context
No response
The text was updated successfully, but these errors were encountered: