Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Avoid logging full rows with sensitive information on conversion failure in Flink CDC #4290

Closed
2 tasks done
atallahade opened this issue Oct 8, 2024 · 0 comments · Fixed by #4918
Closed
2 tasks done
Labels
enhancement New feature or request

Comments

@atallahade
Copy link
Contributor

atallahade commented Oct 8, 2024

Search before asking

  • I searched in the issues and found nothing similar.

Motivation

Using a Flink CDC connector, when a row fails to convert according to the given schema, Paimon logs the entire row, which may contain sensitive information.
You can see this in CdcRecordUtils.java. For example:

2024-10-08 14:14:48,673 [] INFO  org.apache.paimon.flink.sink.cdc.CdcRecordUtils              [] - Failed to convert value <REDACTED_ROW> to type <REDACTED_SCHEMA>. Waiting for schema update.

Solution

Logging only the primary key could be a solution.

Anything else?

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR!
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant