-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[fix](cloud-mow) MS should create new Transaction to continue geting delete bitmap when encounter TXN_TOO_OLD #43509
base: master
Are you sure you want to change the base?
Conversation
…delete bitmap when encounter TXN_TOO_OLD
Thank you for your contribution to Apache Doris. Please clearly describe your PR:
|
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
do { | ||
err = txn->get(start_key, end_key, &it); | ||
#ifdef UNIT_TEST |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
use syncpoint instead of MACRO, or change config::max_get_delete_bitmap_retry_times before testing
int n = 10000;
syncpoint(&n);
err = txn->get(start_key, end_key, &it, true, n);
@@ -211,6 +211,9 @@ CONF_Validator(s3_client_http_scheme, [](const std::string& config) -> bool { | |||
// Max retry times for object storage request | |||
CONF_mInt64(max_s3_client_retry, "10"); | |||
|
|||
// Max retry times for get delete bitmap | |||
CONF_mInt64(max_get_delete_bitmap_retry_times, "120"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
suggest naming max_get_delete_bitmap_iterations
@@ -211,6 +211,9 @@ CONF_Validator(s3_client_http_scheme, [](const std::string& config) -> bool { | |||
// Max retry times for object storage request | |||
CONF_mInt64(max_s3_client_retry, "10"); | |||
|
|||
// Max retry times for get delete bitmap | |||
CONF_mInt64(max_get_delete_bitmap_retry_times, "120"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we really need to retry so many times?
return; | ||
} | ||
#ifdef UNIT_TEST | ||
err = txn->get(start_key, end_key, &it, true, 2); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
make sure all delete bitmap keys are committed before rowset keys, so that the snapshot read won't miss data.
#else | ||
err = txn->get(start_key, end_key, &it, true); | ||
#endif | ||
retry++; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
print the delete bitmap count and size once you have retry, which means there are too many delete bitmaps ,we may need log to debug.
When delete bitmap count is big, geting delete bitmap may encounter TXN_TOO_OLD, ms should create a new transaction to reading the remaining data instead of returning TXN_TOO_OLD code.
Check List (For Author)
Test
Behavior changed:
Does this need documentation?
Check List (For Reviewer who merge this PR)