
PS-9391 Fixes replication break error because of HASH_SCAN #5528

Draft · wants to merge 1 commit into base: 8.0

Conversation

VarunNagaraju
Contributor

https://perconadev.atlassian.net/browse/PS-9391

Problem

When a replica's slave_rows_search_algorithms is set to HASH_SCAN,
replication may break with HA_ERR_KEY_NOT_FOUND.

Analysis

When a replica's slave_rows_search_algorithms is set to HASH_SCAN,
the applier prepares a unique key list for all the rows in a particular
Rows_event. The same unique key list is later used to retrieve from the
storage engine all tuples associated with each key in the list. When
multiple updates target the same row, as shown in the testcase, this
unique key list may be filled with entries that do not yet exist in the
table. This becomes a problem when an intermediate update changes the
value of the index column to a smaller value than the original entry,
and that changed value is then used in another update, as shown in the
second part of the testcase. It is an issue because the unique key list
is a std::set, which keeps its entries sorted. Because of this sorting,
the first entry of the list can be a value that does not exist in the
table yet, and when it is searched for in the next_record_scan() method,
the lookup fails with HA_ERR_KEY_NOT_FOUND.
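
A minimal standalone sketch of this ordering effect (the integer keys
are hypothetical stand-ins for the key images in the testcase):

#include <cassert>
#include <set>
#include <vector>

int main() {
  // Keys in binlog (insertion) order: the existing key 10 first, then
  // the smaller key 5 that an intermediate update only creates later.
  std::vector<int> binlog_order = {10, 5};
  std::set<int> distinct_keys(binlog_order.begin(), binlog_order.end());

  // std::set iterates in sorted order, so the first key probed is 5,
  // which does not exist in the table yet -> HA_ERR_KEY_NOT_FOUND.
  assert(*distinct_keys.begin() == 5);
  return 0;
}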

Solution

Instead of using a std::set to store the distinct keys, a combination
of an unordered_set and a list is used, which preserves the original
order of the updates while avoiding duplicates at the same time, thus
preventing the side effects of sorting.
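
A minimal standalone sketch of this insertion-order-preserving
deduplication (simplified to int keys; the actual patch stores uchar *
key images with custom hash and equality functors):

#include <cassert>
#include <list>
#include <unordered_set>

// Append a key only if it has not been seen before; the list keeps the
// keys in their original (binlog) order instead of sorted order.
static void add_distinct_key(std::unordered_set<int> &seen,
                             std::list<int> &keys, int key) {
  if (seen.insert(key).second) keys.push_back(key);
}

int main() {
  std::unordered_set<int> seen;
  std::list<int> keys;
  for (int k : {10, 5, 10, 5}) add_distinct_key(seen, keys, k);

  // Duplicates dropped, insertion order preserved: the first key probed
  // is 10, which already exists in the table.
  assert(keys.size() == 2 && keys.front() == 10);
  return 0;
}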


@github-actions github-actions bot left a comment


⚠️ Clang-Tidy found issue(s) with the introduced code (1/1)

sql/log_event.cc (outdated, resolved)
@VarunNagaraju
Contributor Author

@@ -2898,14 +2898,34 @@ class Rows_log_event : public virtual binary_log::Rows_event, public Log_event {
   Key_compare(KEY **ki = nullptr) : m_key_info(ki) {}
   bool operator()(uchar *k1, uchar *k2) const {
     return key_cmp2((*m_key_info)->key_part, k1, (*m_key_info)->key_length,
-                    k2, (*m_key_info)->key_length) < 0;
+                    k2, (*m_key_info)->key_length) == 0;
Collaborator


Rename this helper class to Key_equal.
Compare may be misleading, as it suggests a comparator, which is no longer the case in your scenario. All it does now is check for equality.

Contributor Author


Done
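
For context, the renamed functor presumably looks along these lines (a
sketch reconstructed from the diff above; KEY, uchar, and key_cmp2 are
server-internal types/functions, so this is not standalone-compilable):

// Equality functor over two key images of the same key definition;
// replaces the ordering comparator previously named Key_compare.
class Key_equal {
 public:
  Key_equal(KEY **ki = nullptr) : m_key_info(ki) {}
  bool operator()(uchar *k1, uchar *k2) const {
    return key_cmp2((*m_key_info)->key_part, k1, (*m_key_info)->key_length,
                    k2, (*m_key_info)->key_length) == 0;
  }

 private:
  KEY **m_key_info;
};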

class Key_hash {
public:
Key_hash(KEY **ki = nullptr) : m_key_info(ki) {}
size_t operator()(uchar* ptr) const {
Collaborator


Maybe just create a std::string_view from ptr and (*m_key_info)->key_length and calculate a hash of this string_view using the standard hash function:

std::string_view sv{reinterpret_cast<const char *>(ptr), (*m_key_info)->key_length};
return std::hash<std::string_view>{}(sv);

Contributor Author


Thanks for the suggestion. Done.
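
As a standalone illustration of the suggested approach (hashing a raw
key buffer through std::string_view; the buffer contents here are
hypothetical):

#include <cstddef>
#include <functional>
#include <string_view>

using uchar = unsigned char;

// Hash `len` bytes starting at `ptr` with the standard string_view hasher.
static size_t hash_key(const uchar *ptr, size_t len) {
  std::string_view sv{reinterpret_cast<const char *>(ptr), len};
  return std::hash<std::string_view>{}(sv);
}

int main() {
  const uchar key[] = {0x01, 0x02, 0x03, 0x04};
  size_t h = hash_key(key, sizeof(key));
  (void)h;  // the exact hash value is implementation-defined
  return 0;
}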

sql/log_event.cc (outdated)
@@ -9142,9 +9141,10 @@ int Rows_log_event::add_key_to_distinct_keyset() {
   DBUG_TRACE;
   assert(m_key_index < MAX_KEY);
   key_copy(m_distinct_key_spare_buf, m_table->record[0], m_key_info, 0);
-  std::pair<std::set<uchar *, Key_compare>::iterator, bool> ret =
+  std::pair<std::unordered_set<uchar *, Key_hash, Key_compare>::iterator, bool> ret =
Collaborator


Maybe just auto?

Contributor Author


Done

@@ -9088,14 +9090,11 @@ int Rows_log_event::open_record_scan() {
 
   if (m_key_index < MAX_KEY) {
     if (m_rows_lookup_algorithm == ROW_LOOKUP_HASH_SCAN) {
-      /* initialize the iterator over the list of distinct keys that we have */
-      m_itr = m_distinct_keys.begin();
Collaborator


Shouldn't we change this to m_distinct_key_id = 0?

Contributor Author


m_distinct_key_idx is already set to 0 when it is declared.
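
That is, via an in-class default member initializer, presumably along
these lines (the size_t type is an assumption):

// Starts at 0 at its declaration, so open_record_scan() does not need
// an explicit reset.
size_t m_distinct_key_idx = 0;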

sql/log_event.cc (outdated)
@@ -9068,8 +9068,10 @@ int Rows_log_event::next_record_scan(bool first_read) {
   if ((error = table->file->ha_index_read_map(
           table->record[0], m_key, HA_WHOLE_KEY, HA_READ_KEY_EXACT))) {
     DBUG_PRINT("info", ("no record matching the key found in the table"));
-    if (!is_trx_retryable_upon_engine_error(error))
+    if (!is_trx_retryable_upon_engine_error(error)) {
+      sleep(10);
Collaborator


Is this intentional or just for debugging?

Contributor Author


It was just for debugging. Removed it.
