This repository has been archived by the owner on Sep 30, 2024. It is now read-only.
I got two failure detections on two co-master clusters: `UnreachableIntermediateMasterWithLaggingReplicas` and `DeadIntermediateMasterAndReplicas`. Both were raised while the clusters were set up as co-masters.

Under such an architecture, I would expect UnreachableMaster or another co-master failure instead.
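Then I checked the analysis code:
orchestrator/go/inst/analysis_dao.go
Lines 566 to 569 in 1a6c3cd
orchestrator/go/inst/analysis_dao.go
Lines 590 to 597 in 1a6c3cd
Among the quoted lines is `a.Description = "Intermediate master cannot be reached by orchestrator and all of its replicas are lagging"`.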
If `LastCheckPartialSuccess` is true and syncing between the two co-masters works well, these IntermediateMaster failures are reported instead of the co-master ones.

With syncing working well, we get `DeadIntermediateMasterAndReplicas` if both co-masters are unreachable, and `UnreachableIntermediateMasterWithLaggingReplicas` if the primary co-master is unreachable and some replicas are lagging.

`LastCheckPartialSuccess` is set to true during the discovery SQL:
orchestrator/go/inst/instance_dao.go
Lines 425 to 430 in 1a6c3cd
There seems to be a bug in how co-master and intermediate-master failures are analyzed. It might be a fault in the `if-else` logic.
judgement fault.The text was updated successfully, but these errors were encountered: