Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Attach version and member information to Raft configuration changes #2546

Merged
merged 3 commits into from
Jan 27, 2025

Conversation

tillrohrmann
Copy link
Contributor

This commit adds version and member information to Raft configuration changes
so that we can check for members that have lost their disks and improve
observability.

This PR is based on #2544.

@tillrohrmann
Copy link
Contributor Author

The failing KafkaIngress e2e test seems to be unstable (#2548).

Copy link
Contributor

@muhamadazmy muhamadazmy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @tillrohrmann for creating this PR. The changes looks good to me. I think you will probably need to rebase with main after I merge my changes. I have small changes in the protobuf file.

MetadataStoreConfiguration configuration = 1;
// unfortunately map keys only support strings or integer types
// but not bytes.
map<string, VersionedValue> kv_entries = 2;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I changed this to entries you will probably need to rebase after I merge

fn decode_from_bytes(mut bytes: Bytes) -> Result<Self, StorageDecodeError> {
StorageCodec::decode::<WriteRequest, _>(&mut bytes)
fn decode_from_bytes(bytes: Bytes) -> Result<Self, StorageDecodeError> {
let result = grpc::WriteRequest::decode(bytes).unwrap();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I totally overlooked the unwrap() call. i will create a follow up PR to fix it.

@tillrohrmann tillrohrmann force-pushed the configuration-version branch from 5ea4d47 to 65efad1 Compare January 27, 2025 10:58
@tillrohrmann
Copy link
Contributor Author

Thanks for the review @muhamadazmy. I've rebased onto the latest master. Merging once GHA gives green light.

This commit adds version and member information to Raft configuration changes
so that we can check for members that have lost their disks and improve
observability.
Setting nodes that are not part of the configuration to Standby is problematic
because it might set some nodes that are trying to join the cluster to Standby.
If this happens, then the nodes will stop trying to join the cluster. Hence,
we are only setting those nodes to Member which are part of the current configuration.
Note, currently this does not change the NodesConfiguration since we use the Member
state to tell nodes to join a cluster in the first place.
@tillrohrmann tillrohrmann force-pushed the configuration-version branch from 65efad1 to ace0c11 Compare January 27, 2025 10:59
@tillrohrmann tillrohrmann merged commit ace0c11 into restatedev:main Jan 27, 2025
12 checks passed
@tillrohrmann tillrohrmann deleted the configuration-version branch January 27, 2025 13:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants