MCM should always delete Node if corresponding Machine object is not present during termination flow #1044

@elankath

How to categorize this issue?

/area robustness
/kind bug
/priority 2

What happened:

In some cases, when a Machine is force-deleted, the corresponding Node is never cleaned up. This happens when the Machine object is explicitly deleted by support and the node lifecycle controller then fails to remove the Node.

How to reproduce it (as minimally and precisely as possible):

Difficult to reproduce. However, the Node object should be deleted as soon as the corresponding Machine object is removed, rather than waiting for the node lifecycle controller to clean it up.

Anything else we need to know?:

MCM should not skip Node deletion when the corresponding Machine object is missing. This is a long-known gap: until now we have depended on the standard Kubernetes controller for this clean-up.
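To illustrate the check being asked for, here is a minimal sketch in Go of how orphaned Nodes could be identified. It is not MCM's actual code: the `Node` type is a simplified stand-in for the Kubernetes object, and the label key `example.com/machine-name`, the constant `machineNameLabel`, and the function `OrphanNodes` are all hypothetical names chosen for this example.

```go
package main

import "sort"

// machineNameLabel is a HYPOTHETICAL label key linking a Node to its Machine.
// The key MCM really uses may differ; it is assumed here for illustration.
const machineNameLabel = "example.com/machine-name"

// Node is a simplified stand-in for a Kubernetes Node object.
type Node struct {
	Name   string
	Labels map[string]string
}

// OrphanNodes returns the sorted names of nodes that reference a Machine
// (via machineNameLabel) that is no longer present in the machines set.
// Nodes without the label are skipped, since they are assumed not to be
// managed by the controller.
func OrphanNodes(nodes []Node, machines map[string]struct{}) []string {
	var orphans []string
	for _, n := range nodes {
		machineName, managed := n.Labels[machineNameLabel]
		if !managed {
			continue
		}
		if _, exists := machines[machineName]; !exists {
			orphans = append(orphans, n.Name)
		}
	}
	sort.Strings(orphans)
	return orphans
}
```

In a real controller, the returned names would presumably be passed to a Node delete call during the termination flow, so that clean-up does not depend on the node lifecycle controller alone.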

This problem occurred in Azure.

Metadata

Labels

area/robustness: Robustness, reliability, resilience related
exp/beginner: Issue that requires only basic skills
kind/bug: Bug
priority/2: Priority (lower number equals higher priority)
