
Requeue if pod not available in AutoUpgrade #387

Merged
1 commit merged into Mellanox:master from unavailable on Jul 18, 2022

Conversation

@rollandf (Member) commented on Jul 12, 2022

In the auto upgrade flow, the upgrade state of the nodes is built
based on the OFED pods.

However, there can be a stage where pods are not available because
the upgrade flow is deleting them and new ones have not been created
yet.

In that case, the corresponding node will not be considered in the
upgrade logic at the specific time when the pod has not yet been re-created.

This can affect the "MaxParallelUpgrades" feature.

With this commit, if an OFED DaemonSet has one or more unavailable
pods, the upgrade logic will not run and the notification will be
requeued to be handled later.

Signed-off-by: Fred Rolland frolland@nvidia.com
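
For illustration, a minimal sketch of the check described above, written against the Kubernetes apps/v1 and controller-runtime APIs; the function name, package name, and the 30-second requeue delay are assumptions for this sketch, not the exact code added by this PR:

package controllers

import (
	"time"

	appsv1 "k8s.io/api/apps/v1"
	ctrl "sigs.k8s.io/controller-runtime"
)

// requeueIfUnavailable is a sketch of the check this PR describes: it reports
// whether any driver DaemonSet still has unavailable pods. If so, the caller
// should skip the upgrade-state logic for this reconcile and return the
// Result, so the notification is handled again later, once the DaemonSet
// controller has re-created the deleted pods.
func requeueIfUnavailable(daemonSets []appsv1.DaemonSet) (ctrl.Result, bool) {
	for _, ds := range daemonSets {
		if ds.Status.NumberUnavailable > 0 {
			// Assumed delay; Requeue: true or another RequeueAfter would also work.
			return ctrl.Result{RequeueAfter: 30 * time.Second}, true
		}
	}
	return ctrl.Result{}, false
}

A caller such as the operator's Reconcile would then return the Result instead of continuing to build the node upgrade state.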

@rollandf (Member Author) commented:

From DaemonSet documentation:

// The number of nodes that should be running the
// daemon pod and have none of the daemon pod running and available
// (ready for at least spec.minReadySeconds)
// +optional
NumberUnavailable int32 `json:"numberUnavailable,omitempty" protobuf:"varint,8,opt,name=numberUnavailable"`

@rollandf force-pushed the unavailable branch 2 times, most recently from 181616e to 230227f, on July 12, 2022 09:16
@e0ne (Collaborator) left a comment:

LGTM

@@ -152,6 +153,13 @@ func (r *UpgradeReconciler) BuildState(ctx context.Context) (*upgrade.ClusterUpg

r.Log.V(consts.LogLevelDebug).Info("Got driver daemon sets", "length", len(daemonSets))

for _, ds := range daemonSets {
@adrianchiris (Collaborator) commented on Jul 13, 2022:

As discussed offline, the DS status may not yet be updated by the k8s DaemonSet controller, so we may be called after the pod delete but before the DS got a chance to update its status, leading to the same issue.

I think we should rely on the DS status DesiredNumberScheduled (which maps to the number of nodes that should run a DS pod), as this is not expected to change (scale up/down) in the middle of an active upgrade flow.

Then we can list the pods and count the ones associated with the DaemonSet (having an owner reference) and without a deletion timestamp set, which means the DS has for sure claimed the pod.

If the two numbers are the same, it means that the pods that were deleted got rescheduled by the DaemonSet.

The DaemonSet controller has some logic around updating the status fields, so I suspect we will not always see the actual state.

https://github.com/kubernetes/kubernetes/blob/a1c8e9386af844757333733714fa1757489735b3/pkg/controller/daemon/daemon_controller.go#L1118
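
For reference, a rough sketch of the counting approach suggested here, using controller-runtime's client and apps/v1 types; the helper name and the namespace-wide pod listing are assumptions for illustration, not code from this repository:

package controllers

import (
	"context"

	appsv1 "k8s.io/api/apps/v1"
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// allPodsClaimed is a sketch of the approach described above: it compares
// Status.DesiredNumberScheduled with the number of pods that are owned by
// the DaemonSet and do not have a deletion timestamp. If the two match,
// every pod deleted by the upgrade flow has already been re-created and
// claimed by the DaemonSet controller, independent of whether the DaemonSet
// status fields have caught up yet.
func allPodsClaimed(ctx context.Context, c client.Client, ds *appsv1.DaemonSet) (bool, error) {
	podList := &corev1.PodList{}
	if err := c.List(ctx, podList, client.InNamespace(ds.Namespace)); err != nil {
		return false, err
	}

	var owned int32
	for i := range podList.Items {
		pod := &podList.Items[i]
		if pod.DeletionTimestamp != nil {
			// Pod is being deleted; it has not been re-created yet.
			continue
		}
		if metav1.IsControlledBy(pod, ds) {
			owned++
		}
	}
	return owned == ds.Status.DesiredNumberScheduled, nil
}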

@adrianchiris (Collaborator) left a comment:

Please see the comment above; I think we should make some changes.

@rollandf (Member Author) commented:

/retest-all

1 similar comment
@rollandf (Member Author) commented:

/retest-all

@adrianchiris (Collaborator) left a comment:

small comment, otherwise LGTM

controllers/upgrade_controller.go (review comment resolved)
@adrianchiris merged commit c841259 into Mellanox:master on Jul 18, 2022
@e0ne mentioned this pull request on Jul 18, 2022
@rollandf deleted the unavailable branch on February 28, 2024 08:11