
Elasticsearch master election during upgrade #344

Open
cehoffman opened this issue May 6, 2018 · 1 comment

@cehoffman

Is this a BUG REPORT or FEATURE REQUEST?:

/kind feature

What happened:
Upgrading the Elasticsearch cluster resulted in multiple master elections.

What you expected to happen:
Only one master election occurs, at the end of the upgrade.

How to reproduce it (as minimally and precisely as possible):

  1. Create a cluster at version X with multiple masters
  2. Cause the last master in the StatefulSet to become the leader
  3. Update the cluster to version Y

Anything else we need to know?:
When performing an upgrade, the controller-manager should delete all master pods except the current leader first. The current leader should be the last pod deleted and updated.
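
A rough sketch of that ordering (all names hypothetical, current client-go API; discovering the elected master, e.g. via Elasticsearch's `_cat/master` endpoint, is assumed to happen elsewhere):

```go
package upgrade

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// deleteMastersLeaderLast deletes each master pod and, by ordering, leaves the
// currently elected master until the very end, so that only one re-election
// should be needed. Hypothetical sketch, not navigator's actual upgrade code.
func deleteMastersLeaderLast(ctx context.Context, client kubernetes.Interface, namespace, currentLeader string, masterPods []string) error {
	ordered := make([]string, 0, len(masterPods))
	for _, name := range masterPods {
		if name != currentLeader {
			ordered = append(ordered, name) // non-leaders go first
		}
	}
	ordered = append(ordered, currentLeader) // elected master deleted and updated last

	for _, name := range ordered {
		if err := client.CoreV1().Pods(namespace).Delete(ctx, name, metav1.DeleteOptions{}); err != nil {
			return err
		}
		// A real controller would wait here for the replacement pod to become
		// Ready and rejoin the Elasticsearch cluster before continuing.
	}
	return nil
}
```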

Environment:

  • Kubernetes version (use kubectl version):
  • Cloud provider or hardware configuration:
  • Install tools:
  • Others:
@munnerz
Contributor

munnerz commented May 8, 2018

This is currently fairly difficult for us to do, as we rely upon StatefulSet for the upgrade functionality under the hood, and use the RollingUpdate strategy.

If we switch to OnDelete we lose the 'partition' functionality, which we currently rely upon to ensure updates to nodes in a cluster aren't triggered early if their pods are deleted. For example, if a k8s node fails during an upgrade, any pods running on that failed node would be immediately upgraded the next time they start (potentially breaking delicate upgrade procedures).
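
For context, the partition behaviour referred to here is part of the StatefulSet RollingUpdate strategy; a minimal sketch (not navigator's actual code) of setting it via the apps/v1 Go types:

```go
package example

import appsv1 "k8s.io/api/apps/v1"

// rollingUpdateWithPartition returns an update strategy where only pods whose
// ordinal is >= the partition value are updated. This is the gating behaviour
// that would be lost by switching to the OnDelete strategy.
func rollingUpdateWithPartition(partition int32) appsv1.StatefulSetUpdateStrategy {
	return appsv1.StatefulSetUpdateStrategy{
		Type: appsv1.RollingUpdateStatefulSetStrategyType,
		RollingUpdate: &appsv1.RollingUpdateStatefulSetStrategy{
			Partition: &partition,
		},
	}
}
```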

Therefore, the only way we can do this is to implement our own alternative to StatefulSet, which chooses which replica to update based on some database-specific predicate.
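
Very roughly, such a predicate could be sketched like this (purely illustrative, hypothetical names):

```go
package example

// UpdatePredicate is a hypothetical interface for a StatefulSet alternative:
// given the set of out-of-date pods, it picks the next one to update using
// database-specific knowledge rather than ordinal order.
type UpdatePredicate interface {
	NextPodToUpdate(outOfDate []string) (podName string, ok bool)
}

// electedMasterLast is a hypothetical Elasticsearch predicate that updates the
// elected master only after every other master pod has been updated.
type electedMasterLast struct {
	currentLeader string // looked up from the cluster, e.g. via _cat/master
}

func (p electedMasterLast) NextPodToUpdate(outOfDate []string) (string, bool) {
	if len(outOfDate) == 0 {
		return "", false
	}
	for _, name := range outOfDate {
		if name != p.currentLeader {
			return name, true // non-leader masters go first
		}
	}
	return p.currentLeader, true // only the elected master remains
}
```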

There has already been discussion over on the Elastic GitHub and forums about triggering manual re-elections as a stop-gap to make this process more graceful: elastic/elasticsearch#17493.

Their line seems to be "it shouldn't take that long to re-elect" - but as you say, it'd be nice if we could minimise interruptions. It might be possible to achieve this with a custom discovery plugin, but right now we use the in-built SRV record discovery mechanism, so this would be a new component entirely.

@wallrj modified the milestone: v0.2 May 15, 2018