c/controller_backend: try to force-abort reconfiguration only on leaders #18021

Merged: 1 commit merged into redpanda-data:dev on Apr 24, 2024

Contributor

@ztlpn ztlpn commented Apr 23, 2024

Previously, when force-aborting a reconfiguration, we appended an aborting configuration on all replicas. This can lead to log inconsistencies: on followers the configuration is duplicated (one copy from the follower's own append, one replicated by the leader). Although such inconsistencies are expected with force-abort, if the leader is alive we can minimize the chance of them appearing by having followers wait for the aborting configuration to be replicated from the leader instead of appending it themselves.
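
To illustrate the duplication described above, here is a minimal toy model in Python. It is not Redpanda's controller_backend code (which is C++), and every name in it (`Replica`, `force_abort_on_all`, `force_abort_on_leader`, `replicate_from_leader`) is hypothetical. It assumes a live leader and models replication as simply copying the leader's latest entry to each follower.

```python
from dataclasses import dataclass, field

ABORT = "abort-config"  # stand-in for the aborting configuration entry


@dataclass
class Replica:
    name: str
    is_leader: bool = False
    log: list = field(default_factory=list)


def force_abort_on_all(replicas):
    """Old behaviour: every replica appends the aborting config locally."""
    for r in replicas:
        r.log.append(ABORT)


def force_abort_on_leader(replicas):
    """New behaviour: only the leader appends; followers wait for replication."""
    for r in replicas:
        if r.is_leader:
            r.log.append(ABORT)


def replicate_from_leader(replicas):
    """Toy replication: copy the leader's latest entry to each follower."""
    leader = next(r for r in replicas if r.is_leader)
    entry = leader.log[-1]
    for r in replicas:
        if not r.is_leader:
            r.log.append(entry)


def make_group():
    return [Replica("n1", is_leader=True), Replica("n2"), Replica("n3")]


# Old behaviour: followers end up with two copies of the aborting config.
old = make_group()
force_abort_on_all(old)
replicate_from_leader(old)
old_follower_logs = [r.log for r in old if not r.is_leader]

# New behaviour: followers only receive the leader's replicated copy.
new = make_group()
force_abort_on_leader(new)
replicate_from_leader(new)
new_follower_logs = [r.log for r in new if not r.is_leader]

print(old_follower_logs)
print(new_follower_logs)
```

In this model the followers in `old` hold the aborting entry twice, while the followers in `new` hold it once. The model deliberately ignores the case where no leader is alive.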

Fixes #17847

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v24.1.x
  • v23.3.x
  • v23.2.x

Release Notes

  • none

@vbotbuildovich
Collaborator

vbotbuildovich commented Apr 23, 2024

new failures in https://buildkite.com/redpanda/redpanda/builds/48150#018f0ae3-7a14-4a08-9822-9bd8a4a270ab:

"rptest.tests.rbac_upgrade_test.UpgradeMigrationCreatingDefaultRole.test_rbac_migration"

new failures in https://buildkite.com/redpanda/redpanda/builds/48150#018f0af4-2a19-4474-a963-4b26d1778e5f:

"rptest.tests.rbac_upgrade_test.UpgradeMigrationCreatingDefaultRole.test_rbac_migration"

@ztlpn
Contributor Author

ztlpn commented Apr 23, 2024

test failures due to 24.1 branching

@piyushredpanda piyushredpanda merged commit 256a0a5 into redpanda-data:dev Apr 24, 2024
14 of 18 checks passed
@vbotbuildovich
Collaborator

/backport v24.1.x

@vbotbuildovich
Collaborator

/backport v23.3.x

@vbotbuildovich
Collaborator

/backport v23.2.x

@vbotbuildovich
Collaborator

Failed to create a backport PR to v23.2.x branch. I tried:

git remote add upstream https://github.com/redpanda-data/redpanda.git
git fetch --all
git checkout -b backport-pr-18021-v23.2.x-433 remotes/upstream/v23.2.x
git cherry-pick -x 8e221d36888652b0d79b81925473e4bb80da2351

Workflow run logs.

Development

Successfully merging this pull request may close these issues.

CI Failure (Consumed from an unexpected offset) in PartitionMoveInterruption.test_cancelling_partition_move