Skip to content
This repository has been archived by the owner on Dec 13, 2023. It is now read-only.

zookeeper lock znode not getting cleared after workflows are finalized #246

Open
mithlesh135 opened this issue Jul 19, 2023 · 1 comment

Comments

@mithlesh135
Copy link

mithlesh135 commented Jul 19, 2023

Describe the bug
As zookeeper takes lock for any operation, it creates znodes in below format
/conductor/conductor-lock/workflow-uuid/sequential-epephermal-uuid

Now these epephermal nodes get deleted but the the znode at workflow uuid is persistence once and is not getting deleted, and and we have more and more workflows getting processed, number of these persistence znodes increase inside the parent conductor-lock znode. Zookeeper suggests max size of znode to be 1 MB, this ever growing znode is a problem.

To Reproduce
Steps to reproduce the behavior:

  1. Go to conductor
  2. connect to zookeeper server
  3. run a workflow and check znodes getting created for workflow
  4. even after worklow execution has completed, workflow znode stays

Expected behavior
workflow znode should be deleted after workflow is finalized

@v1r3n
Copy link
Contributor

v1r3n commented Jul 19, 2023

Hi @mithlesh135 I would recommend using Redis for locks. Zookeeper support exists for the legacy reasons and is quite a heavy solution to maintain. Redis based locks are going to be much more lightweight.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

2 participants