Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Housekeeper for distributed execution of tasks #21

Open
lucasoares opened this issue Jun 26, 2023 · 0 comments
Open

Improve Housekeeper for distributed execution of tasks #21

lucasoares opened this issue Jun 26, 2023 · 0 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@lucasoares
Copy link
Member

We have successfully operated all our Deckard instances with a single housekeeper pod for years. To enhance scalability of the housekeeper tasks, I propose the following improvements for the housekeeper feature:

  1. Implement a distributed locking mechanism for each task to support running multiple housekeeper pods simultaneously. While most tasks can run concurrently due to their atomic nature, running the same task in parallel can lead to resource waste.

  2. Address potential issues, such as Prometheus metrics duplication. Currently, we expose numerous queue metrics in the /metrics endpoint of a Deckard instance with the housekeeper enabled. Since the housekeeper is responsible for measuring many of these metrics, duplication can occur if we deploy many housekeper pods with the /metrics enabled. We can consider deploying an individual metrics pod or explore alternative solutions to mitigate this issue.

By incorporating these enhancements, we aim to achieve better scalability, improved fault tolerance, and overall performance in our distributed Deckard setup.

@lucasoares lucasoares added enhancement New feature or request good first issue Good for newcomers labels Jun 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

1 participant