[core][autoscaler] Autoscaler V2 - cluster state version number #35873

Open
Tracked by #2600
rickyyx opened this issue May 30, 2023 · 0 comments
Labels: core, core-autoscaler, enhancement, P1

@rickyyx
Contributor

rickyyx commented May 30, 2023

What happened + What you expected to happen

  1. We need to handle GCS failover in HA mode so that the cluster state version stays correct. We need to store an epoch number in the KV store and bump it whenever the GCS fails over, and use that epoch number as the high 8 bits of the version number.
  2. We might want to update the version number whenever the cluster state is updated.
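
To make the two points above concrete, here is a rough sketch in Python of how an epoch in the high 8 bits could keep versions increasing across a failover. It is not taken from the Ray codebase. The 64-bit total width, the `ClusterStateVersion` class, the `EPOCH_KEY` name, and the plain `dict` standing in for the persistent KV store are all assumptions made for illustration.

```python
EPOCH_BITS = 8      # from the issue: epoch occupies the high 8 bits
COUNTER_BITS = 56   # assumption: the full version is a 64-bit integer
EPOCH_MASK = (1 << EPOCH_BITS) - 1
COUNTER_MASK = (1 << COUNTER_BITS) - 1
EPOCH_KEY = "cluster_state_epoch"  # hypothetical key name


def make_version(epoch: int, counter: int) -> int:
    """Pack the epoch into the high bits and the counter into the low bits."""
    if not 0 <= epoch <= EPOCH_MASK:
        raise ValueError("epoch does not fit in 8 bits")
    if not 0 <= counter <= COUNTER_MASK:
        raise ValueError("counter does not fit in 56 bits")
    return (epoch << COUNTER_BITS) | counter


def split_version(version: int) -> tuple[int, int]:
    """Return (epoch, counter) for a packed version."""
    return version >> COUNTER_BITS, version & COUNTER_MASK


class ClusterStateVersion:
    """Hypothetical holder for the version; one instance per GCS process."""

    def __init__(self, kv_store: dict):
        # Each (re)start reads the persisted epoch and bumps it, so a new
        # process always starts in a higher epoch than the previous one.
        epoch = kv_store.get(EPOCH_KEY, -1) + 1
        if epoch > EPOCH_MASK:
            raise OverflowError("epoch exhausted the 8-bit range")
        kv_store[EPOCH_KEY] = epoch
        self._epoch = epoch
        self._counter = 0  # in-memory only; lost on failover

    def current(self) -> int:
        return make_version(self._epoch, self._counter)

    def on_state_updated(self) -> int:
        # Point 2: bump the version every time the cluster state changes.
        self._counter += 1
        return self.current()
```

Because the epoch sits in the most significant bits, any version issued after a failover compares greater than every version issued before it, even though the in-memory counter restarts at zero. An 8-bit epoch allows only 256 restarts before it overflows, so a real implementation would need a policy for that case.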

See comments on #35596

Versions / Dependencies

master

Reproduction script

NA

Issue Severity

None

@rickyyx rickyyx added the core Issues that should be addressed in Ray Core label May 30, 2023
@rickyyx rickyyx added this to the Autoscaler V2 milestone May 30, 2023
@rickyyx rickyyx self-assigned this May 30, 2023
@scv119 scv119 added the core-autoscaler autoscaler related issues label Jun 13, 2023
@anyscalesam anyscalesam added the triage Needs triage (eg: priority, bug/not-bug, and owning component) label Feb 14, 2024
@anyscalesam anyscalesam added the enhancement Request for new feature and/or capability label Mar 4, 2024
@jjyao jjyao added P1 Issue that should be fixed within a few weeks and removed triage Needs triage (eg: priority, bug/not-bug, and owning component) labels Mar 11, 2024
@kevin85421 kevin85421 assigned kevin85421 and unassigned rickyyx Dec 11, 2024
5 participants