-
Notifications
You must be signed in to change notification settings - Fork 549
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GCP] GCE DWS Support #3574
[GCP] GCE DWS Support #3574
Conversation
Thanks a lot for the PR @gurcangercek! This is fantastic! |
Fixes for DWS GCE support
…uest cancellation (#5) * Fix config fields * fix cancel * Add loggings * remove useless codes
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks a lot for adding the support of DWS with GCE @gurcangercek! This is awesome. This PR should be ready to go once we have the master branch merged. : )
Hey @Michaelvll , thanks for the update and sorry for the delay. It is all yours... |
Tested:
|
hello, can this be used in conjunct with the new skypilot services? |
Hey @binarycrayon, yes, it should be possible to use DWS for your service if you specify the following section in your task: experimental:
config_overrides:
gcp:
managed_instance_group:
run_duration: 3600 For more details: https://skypilot.readthedocs.io/en/latest/reference/yaml-spec.html#experimental |
thanks so much! I'd assume service yaml also support this https://skypilot.readthedocs.io/en/latest/serving/service-yaml-spec.html |
Yes, the service YAML shares all the specs for task |
* [GCP] initial take for dws support with migs * fix lint errors * dependency and format fix * refactor mig instance creation * fix * remove unecessary instance creation code for mig * Fix deletion * Fix instance template logic * Restart * format * format * move to REST APIs instead of python APIs * add multi-node back * Fix multi-node * Avoid spot * format * format * fix scheduling * fix cancel * Add smoke test * revert some changes * fix smoke * Fix * fix * Fix smoke * [GCP] Changing the config name for DWS support and fix for resize request cancellation (#5) * Fix config fields * fix cancel * Add loggings * remove useless codes --------- Co-authored-by: Zhanghao Wu <zhangaho.wu@outlook.com> Co-authored-by: Zhanghao Wu <zhanghao.wu@outlook.com>
Since it is possible to specify spot in instance template, can we enable use of spot instance with managed instance group? |
To handoff the related DWS support work
Tested (run the relevant ones):
bash format.sh
pytest tests/test_smoke.py
pytest tests/test_smoke.py::test_fill_in_the_name
conda deactivate; bash -i tests/backward_compatibility_tests.sh