-
Notifications
You must be signed in to change notification settings - Fork 6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[AIR] <Part 4> Lightning Trainer Release tests + docstring sample test #33323
[AIR] <Part 4> Lightning Trainer Release tests + docstring sample test #33323
Conversation
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
93b0306
to
f53031f
Compare
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
…ease_test Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
20a3a2d
to
16a33d4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! Can we also remove the existing ray lightning release test if it is being subsumed by the new release test?
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good! Have a few suggestions.
A few other things:
- We can combine
test_tuner
andtest_trainer
. Just switch between the two with a command line arg. - For
test_tuner
, can we also use a scheduler like PBT? I know we don't want to add too much to one test, but this way we'd test out the PTL checkpointing/restoration logic in the release test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks solid. can you respond to all of Justin's comments?
also do you know, or maybe you can ask Kai quickly, if we can release test a script in our doc/examples folder?
that way, we don't have to duplicate code.
@gjoliver I think it has to be in |
Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>
Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>
Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>
Signed-off-by: Yunxuan Xiao <xiaoyunxuan1998@gmail.com>
Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com>
ray-project#33323) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: elliottower <elliot@elliottower.com>
ray-project#33323) Signed-off-by: woshiyyya <xiaoyunxuan1998@gmail.com> Signed-off-by: Jack He <jackhe2345@gmail.com>
Why are these changes needed?
This PR aims to add release tests and docstring tests for
LightningTrainer
.The release tests:
Release Test Job succeed:
Related issue number
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.