You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
How do I specify the checkpoint to use in lumi eval?
Background: I did the training on a different machine. Exported the checkpoint from there and imported the checkpoint in a different machine. Now, I want to run lumi eval for the imported checkpoint on the machine where the checkpoint was imported. I can extract the contents of exported tar file in the jobs folder and then run the eval. Is there a better way to just specify the checkpoint in the command itself?
The text was updated successfully, but these errors were encountered:
There doesn't seem to be an option, see what lumi eval --help tells you. I would also like to see a better direct interaction with the TensorFlow checkpoints here. Even better: An option to periodically run the evaluation during training (hot, without stopping training first)
How do I specify the checkpoint to use in
lumi eval
?Background: I did the training on a different machine. Exported the checkpoint from there and imported the checkpoint in a different machine. Now, I want to run
lumi eval
for the imported checkpoint on the machine where the checkpoint was imported. I can extract the contents of exported tar file in the jobs folder and then run the eval. Is there a better way to just specify the checkpoint in the command itself?The text was updated successfully, but these errors were encountered: