Skip to content

How I can eval model after each epoch end? #48

Discussion options

You must be logged in to vote

Hi @SherlockHolmes221,

The evaluation code is implemented for a single GPU, without distributed data parallel. So, it wasn't included during training. However, all checkpoints after each epoch will be cached by default in the directory you specified. Each checkpoint is named as ckpt_steps_epochs. You can run evaluation on the checkpoints you are interested in.

Fred.

Replies: 3 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by fredzzhang
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants
Converted from issue

This discussion was converted from issue #47 on July 11, 2022 12:00.