Commit

Train job keeps 3 checkpoints at a time
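This may be useful for recovering from NaN problems: if the latest checkpoint already contains NaN values, an earlier checkpoint is still available to restore from.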
abisee authored Aug 16, 2017
1 parent 2466daf commit f15e22b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion run_summarization.py
@@ -161,7 +161,7 @@ def setup_training(model, batcher):
     convert_to_coverage_model()
   if FLAGS.restore_best_model:
     restore_best_model()
-  saver = tf.train.Saver(max_to_keep=1) # only keep 1 checkpoint at a time
+  saver = tf.train.Saver(max_to_keep=3) # keep 3 checkpoints at a time
 
   sv = tf.train.Supervisor(logdir=train_dir,
                      is_chief=True,
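To illustrate what raising `max_to_keep` from 1 to 3 changes, here is a minimal sketch of a rolling checkpoint policy in plain Python. It is not TensorFlow's implementation: the `RollingCheckpoints` class, the `demo` helper, and the `model.ckpt-<step>` file names are assumptions made up for this example. The sketch only shows the retention idea, where each save adds a file and the oldest files are deleted once the limit is exceeded.

```python
import os
import tempfile
from collections import deque


class RollingCheckpoints:
    """Toy stand-in for max_to_keep retention (hypothetical, not TensorFlow code)."""

    def __init__(self, directory, max_to_keep=3):
        self.directory = directory
        self.max_to_keep = max_to_keep
        self.paths = deque()  # oldest checkpoint first

    def save(self, step, payload):
        # Write a dummy checkpoint file named after the training step.
        path = os.path.join(self.directory, "model.ckpt-%d" % step)
        with open(path, "w") as f:
            f.write(payload)
        self.paths.append(path)
        # Delete the oldest files once more than max_to_keep exist.
        while len(self.paths) > self.max_to_keep:
            os.remove(self.paths.popleft())
        return path


def demo(max_to_keep=3, num_saves=5):
    """Save num_saves dummy checkpoints and return the file names that survive."""
    with tempfile.TemporaryDirectory() as tmp:
        ckpts = RollingCheckpoints(tmp, max_to_keep=max_to_keep)
        for step in range(1, num_saves + 1):
            ckpts.save(step * 100, "weights at step %d" % (step * 100))
        # Sort by the numeric step suffix so the order is oldest to newest.
        return sorted(os.listdir(tmp), key=lambda n: int(n.rsplit("-", 1)[1]))


print(demo())  # ['model.ckpt-300', 'model.ckpt-400', 'model.ckpt-500']
```

With a limit of 1, only `model.ckpt-500` would survive, so a checkpoint saved after the loss went to NaN would leave nothing older to fall back on. With a limit of 3, the two previous saves are still on disk.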
