Load weights on segmentation/train.py when using --resume and --test-only flags #3285

datumbox · 2021-01-24T14:43:42Z

A good way to verify the accuracy of a specific epoch checkpoint on the evaluation dataset is to execute:

python -m torch.distributed.launch --nproc_per_node=2 --use_env train.py --dataset coco --model model_name_here\ 
  --test-only --resume /path/to/checkpoint.pth

This is supported on both the classification and detection reference scripts but not on segmentation. More specifically on segmentation, the specified weights are not loaded and thus we report the accuracy of randomly initialized weights. This PR fixes this.

datumbox

I left a few notes on the implementation:

datumbox · 2021-01-24T14:51:52Z

references/segmentation/train.py

-        optimizer.load_state_dict(checkpoint['optimizer'])
-        lr_scheduler.load_state_dict(checkpoint['lr_scheduler'])
-        args.start_epoch = checkpoint['epoch'] + 1
+        model_without_ddp.load_state_dict(checkpoint['model'], strict=not args.test_only)


strict=False required to avoid having to configure auxiliary classifiers

datumbox · 2021-01-24T14:53:11Z

references/segmentation/train.py

-        args.start_epoch = checkpoint['epoch'] + 1
+        model_without_ddp.load_state_dict(checkpoint['model'], strict=not args.test_only)
+        if not args.test_only:
+            optimizer.load_state_dict(checkpoint['optimizer'])


We avoid loading the weights of other objects if in test-only mode. Similar to the above, this is done to avoid having to handle auxiliary classifiers.

fmassa

Looks great, thanks for improving this Vasilis!

Summary: Co-authored-by: Francisco Massa <fvsmassa@gmail.com> Reviewed By: datumbox Differential Revision: D26156373 fbshipit-source-id: 83f22c90477ca2da8db176d2455a70ca302d17d1

Load variables when --resume /path/to/checkpoint --test-only

3b7c1be

datumbox requested a review from fmassa January 24, 2021 14:43

facebook-github-bot added the cla signed label Jan 24, 2021

datumbox commented Jan 24, 2021

View reviewed changes

fmassa approved these changes Jan 25, 2021

View reviewed changes

Merge branch 'master' into bugfix/segmentation_testonly

b02857c

fmassa merged commit 1ebda73 into pytorch:master Jan 25, 2021

datumbox deleted the bugfix/segmentation_testonly branch January 25, 2021 09:46

datumbox added the bug label Jun 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load weights on segmentation/train.py when using --resume and --test-only flags #3285

Load weights on segmentation/train.py when using --resume and --test-only flags #3285

datumbox commented Jan 24, 2021

datumbox left a comment

datumbox Jan 24, 2021

datumbox Jan 24, 2021

fmassa left a comment

Load weights on segmentation/train.py when using --resume and --test-only flags #3285

Load weights on segmentation/train.py when using --resume and --test-only flags #3285

Conversation

datumbox commented Jan 24, 2021

datumbox left a comment

Choose a reason for hiding this comment

datumbox Jan 24, 2021

Choose a reason for hiding this comment

datumbox Jan 24, 2021

Choose a reason for hiding this comment

fmassa left a comment

Choose a reason for hiding this comment