Training scheme of the pretrained imagenet models? #1986

tzm1003306213 · 2020-03-15T09:16:29Z

Hi,

Are the pretrained models reported by torchvision using the same hyper-parameters as https://github.com/pytorch/examples/blob/master/imagenet/main.py? I used the default hyper-parameters to train mobilenet_v2, but the results were much worse than reported.

Thanks

pmeier · 2020-03-17T09:50:22Z

You can find the training scripts for all torchvision models under the references folder. For classification, this is the script you want:

https://github.com/pytorch/vision/blob/master/references/classification/train.py

fmassa · 2020-03-17T10:04:39Z

As @pmeier mentioned, we also provide the default hyperparameters for the pre-trained models in torchvision under the references folder.
For mobilenet_v2 we used https://github.com/pytorch/vision/tree/master/references/classification#mobilenetv2

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py\
     --model mobilenet_v2 --epochs 300 --lr 0.045 --wd 0.00004\
     --lr-step-size 1 --lr-gamma 0.98

Let us know if you have further questions
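To make the schedule in that command concrete, here is a minimal sketch of the learning rate it implies. It assumes that `--lr-step-size` and `--lr-gamma` describe a plain step decay (the rate is multiplied by gamma once every step-size epochs); the thread does not confirm how train.py applies them, so check the script itself. The helper name `step_decay_lr` is hypothetical and is not part of torchvision.

```python
def step_decay_lr(base_lr: float, epoch: int, step_size: int, gamma: float) -> float:
    """Learning rate after `epoch` completed epochs under step decay.

    Assumption: the rate is multiplied by `gamma` once every `step_size` epochs.
    """
    return base_lr * gamma ** (epoch // step_size)


# Values copied from the mobilenet_v2 command above.
MOBILENET_V2 = {"base_lr": 0.045, "step_size": 1, "gamma": 0.98}

if __name__ == "__main__":
    # Print the rate at the start, after one epoch, mid-run, and in the last epoch.
    for epoch in (0, 1, 150, 299):
        print(f"epoch {epoch:3d}: lr = {step_decay_lr(epoch=epoch, **MOBILENET_V2):.6f}")
```

With a step size of 1 the rate shrinks by 2% every epoch, which is a much gentler decay than dividing by 10 every few dozen epochs and is one reason the default settings of a different script may not reproduce the reported mobilenet_v2 accuracy.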

magamba · 2020-03-19T10:36:30Z

Hello,

I have a similar issue. I am using pretrained AlexNet and VGG models from torchvision for a scientific paper and, in order to interpret my results, I would like to know how the models were trained. I have checked here as suggested, but I am unable to find any reference to VGG19, its shallower variants, and AlexNet. Are they published anywhere else?

Thank you

pmeier · 2020-03-19T10:40:13Z

I expect these models were trained with the default parameters given in train.py, but I can't be sure. @fmassa ?

fmassa · 2020-03-19T18:22:11Z

AlexNet and VGG were trained a long time ago by @colesbury. I think they might follow the same procedure as ResNet (and thus the default parameters), but I'm not 100% sure. The original PR adding those is in #23.

colesbury · 2020-03-19T18:26:46Z

Models with batch normalization were trained with the default parameters. Models without batch normalization were trained with an initial learning rate of 0.01 (i.e. 1/10th the default learning rate).

See https://github.com/pytorch/examples/tree/master/imagenet#training
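The rule above can be sketched as a small lookup. This is only an illustration: the default of 0.1 is inferred from "0.01 (i.e. 1/10th the default learning rate)", the decay by a factor of 10 every 30 epochs is the schedule described in the linked README, and the `HAS_BATCH_NORM` table and both function names are hypothetical helpers, not torchvision APIs.

```python
DEFAULT_LR = 0.1         # default initial learning rate of the linked imagenet example
NO_BATCH_NORM_LR = 0.01  # 1/10th of the default, used for models without batch norm

# Hypothetical lookup table for a few torchvision model names.
HAS_BATCH_NORM = {
    "alexnet": False,
    "vgg19": False,
    "vgg19_bn": True,
    "resnet50": True,
}


def initial_lr(model_name: str) -> float:
    """Initial learning rate according to the rule stated above."""
    return DEFAULT_LR if HAS_BATCH_NORM[model_name] else NO_BATCH_NORM_LR


def lr_at_epoch(model_name: str, epoch: int) -> float:
    """Assumed default schedule: divide the rate by 10 every 30 epochs."""
    return initial_lr(model_name) * 0.1 ** (epoch // 30)


if __name__ == "__main__":
    for name in HAS_BATCH_NORM:
        print(name, [round(lr_at_epoch(name, e), 6) for e in (0, 30, 60)])
```

Reproducing AlexNet or plain VGG would then mean passing the lower initial rate explicitly rather than relying on the script default.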

pmeier · 2020-03-19T18:32:43Z

Should we add this in the classification reference README? If yes, I could send a PR tomorrow.

fmassa · 2020-03-19T18:43:04Z

@pmeier yes please, if you could send a PR improving the README it would be great

fmassa closed this as completed Mar 17, 2020

fmassa added module: models module: reference scripts question labels Mar 17, 2020

pmeier mentioned this issue Mar 20, 2020

Add default training parameters to classification refrence README #1998

Merged
