-
Notifications
You must be signed in to change notification settings - Fork 7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training scheme of the pretrained imagenet models? #1986
Comments
You can find the training scripts for all https://github.com/pytorch/vision/blob/master/references/classification/train.py |
As @pmeier mentioned, we also provide the default hyperparameters for the pre-trained models in torchvision under the
Let us know if you have further questions |
Hello, I have a similar issue. I am using pretrained AlexNet and VGG models from torchvision for a scientific paper and, in order to interpret my results, I would like to know how the models were trained. I have checked here as suggested, but I am unable to find any reference to VGG19, its shallower variants, and AlexNet. Are they published anywhere else? Thank you |
AlexNet and VGG have been trained a long time ago by @colesbury , I think they might follow the same procedure as ResNet (and thus default parameters), but I'm not 100% sure. Original PR adding those is inn #23 |
Models with batch normalization were trained with the default parameters. Models without batch normalization were trained with an initial learning rate of 0.01 (i.e. 1/10th the default learning rate). See https://github.com/pytorch/examples/tree/master/imagenet#training |
Should we add this in the classification reference README? If yes, I could send a PR tomorrow. |
@pmeier yes please, if you could send a PR improving the README it would be great |
Hi,
Are the pretrained models reported by torchvision using the same hyper-parameters as https://github.com/pytorch/examples/blob/master/imagenet/main.py? I used the default hyper-parameters to train mobilenet_v2, but the results were much worse than reported.
Thanks
The text was updated successfully, but these errors were encountered: