Use torch instead of scipy for random initialization of inception and googlenet weights #4256
Conversation
Thanks for the PR @vmoens,
I realize this is still WIP, but I'm seeing lots of changed pkl files. According to https://app.circleci.com/pipelines/github/pytorch/vision/9719/workflows/9ad29d07-4475-418c-88e9-54e8f09b8c5c/jobs/721968 it seems that only one test was failing, so we probably don't need to update all of those.
Also, it seems we still have a typing error for `float(m.stddev)`.
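The `float(m.stddev)` cast mentioned above can be sketched as follows. This is a hypothetical illustration, not the exact torchvision code: the `init_weights` helper and the tensor-valued `stddev` attribute are assumptions made for the example.

```python
import torch
from torch import nn

def init_weights(m: nn.Module) -> None:
    """Initialize conv/linear weights with a truncated normal.

    Some modules may carry a `stddev` attribute to control the init
    spread (hypothetical here); the float(...) cast avoids a typing
    error when `stddev` is stored as a tensor rather than a float.
    """
    if isinstance(m, (nn.Conv2d, nn.Linear)):
        stddev = float(m.stddev) if hasattr(m, "stddev") else 0.1
        nn.init.trunc_normal_(m.weight, mean=0.0, std=stddev, a=-2, b=2)

m = nn.Conv2d(3, 8, 3)
m.stddev = torch.tensor(0.01)  # stored as a tensor; the cast is needed
init_weights(m)
# trunc_normal_ truncates at the absolute bounds a=-2, b=2
assert m.weight.abs().max().item() <= 2.0
```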
Thanks @vmoens !
I tagged as BC breaking because the results might change slightly, but it's fine as we don't have guarantees w.r.t. randomness
@vmoens Thanks for the PR. The change looks useful. Though it's correctly marked as BC-breaking, hopefully the two methods don't produce significantly different results, and the average user should be OK.

@NicolasHug It might be worth confirming that a model initialized under the new scheme does not diverge. A rudimentary check in this case would be to run the model for 1-2 epochs and confirm that the loss decreases on the new branch. Is this something we have already run, or plan to run in the future?
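The divergence check suggested above can be sketched as a quick smoke test. The toy model, random data, and hyperparameters below are placeholders, not the actual torchvision training recipe: the idea is only to verify that training under the new init makes the loss decrease.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Toy stand-in for a model initialized with the new scheme
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))
for m in model.modules():
    if isinstance(m, nn.Linear):
        nn.init.trunc_normal_(m.weight, mean=0.0, std=0.01, a=-2, b=2)

opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()
x = torch.randn(64, 16)
y = torch.randint(0, 4, (64,))

first = loss_fn(model(x), y).item()
for _ in range(20):  # a brief "epoch" on the fixed toy batch
    opt.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    opt.step()
last = loss_fn(model(x), y).item()

# The loss decreasing is a rough signal that the init is sane
assert last < first
```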
…ption and googlenet weights (#4256)

Summary: using `nn.init.trunc_normal_` instead of `scipy.stats.truncnorm`
Reviewed By: NicolasHug
Differential Revision: D30417203
fbshipit-source-id: 6b04f6bf7f6d30dfbc65980a4036a9dc539e4651
Co-authored-by: Vincent Moens <vmoens@fb.com>
Follow-up to #4250 (comment)

Weights generated with `truncnorm` from scipy are conditioned on NumPy's global seed (or a `RandomState`). We want to avoid setting `np.random.seed` in our tests, and we also don't want to pass a `RandomState` object in the model args. As such, we move to generating weights with `torch.nn.init.trunc_normal_` instead, which is conditioned on `torch.manual_seed`.
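A minimal sketch of the switch, with illustrative `std`, `a`, and `b` values (not the exact ones used in the models). Note that scipy's `truncnorm` takes its bounds in standard-deviation units, while `trunc_normal_`'s `a`/`b` are absolute values, which is one reason the two methods can differ slightly.

```python
import torch

# Before (sketch): scipy's truncnorm draws were conditioned on NumPy's
# global random state, e.g. something along the lines of:
#   X = scipy.stats.truncnorm(-2, 2, scale=stddev)  # bounds in std units
#   values = X.rvs(m.weight.numel())
#   m.weight.data.copy_(torch.as_tensor(values).view_as(m.weight))

# After: trunc_normal_ is conditioned on torch.manual_seed, so tests
# control initialization without touching np.random.seed.
torch.manual_seed(42)
w = torch.empty(64, 3, 7, 7)
torch.nn.init.trunc_normal_(w, mean=0.0, std=0.01, a=-2.0, b=2.0)

torch.manual_seed(42)
w2 = torch.empty(64, 3, 7, 7)
torch.nn.init.trunc_normal_(w2, mean=0.0, std=0.01, a=-2.0, b=2.0)

assert torch.equal(w, w2)  # reproducible under torch.manual_seed alone
```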