avoid extra copies in batchnorm inference by introducing a new op, _native_batch_norm_legit_no_training #94946

bdhirsh · 2023-02-15T23:10:46Z

Stack from ghstack (oldest at bottom):

-> avoid extra copies in batchnorm inference by introducing a new op, _native_batch_norm_legit_no_training #94946

…ative_batch_norm_legit_no_training [ghstack-poisoned]

pytorch-bot · 2023-02-15T23:10:49Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/94946

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit cc46919:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ezyang

this is horrible but I am OK with getting it in for release. Is someone signed up for batch norm consolidation

bdhirsh · 2023-02-15T23:20:21Z

Right now nobody is signed up (cc @albanD @janeyx99 ?)

… new op, _native_batch_norm_legit_no_training" [ghstack-poisoned]

bdhirsh · 2023-02-16T00:04:15Z

added a test

…ative_batch_norm_legit_no_training ghstack-source-id: 158da4bfbd9fc40a50282c00cda4981e1c30078e Pull Request resolved: #94946

bdhirsh · 2023-02-16T00:04:34Z

@pytorchbot merge

pytorchmergebot · 2023-02-16T00:06:23Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-02-16T00:56:54Z

Merge failed

Reason: 3 mandatory check(s) failed (Rule superuser). The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job

ngimel · 2023-02-16T01:16:21Z

aten/src/ATen/native/Normalization.cpp

+std::tuple<Tensor, Tensor, Tensor> _batch_norm_legit_no_training(
+    const Tensor& self, const c10::optional<Tensor>& weight_opt, const c10::optional<Tensor>& bias_opt,
+    const Tensor& running_mean, const Tensor& running_var, double momentum, double eps) {
+  return batch_norm_cpu(self, weight_opt, bias_opt, const_cast<Tensor&>(running_mean), const_cast<Tensor&>(running_var), /*train=*/false, momentum, eps);


why is this cpu only?

Whoops 😅 will fix when i get back to my laptop.

although this code path will never actually be hit in the PT2 workflow (since the decomp for this new op is automatically opted into by inductor + any backends that run “core aten decomps”)

… new op, _native_batch_norm_legit_no_training" [ghstack-poisoned]

bdhirsh · 2023-02-16T05:37:16Z

The extra .to(..., copy=True)s will live to see another day (I had to leave them in to appease tests, although I confirmed that copies still don't show up in the batchnorm inference graph). I think they'll be easier to clean up once batch norm consolidation happens. I'm pretty sure this is failing because there's a test passing train=False directly to _native_batch_norm_legit.default, even though that op should never handle the train=False case anymore.

bdhirsh · 2023-02-16T05:37:40Z

@pytorchbot merge

pytorchmergebot · 2023-02-16T05:39:20Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-02-16T05:54:31Z

Merge failed

Reason: 2 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team

Raised by workflow job Failing merge rule: Core Maintainers

… new op, _native_batch_norm_legit_no_training" [ghstack-poisoned]

…ative_batch_norm_legit_no_training ghstack-source-id: 83baffae010d27b4ced28189def1afbe47db2198 Pull Request resolved: #94946

bdhirsh · 2023-02-16T06:25:36Z

@pytorchbot merge

pytorchmergebot · 2023-02-16T06:27:39Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…w op, _native_batch_norm_legit_no_training (pytorch#94946)" This reverts commit 68600fc.

…ative_batch_norm_legit_no_training (pytorch#94946) Pull Request resolved: pytorch#94946 Approved by: https://github.com/ezyang

avoid extra copies in batchnorm inference by introducing a new op, _n…

09cc979

…ative_batch_norm_legit_no_training [ghstack-poisoned]

github-actions bot requested review from albanD, antoniojkim, Chillee, ezyang, jbschlosser, miladm, SherlockNoMad, voznesenskym and wconstab February 15, 2023 23:12

ezyang approved these changes Feb 15, 2023

View reviewed changes

bdhirsh added release notes: python_frontend python frontend release notes category topic: not user facing topic category labels Feb 16, 2023

Update on "avoid extra copies in batchnorm inference by introducing a…

48124e5

… new op, _native_batch_norm_legit_no_training" [ghstack-poisoned]

bdhirsh added a commit that referenced this pull request Feb 16, 2023

avoid extra copies in batchnorm inference by introducing a new op, _n…

e4c5bdd

…ative_batch_norm_legit_no_training ghstack-source-id: 158da4bfbd9fc40a50282c00cda4981e1c30078e Pull Request resolved: #94946

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Feb 16, 2023

ngimel reviewed Feb 16, 2023

View reviewed changes

Update on "avoid extra copies in batchnorm inference by introducing a…

b707d13

… new op, _native_batch_norm_legit_no_training" [ghstack-poisoned]

Update on "avoid extra copies in batchnorm inference by introducing a…

cc46919

… new op, _native_batch_norm_legit_no_training" [ghstack-poisoned]

bdhirsh added a commit that referenced this pull request Feb 16, 2023

avoid extra copies in batchnorm inference by introducing a new op, _n…

328894f

…ative_batch_norm_legit_no_training ghstack-source-id: 83baffae010d27b4ced28189def1afbe47db2198 Pull Request resolved: #94946

pytorchmergebot added the Merged label Feb 16, 2023

pytorchmergebot closed this in 68600fc Feb 16, 2023

This was referenced Feb 17, 2023

hotfix for memory leak in aot autograd induced by saving tensors for backward #95101

Closed

better error message when functionalization cant handle op #95392

Closed

msaroufim mentioned this pull request Mar 3, 2023

Remove mention of dynamo.optimize() in docs #96002

Closed

pruthvistony added a commit to ROCm/pytorch that referenced this pull request May 2, 2023

Revert "avoid extra copies in batchnorm inference by introducing a ne…

6051ba4

…w op, _native_batch_norm_legit_no_training (pytorch#94946)" This reverts commit 68600fc.

facebook-github-bot deleted the gh/bdhirsh/378/head branch June 8, 2023 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

avoid extra copies in batchnorm inference by introducing a new op, _native_batch_norm_legit_no_training #94946

avoid extra copies in batchnorm inference by introducing a new op, _native_batch_norm_legit_no_training #94946

bdhirsh commented Feb 15, 2023 •

edited

Loading

pytorch-bot bot commented Feb 15, 2023 •

edited

Loading

ezyang left a comment

bdhirsh commented Feb 15, 2023

bdhirsh commented Feb 16, 2023

bdhirsh commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

ngimel Feb 16, 2023

bdhirsh Feb 16, 2023

bdhirsh commented Feb 16, 2023

bdhirsh commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

bdhirsh commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

avoid extra copies in batchnorm inference by introducing a new op, _native_batch_norm_legit_no_training #94946

avoid extra copies in batchnorm inference by introducing a new op, _native_batch_norm_legit_no_training #94946

Conversation

bdhirsh commented Feb 15, 2023 • edited Loading

pytorch-bot bot commented Feb 15, 2023 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/94946

✅ No Failures

ezyang left a comment

Choose a reason for hiding this comment

bdhirsh commented Feb 15, 2023

bdhirsh commented Feb 16, 2023

bdhirsh commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

Merge started

pytorchmergebot commented Feb 16, 2023

Merge failed

ngimel Feb 16, 2023

Choose a reason for hiding this comment

bdhirsh Feb 16, 2023

Choose a reason for hiding this comment

bdhirsh commented Feb 16, 2023

bdhirsh commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

Merge started

pytorchmergebot commented Feb 16, 2023

Merge failed

bdhirsh commented Feb 16, 2023

pytorchmergebot commented Feb 16, 2023

Merge started

bdhirsh commented Feb 15, 2023 •

edited

Loading

pytorch-bot bot commented Feb 15, 2023 •

edited

Loading