One-line change to correctly dispatch to cpu function for inference #2881

TroyGarden · 2024-07-23T23:55:55Z

Summary: # context

Reviewed By: dstaay-fb

Differential Revision: D48574563

netlify · 2024-07-23T23:56:10Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`44ad142`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66a0867d6dc97400081adbd7
😎 Deploy Preview	https://deploy-preview-2881--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot · 2024-07-23T23:56:19Z

This pull request was exported from Phabricator. Differential Revision: D48574563

facebook-github-bot · 2024-07-24T03:13:31Z

This pull request was exported from Phabricator. Differential Revision: D48574563

facebook-github-bot · 2024-07-24T04:37:53Z

This pull request was exported from Phabricator. Differential Revision: D48574563

…ytorch#2881) Summary: Pull Request resolved: pytorch#2881 # context * the fundamental issue is that a "dispatch_to_cpu" function is calling the "autograd" function * a side issue is the poor naming: `permute_pooled_embs_auto_grad` sounds like an "autograd version" but actually as an operator the real backend (CPU, GPU, META, AUTOGRAD, etc.) is determined by the dispatcher. * this defact exists from day-1 since this operator was developed (D31271923). * the impact is that in the inference flow where the autograd is not needed, although the dispatcher does call the CPU version of this operator, the actual function calls the autograd function, which is not desired. # how to do it properly: * For a user-faced operator, we first need to give it a good name, like `permute_multi_embedding`. * then we need to implement functions for the necessary backends, commonly 4: CPU, CUDA, META, and AUTOGRAD * it's better to name the function as {operator_name}_{backend}, and **very important to link them correctly**, here is a good example: D48720379 * in training, the dispatcher will **ALWAYS** pickup the autograd version, then [calls cuda/cpu/meta version from the autograd function](https://fburl.com/code/2d1acuop) based on device or other context. * in inference, the dispatcher will ignore the autograd, and directly calls the cpu/cuda/meta version Reviewed By: dstaay-fb, sryap Differential Revision: D48574563

facebook-github-bot · 2024-07-24T04:43:40Z

This pull request was exported from Phabricator. Differential Revision: D48574563

facebook-github-bot · 2024-07-24T16:17:02Z

This pull request has been merged in 500c5fa.

facebook-github-bot added the cla signed label Jul 23, 2024

facebook-github-bot added the fb-exported label Jul 23, 2024

TroyGarden force-pushed the export-D48574563 branch from 1b3573e to 56d5696 Compare July 24, 2024 03:13

TroyGarden force-pushed the export-D48574563 branch from 56d5696 to dbd0569 Compare July 24, 2024 04:38

TroyGarden force-pushed the export-D48574563 branch from dbd0569 to 44ad142 Compare July 24, 2024 04:43

facebook-github-bot closed this in 500c5fa Jul 24, 2024

facebook-github-bot added the Merged label Jul 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

One-line change to correctly dispatch to cpu function for inference #2881

One-line change to correctly dispatch to cpu function for inference #2881

TroyGarden commented Jul 23, 2024

netlify bot commented Jul 23, 2024 •

edited

Loading

facebook-github-bot commented Jul 23, 2024

facebook-github-bot commented Jul 24, 2024

facebook-github-bot commented Jul 24, 2024

facebook-github-bot commented Jul 24, 2024

facebook-github-bot commented Jul 24, 2024

One-line change to correctly dispatch to cpu function for inference #2881

One-line change to correctly dispatch to cpu function for inference #2881

Conversation

TroyGarden commented Jul 23, 2024

netlify bot commented Jul 23, 2024 • edited Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

facebook-github-bot commented Jul 23, 2024

facebook-github-bot commented Jul 24, 2024

facebook-github-bot commented Jul 24, 2024

facebook-github-bot commented Jul 24, 2024

facebook-github-bot commented Jul 24, 2024

netlify bot commented Jul 23, 2024 •

edited

Loading