Use FP32 copy of weights for norm (multitensor LAMB optimizer) #17700

MoisesHer · 2020-02-27T02:32:03Z

Description

When using Mixed Precision, use the master copy of weights (FP32) for computing the norm

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage: test was already available (tests/python/unittesttest_optimizer:test_multilamb)
Code is well-documented:
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Modified the code to use the master copy of weights for computing the norm when mixed precision is used

* 'master' of https://github.com/apache/incubator-mxnet: (192 commits) * impl - FFI for np einsum (apache#17869) [Numpy] FFI for diag/diagonal/diag_indices_from (apache#17789) [Numpy] Kron operator (apache#17323) cmake: Set DMLC_LOG_FATAL_THROW only for building mxnet and not for tvm (apache#17878) Add simplified HybridBlock.forward without F (apache#17530) Use FP32 copy of weights for norm (multitensor LAMB optimizer) (apache#17700) Use multi-tensor sumSQ in clip_global_norm (apache#17652) [Numpy] Add op fmax, fmin, fmod (apache#17567) Adding sparse support to MXTensor for custom operators (apache#17569) Update 3rdparty/mkldnn to v1.2.2 (apache#17313) Dynamic subgraph compile support (apache#17623) Refactor cpp-package CMakeLists.txt & add missing inference/imagenet_inference (apache#17835) staticbuild: Fix potential user-assisted execution of arbitrary code (apache#17860) * FFI for np.argmax and np.argmin (apache#17843) ffi for roll/rot90 (apache#17861) Skip test_multi_worker_dataloader_release_pool on OS X (apache#17797) add ffi for full_like, binary (apache#17811) HybridBlock.export() to return created filenames (apache#17758) Fix SoftReLU fused operator numerical stability (apache#17849) CI: Test clang10 cpu & gpu builds with -WError (apache#17830) ...

…e#17700) * Use fp32 copy of weights for computing norm in LAMB optimizer * Fix cpplint

MoisesHer added 2 commits February 26, 2020 18:24

Use fp32 copy of weights for computing norm in LAMB optimizer

abbc2db

Fix cpplint

cd2c9e9

eric-haibin-lin approved these changes Mar 23, 2020

View reviewed changes

eric-haibin-lin merged commit 8e39518 into apache:master Mar 23, 2020

MoisesHer added a commit to MoisesHer/incubator-mxnet that referenced this pull request Apr 10, 2020

Use FP32 copy of weights for norm (multitensor LAMB optimizer) (apach…

0071890

…e#17700) * Use fp32 copy of weights for computing norm in LAMB optimizer * Fix cpplint

anirudh2290 pushed a commit to anirudh2290/mxnet that referenced this pull request May 29, 2020

Use FP32 copy of weights for norm (multitensor LAMB optimizer) (apach…

d593502

…e#17700) * Use fp32 copy of weights for computing norm in LAMB optimizer * Fix cpplint

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use FP32 copy of weights for norm (multitensor LAMB optimizer) #17700

Use FP32 copy of weights for norm (multitensor LAMB optimizer) #17700

MoisesHer commented Feb 27, 2020

Use FP32 copy of weights for norm (multitensor LAMB optimizer) #17700

Use FP32 copy of weights for norm (multitensor LAMB optimizer) #17700

Conversation

MoisesHer commented Feb 27, 2020

Description

Checklist

Essentials

Changes