Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Use FP32 copy of weights for norm (multitensor LAMB optimizer) #17700

Merged
merged 2 commits into from
Mar 23, 2020

Conversation

MoisesHer
Copy link
Contributor

Description

When using Mixed Precision, use the master copy of weights (FP32) for computing the norm

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage: test was already available (tests/python/unittesttest_optimizer:test_multilamb)
  • Code is well-documented:
  • To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Modified the code to use the master copy of weights for computing the norm when mixed precision is used

@eric-haibin-lin eric-haibin-lin merged commit 8e39518 into apache:master Mar 23, 2020
anirudh2290 added a commit to anirudh2290/mxnet that referenced this pull request Mar 27, 2020
* 'master' of https://github.com/apache/incubator-mxnet: (192 commits)
  * impl - FFI for np einsum (apache#17869)
  [Numpy] FFI for diag/diagonal/diag_indices_from (apache#17789)
  [Numpy] Kron operator (apache#17323)
  cmake: Set DMLC_LOG_FATAL_THROW only for building mxnet and not for tvm (apache#17878)
  Add simplified HybridBlock.forward without F (apache#17530)
  Use FP32 copy of weights for norm (multitensor LAMB optimizer) (apache#17700)
  Use multi-tensor sumSQ in clip_global_norm (apache#17652)
  [Numpy] Add op fmax, fmin, fmod (apache#17567)
  Adding sparse support to MXTensor for custom operators (apache#17569)
  Update 3rdparty/mkldnn to v1.2.2 (apache#17313)
  Dynamic subgraph compile support (apache#17623)
  Refactor cpp-package CMakeLists.txt & add missing inference/imagenet_inference (apache#17835)
  staticbuild: Fix potential user-assisted execution of arbitrary code  (apache#17860)
  * FFI for np.argmax and np.argmin (apache#17843)
  ffi for roll/rot90 (apache#17861)
  Skip test_multi_worker_dataloader_release_pool on OS X (apache#17797)
  add ffi for full_like, binary (apache#17811)
  HybridBlock.export() to return created filenames (apache#17758)
  Fix SoftReLU fused operator numerical stability (apache#17849)
  CI: Test clang10 cpu & gpu builds with -WError (apache#17830)
  ...
MoisesHer added a commit to MoisesHer/incubator-mxnet that referenced this pull request Apr 10, 2020
…e#17700)

* Use fp32 copy of weights for computing norm in LAMB optimizer

* Fix cpplint
anirudh2290 pushed a commit to anirudh2290/mxnet that referenced this pull request May 29, 2020
…e#17700)

* Use fp32 copy of weights for computing norm in LAMB optimizer

* Fix cpplint
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants