This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[Numpy] The symbolic export of BatchNorm is wrong #18373

Closed
sxjscience opened this issue May 20, 2020 · 5 comments · Fixed by #18377

@sxjscience (Member):

import mxnet as mx
import json
import pprint
mx.npx.set_np()
net = mx.gluon.nn.BatchNorm(epsilon=2E-5, axis=2)
net.hybridize()
net.initialize()
a = net(mx.np.ones((10, 3, 5, 5)))
net.export('bnorm', 0)
with open('bnorm-symbol.json') as f:
   dat = json.load(f)
   pprint.pprint(dat)

Output:

           {'attrs': {'__profiler_scope__': 'batchnorm0:',
                      'axis': '1',
                      'eps': '1e-05',
                      'fix_gamma': 'False',
                      'momentum': '0.9',
                      'use_global_stats': 'False'},
            'inputs': [[0, 0, 0], [1, 0, 0], [2, 0, 0], [3, 0, 1], [4, 0, 1]],
            'name': 'batchnorm0_fwd',
            'op': 'BatchNorm'}]}

We can see that eps and axis are not stored: the exported symbol records axis='1' and eps='1e-05' instead of the configured axis=2 and epsilon=2e-5.
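As a quick sanity check (a sketch added here, not part of the original report), one can compare the attrs recorded in the exported symbol JSON against the values the layer was constructed with. The helper below is hypothetical and operates on a JSON string shaped like the excerpt above; the enclosing 'nodes' list is assumed from the MXNet symbol JSON layout:

```python
import json

def check_batchnorm_attrs(symbol_json, expected_axis, expected_eps):
    """Return a list of mismatches between the exported BatchNorm attrs
    and the values the layer was constructed with."""
    dat = json.loads(symbol_json)
    problems = []
    for node in dat.get('nodes', []):
        if node.get('op') != 'BatchNorm':
            continue
        attrs = node.get('attrs', {})
        if int(attrs.get('axis', 1)) != expected_axis:
            problems.append("axis: exported %s, expected %s"
                            % (attrs.get('axis'), expected_axis))
        if abs(float(attrs.get('eps', 1e-5)) - expected_eps) > 1e-12:
            problems.append("eps: exported %s, expected %s"
                            % (attrs.get('eps'), expected_eps))
    return problems

# Attrs transcribed from the buggy export shown above; the 'nodes'
# wrapper is assumed, since the excerpt shows only one node.
buggy = json.dumps({'nodes': [{'op': 'BatchNorm',
                               'name': 'batchnorm0_fwd',
                               'attrs': {'axis': '1', 'eps': '1e-05'}}]})

# Reports two mismatches: the layer was built with axis=2, epsilon=2e-5.
print(check_batchnorm_attrs(buggy, expected_axis=2, expected_eps=2e-5))
```

Running such a check after `net.export(...)` would have caught this silently-wrong export.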

@sxjscience (Member, Author):

I find that this issue is not limited to numpy; it also exists in ndarray:

import mxnet as mx
import json
import pprint
#mx.npx.set_np()
net = mx.gluon.nn.BatchNorm(epsilon=2E-5, axis=2)
net.hybridize()
net.initialize()
a = net(mx.nd.ones((10, 3, 5, 5)))
net.export('bnorm', 0)
with open('bnorm-symbol.json') as f:
   dat = json.load(f)
   pprint.pprint(dat)

Output:

           {'attrs': {'__profiler_scope__': 'batchnorm0:',
                      'axis': '1',
                      'eps': '1e-05',
                      'fix_gamma': 'False',
                      'momentum': '0.9',
                      'use_global_stats': 'False'},
            'inputs': [[0, 0, 0], [1, 0, 0], [2, 0, 0], [3, 0, 1], [4, 0, 1]],
            'name': 'batchnorm0_fwd',
            'op': 'BatchNorm'}]}

@wkcn (Member) commented Jun 5, 2020:

Hi @sxjscience, is it possible to delete the pre-built pip packages impacted by this issue?

BatchNorm is used almost everywhere, and this bug does not raise any exception. Users may install a previous version of MXNet containing this bug and find that their accuracy drops.

@sxjscience (Member, Author):

@wkcn Yes, this is a disaster for the users. However, deleting the pre-built pip packages is also not a good option, because some users do not use BatchNorm. We will need to ensure that the official 1.7 release does not contain this bug.

@szha (Member) commented Jun 5, 2020:

cc @ciyongch

@ciyongch (Contributor) commented Jun 5, 2020:

Hi @szha, v1.7.x doesn't include PR #17679 (it's a new feature added after the code freeze), so this issue does not exist on that branch. For the v1.x branch, the fix has already been cherry-picked.
I just checked the latest commit of both the v1.7.x and v1.x branches with the above reproducer, and both work well. So no action is needed for this case.

           {'attrs': {'axis': '2',
                      'eps': '2e-05',
                      'fix_gamma': 'False',
                      'momentum': '0.9',
                      'use_global_stats': 'False'},
            'inputs': [[0, 0, 0], [1, 0, 0], [2, 0, 0], [3, 0, 1], [4, 0, 1]],
            'name': 'batchnorm0_fwd',
            'op': 'BatchNorm'}]}
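The fixed export above can also be checked mechanically. A minimal sketch (the attrs are transcribed from the excerpt above; the enclosing 'nodes' wrapper is an assumption about the full symbol JSON):

```python
import json

# Attrs transcribed from the fixed export shown above; the enclosing
# {'nodes': [...]} wrapper is assumed from the MXNet symbol JSON layout.
fixed = json.dumps({'nodes': [{'op': 'BatchNorm',
                               'name': 'batchnorm0_fwd',
                               'attrs': {'axis': '2',
                                         'eps': '2e-05',
                                         'fix_gamma': 'False',
                                         'momentum': '0.9',
                                         'use_global_stats': 'False'}}]})

node = json.loads(fixed)['nodes'][0]
assert int(node['attrs']['axis']) == 2       # matches axis=2 passed to BatchNorm
assert float(node['attrs']['eps']) == 2e-05  # matches epsilon=2E-5
print('export attrs match the configured layer')
```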
