This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[MXNET-507] Set dtype=int32 for ret_indices in ordering ops #11134

Closed · wants to merge 18 commits

Conversation

sxjscience
Member

@sxjscience sxjscience commented Jun 4, 2018

Description

There are two problems in the ordering operators, i.e., topk, sort, and argsort:

  1. Only real_t is supported.

  2. The indices are stored as real_t. This causes errors in the backward pass, where gradients are passed to the wrong locations.

For example, we could not run the following code in the previous version:

import mxnet as mx
import numpy as np
import mxnet.ndarray as nd

ctx = mx.cpu()

a = mx.nd.arange(54686454, ctx=ctx, dtype=np.int32)
a.attach_grad()

k = 10
with mx.autograd.record():
    b = mx.nd.topk(a, k=k, ret_typ='value')
b.backward(mx.nd.ones((k,), ctx=ctx, dtype=np.int32))
a_grad = a.grad.asnumpy()
for i in range(-1, -k - 1, -1):
    assert a_grad[i] == 1
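For background on why float-typed indices are problematic (an illustration added here, not code from the PR): float32 carries a 24-bit significand, so consecutive integer indices above 2**24 round to the same float value, and an index stored that way can point at the wrong element.

```python
import numpy as np

# float32 cannot distinguish consecutive integers above 2**24,
# so a float-typed index may collapse onto a neighboring position.
assert np.float32(2**24) == np.float32(2**24 + 1)
# int32 keeps them distinct.
assert np.int32(2**24) != np.int32(2**24 + 1)
```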

I propose to fix this bug by changing the dtype of the indices to int32. This makes the code backward incompatible; however, it only breaks some rare usages. Normally we directly output the indices or use them to slice a tensor, and this change does not break those common usages. (Update: I have since adopted the other solution mentioned below.)

Another solution is to support an additional dtype flag for these operators (as suggested in #11031). The problem with this solution is that we cannot avoid a nested macro, which is extremely slow to compile, so I initially did not solve it this way:

MACRO_SWITCH(dtype, DType, {
   MACRO_SWITCH(idtype, IDType, {
      ...
   });
});

This PR also fixes the issue reported in #10085 that ordering ops do not support kNullOp.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Support arbitrary dtype (excluding float16) for the ordering operators, with tests
  • For topk and argsort, add a dtype option to specify the type of the output indices, with tests
  • Fix the bug that ordering ops do not support kNullOp, with tests
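To illustrate the intended common usage (a NumPy sketch with made-up values, not code from this PR): integer-typed indices, analogous to what topk/argsort produce with the new dtype option, can be used directly to gather from the original tensor.

```python
import numpy as np

a = np.array([5.0, 30.0, 10.0, 20.0])
# Indices with an integer dtype, analogous to the output of
# argsort/topk when the new dtype='int32' option is used.
idx = np.argsort(a).astype(np.int32)
sorted_a = a[idx]  # gather with integer indices works directly
```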

Comments

  • Originally this change was backward incompatible. I considered that acceptable because setting the dtype of the indices to int32 does not break the common usage where the indices are used to slice a tensor. With the added dtype flag, the change is now backward compatible.

@sxjscience sxjscience changed the title [MXNET-507] Set dtype of index to be int32 for ordering ops [MXNET-507] Set dtype=int32 for ret_indices in ordering ops Jun 4, 2018
@sxjscience
Member Author

After discussing offline with Eric, I'll add an additional dtype flag to make sure that the OP is backward compatible.

@szha
Member

szha commented Jun 5, 2018

Feel free to remove "breaking" label when you're ready.

@asitstands
Contributor

I think it would be helpful to add a check for the ranges of the floating-point types. The recent addition of dtype to multinomial implemented such a check. https://github.com/apache/incubator-mxnet/blob/3eada3b32aeab5c8cdf7d507bcc3a986c9e5b91f/src/operator/random/sample_multinomial_op.h#L73-L76

@sxjscience
Member Author

sxjscience commented Jun 11, 2018 via email

@szha szha removed the Breaking label Jun 12, 2018
@sxjscience sxjscience requested review from anirudh2290 and removed request for anirudh2290 June 19, 2018 02:45
@sxjscience
Member Author

Is it okay to merge this in?

DMLC_DECLARE_FIELD(dtype)
.add_enum("uint8", mshadow::kUint8)
.add_enum("int32", mshadow::kInt32)
.add_enum("float16", mshadow::kFloat16)
Member

should we remove this option given that it won't be supported?

Member Author

OK, I'll remove it.

Member Author

@szha Actually, the dtype of indices can be float16. There is no need to remove the option.

@thomelane
Contributor

@sxjscience any chance you could add a small update to the documentation saying that the output is sorted? Thanks!

4 participants