[TOPI] Fix GPU Dynamic Op Schedule #7117
Conversation
A couple of nitpicks, but overall, it looks great, awesome work.
mod,
[np_indices_result, np_valid_box_count],
only_vm=False,
disable_targets=["nvptx"],
This tests the empty output VM change 👍
Why disable nvptx?
There is an issue causing a segfault from dynamic NMS on nvptx, and in general we need Thrust for any dynamic-shape sorting. For now, nvptx is not ready for these operations.
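(For reference, a minimal sketch of how one can probe for Thrust at runtime before relying on it for dynamic-shape sorting; this assumes the usual contrib registration name and is not code from this PR.)

import tvm

# Sketch: returns True only if TVM was built with the CUDA Thrust contrib,
# so callers can fall back or skip targets that cannot sort dynamic shapes.
def thrust_available():
    return tvm.get_global_func("tvm.contrib.thrust.sort", allow_missing=True) is not None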
Makes sense. I'm trying to fix the default sort kernel in #7099, if you want to take a look.
@@ -199,6 +199,15 @@ def test_any_concat():
ref = np.concatenate([x_np - 3.0, y_np * 5.0], axis=0)
check_result([x_np, y_np], mod, ref)

num_inputs = 25
x = [relay.var("x", shape=(relay.Any(),), dtype="float32") for _ in range(num_inputs)]
z = relay.op.concatenate(x, axis=0)
This tests the injective schedule 👍
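For context, a plausible completion of this test hunk, following the check_result pattern used earlier in test_any.py (the random input sizes below are illustrative, not taken from the PR):

mod = tvm.IRModule()
mod["main"] = relay.Function(x, z)
# 25 dynamically shaped 1-D inputs exercise the fused injective (concatenate) schedule.
x_np = [np.random.uniform(size=(np.random.randint(1, 10),)).astype("float32") for _ in range(num_inputs)]
ref = np.concatenate(x_np, axis=0)
check_result(x_np, mod, ref)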
@@ -754,7 +782,22 @@ def non_max_suppression(
)
score_axis = score_index
score_shape = (batch_size, num_anchors)
score_tensor = te.compute(score_shape, lambda i, j: data[i, j, score_axis], tag=tag.ELEMWISE)
data_buf = tvm.tir.decl_buffer(data.shape, data.dtype, "data_buf", data_alignment=8)
This looks fine, but I'm a little surprised it's necessary. Do you have a test case that breaks the current code, or is this mostly for performance?
When the NMS workload is large, as in RCNN models, the general CUDA injective schedule can still cause runtime errors even with the improvements in this PR. In general, any dynamic injective op can hit runtime issues with the current uniform CUDA injective schedule.
This problem is not directly related to NMS, but to the CUDA injective schedule. Later we might need to revisit this part for GPU dynamic ops and come up with a better, more general solution (together with more tests).
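To illustrate the kind of fix involved, here is a minimal sketch of a bounded CUDA injective schedule. This is an illustration of the technique, not the PR's actual code; num_thread and max_block are assumed values, and in practice the thread count would come from the target's max_num_threads.

import tvm
from tvm import te

def schedule_injective_bounded(out, num_thread=1024, max_block=256):
    # Sketch: bind an elementwise op so blockIdx.x never exceeds max_block,
    # even when the fused extent is dynamic or very large.
    s = te.create_schedule(out.op)
    fused = s[out].fuse(*s[out].op.axis)
    # Outer split: each (block, thread) pair walks its remaining elements
    # sequentially via xo, so the launched grid stays bounded.
    xo, xi = s[out].split(fused, factor=num_thread * max_block)
    bx, tx = s[out].split(xi, factor=num_thread)
    s[out].reorder(bx, tx, xo)
    s[out].bind(bx, te.thread_axis("blockIdx.x"))
    s[out].bind(tx, te.thread_axis("threadIdx.x"))
    return s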
@@ -194,6 +197,8 @@ def _callback(op):

if cfg.is_fallback:
    N, F, Y, X = get_const_tuple(conv.shape)
    if not isinstance(N, int):
        N = 1
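As a quick standalone illustration of why the guard is needed (n here is a symbolic size var standing in for the relay.Any() batch dimension the dynamic path produces):

import tvm
from tvm import te
from tvm.topi.utils import get_const_tuple

n = te.size_var("n")                      # stands in for a dynamic (Any) batch dim
conv = te.placeholder((n, 64, 56, 56), name="conv")
N, F, Y, X = get_const_tuple(conv.shape)
# N comes back as a symbolic expression, not a Python int,
# so the fallback config conservatively assumes a batch size of 1.
print(isinstance(N, int))                 # False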
Can we add a test that hits this change?
Yeah, we do have a test for this. I've now enabled all targets for it.
LGTM
Thanks @mbrookhart
* Fix GPU dynamic op schedules
* Fix dynamic shape nms
* Fix
* Fix test format
This PR limits the resources used by dynamic-shape GPU kernels to avoid runtime errors. It also skips CallPacked in the VM when a kernel has only one output and that output is empty, e.g. of shape (1, 0, 6).

After this PR, TF and PT object detection models should be runnable on Nvidia GPUs.
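As a small illustration of the empty-output condition (a hypothetical helper, not the VM's actual code):

# A tensor is "empty" when any dimension is zero, e.g. shape (1, 0, 6);
# there is nothing to compute, so the VM can skip the packed call for it.
def is_empty(shape):
    return any(int(dim) == 0 for dim in shape)

print(is_empty((1, 0, 6)))   # True  -> skip the kernel
print(is_empty((1, 5, 6)))   # False -> run the kernel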
@zhiics @Laurawly @mbrookhart