[CUDNN] Add cuDNN as a Relay partitioning target (BYOC) #10871
Conversation
LGTM modulo two nits.
python/tvm/relay/op/contrib/cudnn.py
Outdated
assert isinstance(partition.body.op, relay.Function)

global_name = str(partition.attrs.global_symbol)
target = tvm.target.cuda()
Just noticed this: I think Target.current() is better so that the CUDA params are not lost.
I had a go at this based on your suggestion, but it seems the Target only ends up in the context if you use the 'with' way of specifying targets (not when passing them directly to relay.build). In an ideal world, this would get plumbed directly through the BYOC/partitioning mechanism, but I think that exceeds the immediate scope of this PR. Perhaps we could leave it as a future improvement for now?
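For illustration, a minimal runnable sketch of the distinction being described here (standard TVM API, nothing specific to this PR):

```python
import tvm

target = tvm.target.cuda()

# Outside any target scope the context stack is empty, so a BYOC hook
# calling Target.current() would see nothing, even if the same target
# were passed directly to relay.build.
assert tvm.target.Target.current(allow_none=True) is None

# The 'with' form pushes the target onto the context stack, so
# Target.current() can recover it (including its CUDA params).
with target:
    assert tvm.target.Target.current().kind.name == "cuda"
```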
python/tvm/relay/op/contrib/cudnn.py
Outdated
return _register


@tvm._ffi.register_func("relay.ext.cudnn")
Now would be a good time to hoist this boilerplate into a library_byoc.py helper or something similar?
Done - let me know what you think.
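For readers following along, a rough sketch of the kind of helper being discussed. The module name library_byoc.py comes from the comment above, and register_external_codegen is a hypothetical name for illustration, not the code that actually landed:

```python
import tvm


def register_external_codegen(compiler_name, lower_partition):
    """Hypothetical helper hoisting the per-library boilerplate: register
    `lower_partition` as the external codegen hook relay.ext.<compiler_name>.
    """

    @tvm._ffi.register_func(f"relay.ext.{compiler_name}")
    def _codegen(partition):
        # Hand each partitioned Relay function to the library-specific
        # lowering routine (e.g. one that emits cuDNN calls).
        return lower_partition(partition)

    return _codegen
```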
LGTM, just trigger CI again. It looks like a random failure.
Compare: 5c2e300 to 0e06391
It looks like some flaky test on aarch64.
This adds infrastructure to support offloading of Relay patterns to cuDNN. In this initial commit, only softmax is supported. Later PRs will add support for more operators, including some limited fused patterns.
cc @masahi @mbrookhart @junrushao1994 PTAL and merge if you're happy :)
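As a usage sketch (hedged: this assumes the new module follows the usual BYOC convention of exposing a partition_for_cudnn helper, and requires a CUDA/cuDNN-enabled TVM build):

```python
import tvm
from tvm import relay
from tvm.relay.op.contrib.cudnn import partition_for_cudnn

# A trivial module containing the one op this PR offloads: softmax.
x = relay.var("x", shape=(1, 8), dtype="float32")
mod = tvm.IRModule.from_expr(relay.nn.softmax(x))

# Partition the graph so supported patterns are wrapped in cuDNN regions.
mod = partition_for_cudnn(mod)

# Build with the target in scope (see the Target.current() discussion above);
# the partitioned regions are lowered via the relay.ext.cudnn hook.
with tvm.target.cuda():
    lib = relay.build(mod)
```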
SGTM!
* [CUDNN] Add cuDNN as a Relay partitioning target (BYOC): adds infrastructure to support offloading of Relay patterns to cuDNN. In this initial commit, only softmax is supported.
* Refactor common TE BYOC code into separate file
* Add test guard