-
Option 3 seems to build upon option 2, or else I don't understand what striding in the layout specifically has to do with convNd. Between options 2 and 3, I suppose it boils down to the difference in effort. If we can afford it, option 2 seems nicer.
-
We've got client requests for 1D and 3D convolutions in the next release.
From what I can tell, there are three reasonable ways to do this.
The short-term hack
One observation we could make is that conv2d is just conv3d with an implicit depth dimension of size 1 (and similarly, conv1d is conv2d with a width of 1).
We could therefore go through and transition rock.conv2d to rock.conv3d, adjusting the backwards-data and backwards-weight kernels too. Then, we'd rewrite 2D convolution into 3D convolution, either in rock-conv-to-gemm or by modifying all our clients (tosa-to-rock, rocmlir-gen, and so on) to insert the fake dimension.
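For concreteness, here's a minimal sketch of what inserting that fake dimension on the client side could look like, assuming shapes are plain dimension lists and layouts are lowercase strings like nchw or kcyx; the helper and its name are hypothetical illustrations, not the actual rocMLIR API.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical sketch: turn a 2D convolution problem into a 3D one by
// inserting a unit-length depth dimension just before the first spatial
// dimension ('h' in nchw-style layouts, 'y' in kcyx-style filter layouts).
static void expandTo3D(std::vector<int64_t> &shape, std::string &layout,
                       char depthDim /* e.g. 'd' for input/output, 'z' for filter */) {
  size_t pos = layout.find_first_of("hy");
  if (pos == std::string::npos)
    return; // no spatial dimension found; leave the problem untouched
  layout.insert(layout.begin() + pos, depthDim); // "nchw" -> "ncdhw"
  shape.insert(shape.begin() + pos, 1);          // fake depth of size 1
}
```

The same trick runs in the other direction for 1D: a conv1d client would insert a unit dimension the same way to pose its problem as conv2d.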
convNd, the semi-disruptive way
We could keep our general scheme for rock.conv2d (which we'd just rename rock.conv), where you specify the layout of the filter, input, and output tensors as a series of strings in an attribute. Then, in conv2gemm, we'd just identify how many non-{batch, group, channel} dimensions we have and loop over said dimensions in order to construct things like the gemmK dimension.
This has the advantage that it'd only really require rewriting conv2gemm ... along with a lot of the auxiliary glue code (like the convDims struct) to make it take a general number of height/width/depth/... dimensions.
This is probably more implementable than the third solution here and less of an annoying hack than the first one.
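As a rough illustration of the loop this option calls for, here's a hedged sketch of the gemmK computation, assuming layouts stay strings and gemmK is the product of every filter dimension that isn't the group or output-channel dimension; the function and its surroundings are invented for illustration, not the existing conv2gemm code.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical sketch: instead of hard-coding {c, y, x}, walk the filter
// layout and fold every non-{group, output-channel} dimension into gemmK.
// Works unchanged for 1D (c, x), 2D (c, y, x), and 3D (c, z, y, x) filters.
static int64_t computeGemmK(const std::string &filLayout,
                            const std::vector<int64_t> &filShape) {
  int64_t gemmK = 1;
  for (size_t i = 0; i < filLayout.size(); ++i) {
    char dim = filLayout[i];
    if (dim == 'g' || dim == 'k')
      continue; // group and output-channel dims feed gemmG and gemmM instead
    gemmK *= filShape[i];
  }
  return gemmK;
}
```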
convNd while getting rid of layout attributes
The proposal is that we do #1140 and pin rock.conv2d to some fixed logical layout, like NCHW. Then, we'd have general-purpose code for working out things like what order the C, Y, X / K, H, W dimensions should be concatenated in. Doing this'll allow us to ditch a lot of the support code the second option requires us to modify and simplify things like conv2gemm, at the cost of moving some logic into rocmlir-gen for handling things like -transA false or -fil_layout kgcyx. But, once we've standardized on, for example, NGCHW convolutions, that trailing set of dimensions can be any length we like, and we could expect our code to handle it.
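To make the rocmlir-gen side concrete, here's a hedged sketch of how the flag handling might reduce to computing a permutation from the user's requested layout to the pinned logical layout; this is a guess at the shape of the logic, not existing code, and the names are made up.

```cpp
#include <string>
#include <vector>

// Hypothetical sketch: given the layout the user asked for (say, via
// -fil_layout kgcyx) and the pinned logical layout, compute the permutation
// a transpose would apply before handing the tensor to a layout-free
// rock.conv.
static std::vector<unsigned> permutationTo(const std::string &userLayout,
                                           const std::string &logicalLayout) {
  std::vector<unsigned> perm;
  perm.reserve(logicalLayout.size());
  for (char dim : logicalLayout)
    perm.push_back(static_cast<unsigned>(userLayout.find(dim)));
  return perm; // "kgcyx" -> "gkcyx" yields {1, 0, 2, 3, 4}
}
```

Nothing in this formulation cares about rank, which is the point: once the logical layout is, say, ngcdhw, the same loop handles any number of trailing spatial dimensions.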
My thoughts
I'm leaning toward the third option (finally doing the generalized problem key), because it's been a long-standing issue and not doing it can cause bad tuning caching for MIGraphX; but, if we don't have the time, I like the second option (where we extend the layout attributes) over turning conv2d into conv3d.