Skip to content

Commit

Permalink
Add vectorization to inner loop
Browse files Browse the repository at this point in the history
  • Loading branch information
echuraev committed Aug 23, 2021
1 parent d32193a commit e19c67a
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions python/tvm/topi/gpu/conv2d_nhwc.py
Original file line number Diff line number Diff line change
Expand Up @@ -110,6 +110,7 @@ def schedule_conv2d_nhwc_direct(cfg, s, Conv):
ni, yi, xi, fi = s[OL].op.axis
ry, rx, rc = s[OL].op.reduce_axis
rco, rci = s[OL].split(rc, factor=step)
s[OL].vectorize(fi)
s[OL].reorder(rco, ry, rx, rci, ni, fi)

s[AA].compute_at(s[OL], rx)
Expand Down

0 comments on commit e19c67a

Please sign in to comment.