Add test for large batches in DeformConv2d #2040
Conversation
Force-pushed from 793d7bd to 6e6c2da
Codecov Report

@@           Coverage Diff           @@
##           master   #2040   +/-   ##
======================================
  Coverage    0.48%    0.48%
======================================
  Files          92       92
  Lines        7411     7411
  Branches     1128     1128
======================================
  Hits           36       36
  Misses       7362     7362
  Partials       13       13

Continue to review full report at Codecov.
Force-pushed from b5aa349 to 252e063
@@ -454,7 +454,7 @@ def expected_fn(self, x, weight, offset, bias, stride=1, padding=0, dilation=1):
         return out

     def get_fn_args(self, device, contiguous):
-        batch_sz = 1
+        batch_sz = 33
Why not test batch_sz 1, 33, and some larger value (in case that magic number changes)?
I would actually like to change this test to be faster -- in its current state it is the slowest test in torchvision (it takes 30s to run). Ideally, I would like to only do gradcheck on a smaller tensor, but still check for correctness in the large-tensor case. My plan was to open an issue to improve this in the future.
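The split described here can be sketched roughly as follows. This is a hypothetical illustration using a generic function (torch.tanh), not the PR's actual test code:

```python
import torch
from torch.autograd import gradcheck

# Hypothetical sketch: gradcheck perturbs every input element numerically,
# so it is run on a tiny double-precision tensor to keep it fast...
small = torch.randn(2, 3, dtype=torch.double, requires_grad=True)
assert gradcheck(torch.tanh, (small,), eps=1e-6, atol=1e-4)

# ...while plain forward correctness can still be checked on a large batch
# against a reference computation (here trivially, tanh against itself).
large = torch.randn(33, 3)
assert torch.allclose(torch.tanh(large), large.tanh())
```

Gradcheck's cost scales with the number of input elements, which is why it dominates the runtime when run on a full-size batch.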
         {n_in_channels * weight_w * weight_h, n_parallel_imgs * out_h * out_w},
         input.options());

     // Separate into blocks
-    grad_input = grad_input.view(
+    grad_input = grad_input.reshape(
Why is this changing from view to reshape?
There are many changes like that, which are mostly artifacts from when I was debugging ongoing issues. Although most of them are not needed per se, I think it's better practice now to use reshape instead of view, as it works with non-contiguous tensors. Basically, using reshape here will probably not change anything in the current code-path, but I think it's safer.
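The difference can be shown with a minimal sketch (not code from this PR): view requires the tensor's strides to be compatible with the new shape, while reshape falls back to a copy when they are not:

```python
import torch

# A transposed tensor is non-contiguous: its strides no longer match
# row-major order, so view cannot reinterpret the storage in place.
t = torch.arange(6).reshape(2, 3).t()
assert not t.is_contiguous()

try:
    t.view(6)  # raises RuntimeError on a non-contiguous tensor
except RuntimeError as err:
    print("view failed:", err)

flat = t.reshape(6)  # succeeds by copying the data when needed
print(flat.tolist())  # [0, 3, 1, 4, 2, 5]
```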
It's safer, but it can also incur unexpected memory operations
Totally agree, but if we want to support non-contiguous tensors in this function, we would need to call .contiguous() beforehand anyway, so this becomes a no-op.
Thanks for the review, Christian!
* Add test for large batches in DeformConv2d
* Clean-up and (try) fix DeformConv2d
* Simplifications and bugfixes
* Try fix CUDA now
Follow-up for #2027