Add Quantized version of RoIAlign #3624
Conversation
I think this is ready for review - CI failures are due to #3631
quick benchmark on 30k boxes:
roialign float 19.7 ms ± 652 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
qroialign torch.qint8 29.7 ms ± 794 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
qroialign torch.quint8 29.8 ms ± 1.54 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
qroialign torch.qint32 29.9 ms ± 1.9 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
torchvision/csrc/ops/roi_align.h
Outdated
@@ -30,6 +30,123 @@ at::Tensor _roi_align_backward(
    int64_t sampling_ratio,
    bool aligned);

template <typename T>
I just moved the common code into the header, added some comments, and removed unused parameters. The rest of the code is unchanged.
Maybe there's a better place to put common code?
The header files located at the root of the /ops/ folder contain the "public" methods of the C++ API, while the cpp files contain the code necessary for calling the dispatcher and registering the operators. They are not meant to contain the actual implementation of the methods.
Up until now, there was very little need to share code between the CPU and CUDA implementations, and as a result you won't find an example in torchvision's codebase that does what you want to do. One approach is to follow the pattern used in PyTorch and move the common code into separate files in the /ops/cpu/ folder. See the Loops.h file, which is used by both the CPU and the quantized kernels.
namespace {

template <typename T>
void qroi_align_forward_kernel_impl(
Reviewing this file might be easier by doing a vimdiff or similar with ops/cpu/roi_align_kernel.cpp.
torchvision/csrc/ops/roi_align.h
Outdated
    int roi_bin_grid_h,
    int roi_bin_grid_w,
    std::vector<PreCalc<T>>& pre_calc) {
  int pre_calc_index = 0;
I would also consider moving the actual implementation of lengthy methods to a separate cpp file.
int roi_batch_ind = at::native::dequantize_val(
    rois_scale, rois_zp, offset_rois[0]);
I think we might have issues here with large images and batches.
Imagine that the image is of size 1000x1000, and the boxes thus can be from 0 to 1000 in value. The quantized boxes (which have the same quantizers for all boxes) would not be able to properly index the 0, 1, 2, etc values needed for the image index in the batch.
I think the solution for this is either
- [easy] disable batch support altogether in the quantized version of the op
- [medium] try using per-channel quantization of the boxes (so that the quantizer for the indices is different than the quantizer for the coordinates) (need to see how it integrates with the rest of the pipeline)
- [harder] break BC in the kernels and pass instead a list of tensors for the rois. This would maybe involve more changes down the road in the Faster R-CNN model and would probably be tricky, so I wouldn't recommend this.
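The representability problem described above can be sketched with plain affine quantization arithmetic (a toy quint8-style quantizer; the scale and clamping range here are illustrative, not taken from the PR):

```python
def quantize(v, scale, zero_point, qmin=0, qmax=255):
    """Affine quantization: round to the nearest representable level, then clamp."""
    return max(qmin, min(qmax, round(v / scale) + zero_point))

def dequantize(q, scale, zero_point):
    return (q - zero_point) * scale

# Boxes on a 1000x1000 image: a single (scale, zero_point) pair must cover [0, 1000],
# so the quantization step is about 3.9 pixels.
scale, zp = 1000 / 255, 0

# Batch indices 0, 1, 2 stored in the same tensor collapse or drift:
for idx in (0, 1, 2):
    dq = dequantize(quantize(idx, scale, zp), scale, zp)
    print(idx, "->", dq)
# index 1 dequantizes back to 0.0, so the wrong batch element would be read
```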
Additionally, can you also add a quick test exercising this code-path that I mentioned (with a large image and a few batch elements)?
Yes, if I understand your concern correctly, this is similar to my previous #3624 (comment):
we can't represent 1 with (scale, zero_point) = (2, 0)
I pushed a check in d6f78ab as we discussed yesterday, but I realize now that it's not correct/enough anyway: it assumes that (scale, zero_point) == (1, 0), which is wrong in general.
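The failing case mentioned here, (scale, zero_point) = (2, 0), can be checked directly with a toy affine quantizer (a sketch, not the kernel's code; note that Python's round uses round-half-to-even, as PyTorch's quantization does):

```python
def quantize(v, scale, zero_point, qmin=0, qmax=255):
    # Round to the nearest quantization level and clamp to the integer range.
    return max(qmin, min(qmax, round(v / scale) + zero_point))

def dequantize(q, scale, zero_point):
    return (q - zero_point) * scale

scale, zp = 2, 0
q = quantize(1, scale, zp)            # round(0.5) == 0 with round-half-to-even
assert dequantize(q, scale, zp) != 1  # 1 cannot be represented exactly
```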
Looks great, thanks!
Can you open an issue to track adding support for N > 1 for quantized tensors?
Summary:
* WIP
* clang
* docs
* extracted out common utils
* Use better quantization function and pass tensors as parameters
* proper dequantization
* Some tests
* Dequantization optimization, seems to gain a few ms
* clang-format
* again
* more correct test. Had to remove optimization although it almost works
* Also test aligned=True
* remove useless part
* more docs and comments
* Put back optimization with more robust test
* Added check for index upper bound
* avoid possible overflow
* Move common function into common.h
* oops
* scale=1,zero_point=0 makes more sense
* Force batch size of 1 to prevent any indexing bug
* format
* format again
* updated docstring
* put back description comment for pre_calc_bilinear_interpolate
* revert most changes to docstring as it's taken care of in another PR

Reviewed By: NicolasHug
Differential Revision: D27706946
fbshipit-source-id: 2ae1614c214ea676b4f7705dc0716efd9f34330e
This PR implements support for quantized integers for the forward pass of RoIAlign on CPU.