Post-paper Detection Optimizations #5444

datumbox · 2022-02-19T11:38:25Z

Related to #5307 and #2263. Touches upon requests recorded at #4932 and #5325.

Our target is to improve the existing RetinaNet, FasterRCNN and MaskRCNN architectures using post-paper optimizations:

* Adds a new RetinaNet variant with post-paper optimizations proposed in "Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection".
* Adds new FasterRCNN and MaskRCNN variants with post-paper optimizations proposed in "Benchmarking Detection Transfer Learning with Vision Transformers" by @rbgirshick and @pdollar.
* Adds support for elements from @vaibhava0's recipe, @xiaohu2015's recipe and @fmassa's work on DETR. Other recipes that influenced this work are those of Swin Transformers and XCiT.

This PR contains commits which were later split into separate PRs to make reviewing easier. Currently it contains the following changes on top of the main branch:

* Adds support for norm_layer in all Detection heads.
* Adds a new extensible FastRCNNConvFCHead, which follows a strategy similar to the existing MaskRCNNHeads.
* Replaces Conv-Norm-BNs with Conv2dNormActivation where possible; we use PyTorch core's _load_from_state_dict() approach to maintain BC.
* Extends RPNHead to support heavier heads.
* Adds an experimental private _box_loss utility which allows training models with different box losses. There are no plans to make this public; the entire API is kept private so that we can review our Detection API as a whole and examine the best way to support different Transforms and Losses in Detection models.
* Adds 3 new variants for RetinaNet and *RCNN based on follow-up papers.
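
One of the box losses referenced by the commits in this PR is gIoU ("Adding gIoU support to retinanet"). The following is only a minimal plain-Python sketch of the standard generalized IoU loss for a single pair of boxes. It is not the private _box_loss utility, whose interface is not shown here; the function name `giou_loss`, the `(x1, y1, x2, y2)` box layout and the `eps` parameter are assumptions made for illustration.

```python
def giou_loss(box_a, box_b, eps=1e-7):
    """Generalized IoU loss for two boxes given as (x1, y1, x2, y2).

    Hypothetical helper for illustration; assumes x1 <= x2 and y1 <= y2.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area (zero when the boxes do not overlap).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    # Union area and plain IoU.
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / (union + eps)

    # Area of the smallest axis-aligned box enclosing both inputs.
    enclose_w = max(ax2, bx2) - min(ax1, bx1)
    enclose_h = max(ay2, by2) - min(ay1, by1)
    enclose = enclose_w * enclose_h

    # gIoU subtracts the fraction of the enclosing box not covered by the union.
    giou = iou - (enclose - union) / (enclose + eps)
    return 1.0 - giou


same = giou_loss((0.0, 0.0, 1.0, 1.0), (0.0, 0.0, 1.0, 1.0))
disjoint = giou_loss((0.0, 0.0, 1.0, 1.0), (2.0, 0.0, 3.0, 1.0))
```

Unlike a plain IoU loss, the value for non-overlapping boxes still depends on how far apart they are, which is the usual motivation for using it as a regression loss.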

facebook-github-bot · 2022-02-19T11:38:32Z

💊 CI failures summary and remediations

As of commit 6488c41 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).


fmassa

LGTM!

I've left just one minor comment about naming; it's only for discussion and there is no need to act on it.

torchvision/models/detection/faster_rcnn.py

torchvision/models/detection/_utils.py

d4l3k · 2022-04-14T07:24:01Z

@datumbox I'm getting errors when trying to load a model using FPN as a frozen layer.

Traceback (most recent call last):
  File "/mnt/ext/openape/apedepth/train.py", line 85, in <module>
    model.load_state_dict(state_dict, strict=False)
  File "/home/rice/venvs/openape/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1566, in load_state_dict
    load(self)
  File "/home/rice/venvs/openape/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1564, in load
    load(child, prefix + name + '.')
  File "/home/rice/venvs/openape/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1564, in load
    load(child, prefix + name + '.')
  File "/home/rice/venvs/openape/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1564, in load
    load(child, prefix + name + '.')
  File "/home/rice/venvs/openape/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1560, in load
    module._load_from_state_dict(
  File "/home/rice/venvs/openape/lib/python3.10/site-packages/torchvision/ops/feature_pyramid_network.py", line 131, in _load_from_state_dict
    state_dict[new_key] = state_dict.pop(old_key)
KeyError: 'semantic.backbone.fpn.inner_blocks.0.weight'

This seems to be a regression since PT 1.11 stable and I can no longer save/load my model. Excluding the semantic weights when calling torch.load doesn't help either.

My model has

        self.semantic = freeze(models.detection.fasterrcnn_mobilenet_v3_large_fpn(pretrained=True))

d4l3k · 2022-04-14T07:44:41Z

Adding an `if old_key in state_dict` check in a few spots seems to fix it.

         version = local_metadata.get("version", None)

         if version is None or version < 2:
             for type in ["weight", "bias"]:
                 old_key = f"{prefix}conv.{type}"
                 new_key = f"{prefix}conv.0.0.{type}"
-                state_dict[new_key] = state_dict.pop(old_key)
+                if old_key in state_dict:
+                    state_dict[new_key] = state_dict.pop(old_key)
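
To make the failure mode concrete, here is a small sketch that uses plain dictionaries instead of real modules. It mirrors the key renaming in the snippet above (`conv.*` to `conv.0.0.*`); the function name `remap_legacy_keys`, the `guard` flag and the `head.` prefix are hypothetical and exist only for this illustration.

```python
def remap_legacy_keys(state_dict, prefix, version, guard=True):
    """Rename pre-v2 'conv.*' entries to the 'conv.0.0.*' layout, in place.

    Hypothetical stand-in for the renaming done inside _load_from_state_dict.
    """
    if version is None or version < 2:
        for kind in ("weight", "bias"):
            old_key = f"{prefix}conv.{kind}"
            new_key = f"{prefix}conv.0.0.{kind}"
            if guard and old_key not in state_dict:
                # Nothing to rename for this entry; leave the dict untouched.
                continue
            state_dict[new_key] = state_dict.pop(old_key)
    return state_dict


# A complete legacy checkpoint is renamed as intended.
legacy = {"head.conv.weight": [1.0], "head.conv.bias": [0.0]}
remapped = remap_legacy_keys(dict(legacy), "head.", version=None)

# A checkpoint that lacks the old keys (for example, filtered before loading)
# raises KeyError when the rename is unconditional.
partial = {"other.weight": [2.0]}
try:
    remap_legacy_keys(dict(partial), "head.", version=None, guard=False)
    unguarded_error = None
except KeyError as exc:
    unguarded_error = exc.args[0]

# With the membership check, the same checkpoint passes through unchanged.
guarded = remap_legacy_keys(dict(partial), "head.", version=None, guard=True)
```

The guarded variant avoids the exception, but it also skips the rename silently, so it cannot tell a deliberately partial checkpoint apart from one whose layout does not match what the loader expects.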

datumbox · 2022-04-19T07:14:54Z

@d4l3k Thanks for the heads up.

> This seems to be a regression since PT 1.11 stable and I can no longer save/load my model.

If this is true, it's probably not related to this PR. To reduce any confusion, could you please open a separate issue and provide a way to reproduce the problem?

> Adding an `if old_key in state_dict` check in a few spots seems to fix it.

This might indicate that the old structure was modified and is no longer compatible with v1. The proposed patch is not a solution; it is more likely to mask the issue. If you provide a way to reproduce the problem, we can help you investigate.

Summary:
* Use frozen BN only if pre-trained.
* Add LSJ and ability to from scratch training.
* Fixing formatter
* Adding `--opt` and `--norm-weight-decay` support in Detection.
* Fix error message
* Make ScaleJitter proportional.
* Adding more norm layers in split_normalization_params.
* Add FixedSizeCrop
* Temporary fix for fill values on PIL
* Fix the bug on fill.
* Add RandomShortestSize.
* Skip resize when an augmentation method is used.
* multiscale in [480, 800]
* Add missing star
* Add new RetinaNet variant.
* Add tests.
* Update expected file for old retina
* Fixing tests
* Add FrozenBN to retinav2
* Fix network initialization issues
* Adding BN support in MaskRCNNHeads and FPN
* Adding support of FasterRCNNHeads
* Introduce norm_layers in backbone utils.
* Bigger RPN head + 2x rcnn v2 models.
* Adding gIoU support to retinanet
* Fix assert
* Add back nesterov momentum
* Rename and extend `FastRCNNConvFCHead` to support arbitrary FCs
* Fix linter

(Note: this ignores all push blocking failures!)

Reviewed By: jdsgomes, NicolasHug

Differential Revision: D36095683

fbshipit-source-id: 9105524308694ac8830ed12ba40286bb75c4aa8d

datumbox added 2 commits February 18, 2022 21:35

Use frozen BN only if pre-trained.

0f6fa39

Add LSJ and ability to from scratch training.

7a94595

pytorch-bot bot added the ciflow/default label Feb 19, 2022

facebook-github-bot added the cla signed label Feb 19, 2022

datumbox changed the title ~~Enhance Detection Recipe~~ [WIP] Enhance Detection Recipe Feb 19, 2022

datumbox marked this pull request as draft February 19, 2022 11:38

Fixing formatter

89a5b9d

datumbox added enhancement module: reference scripts topic: object detection labels Feb 19, 2022

datumbox mentioned this pull request Feb 19, 2022

[RFC] Batteries Included - Phase 2 #5410

Closed

24 tasks

datumbox and others added 15 commits February 20, 2022 10:23

Merge branch 'main' into references/detection_recipe

20470c1

Merge branch 'main' into references/detection_recipe

22d7f47

Merge branch 'main' into references/detection_recipe

a0322dd

Merge branch 'main' into references/detection_recipe

2943182

Merge branch 'main' into references/detection_recipe

629e149

Merge branch 'main' into references/detection_recipe

53fbd71

Merge branch 'main' into references/detection_recipe

d3b8dad

Merge branch 'main' into references/detection_recipe

5aa97c3

Adding --opt and --norm-weight-decay support in Detection.

8537c48

Fix error message

f7f8e2f

Make ScaleJitter proportional.

ed2a24c

Merge branch 'main' into references/detection_recipe

bc7a8a9

Merge branch 'main' into references/detection_recipe

a1786bb

Merge branch 'main' into references/detection_recipe

6c12921

Adding more norm layers in split_normalization_params.

bcf0afc

datumbox force-pushed the references/detection_recipe branch from 2bc1e81 to bcf0afc March 8, 2022 00:04

Merge branch 'main' into references/detection_recipe

9c66a7c

datumbox and others added 3 commits March 30, 2022 12:54

Merge branch 'main' into references/detection_recipe

e5cbb97

Adding gIoU support to retinanet

592784d

Fix assert

2cff640

datumbox mentioned this pull request Mar 30, 2022

[RFC] Loss Functions in Torchvision #2980

Open

20 tasks

Merge branch 'main' into references/detection_recipe

a6f0ea7

datumbox mentioned this pull request Mar 31, 2022

Detection recipe enhancements #5715

Merged

datumbox and others added 2 commits April 1, 2022 08:30

Add back nesterov momentum

61412df

Merge branch 'main' into references/detection_recipe

99479ee

datumbox changed the title ~~[WIP] Enhance Detection Recipe~~ [WIP] Post-paper Detection Optimizations Apr 1, 2022

Merge branch 'main' into references/detection_recipe

08307ca

datumbox requested a review from fmassa April 1, 2022 11:24

datumbox marked this pull request as ready for review April 1, 2022 11:33

datumbox added 2 commits April 1, 2022 15:55

Merge branch 'main' into references/detection_recipe

a322dd2

Merge branch 'main' into references/detection_recipe

eb649e8

datumbox mentioned this pull request Apr 2, 2022

add GN and GIoU loss for retinanet #4932

Closed

fmassa approved these changes Apr 4, 2022


datumbox added 2 commits April 4, 2022 10:33

Rename and extend FastRCNNConvFCHead to support arbitrary FCs

24b8643

Fix linter

6488c41

datumbox changed the title ~~[WIP] Post-paper Detection Optimizations~~ Post-paper Detection Optimizations Apr 5, 2022

Merge branch 'main' into references/detection_recipe

00e182a

datumbox merged commit 08cc9a7 into pytorch:main Apr 5, 2022

datumbox deleted the references/detection_recipe branch April 5, 2022 17:49

d4l3k mentioned this pull request Apr 19, 2022

torchvision models no longer respect strict=False when loading #5835

Closed

datumbox mentioned this pull request Apr 28, 2022

Are new models planned to be added? #2707

Open

37 tasks

GENZITSU mentioned this pull request Jun 29, 2022

weekly useful materials - 07/05 - GENZITSU/UsefulMaterials#106

Open
