Fix RT-DETR cache for generate_anchors #31671

qubvel · 2024-06-27T19:41:45Z

What does this PR do?

For the RT-DETR model:

Fix lru_cache for the generate_anchors method so that, even with dynamic anchor generation, anchors are generated only once for each input image size.
Fix the anchors dtype: always generate anchors in float32, then convert them to the desired dtype. Otherwise, results may differ between dynamic and static anchor inference in float16/bfloat16 (a test is added).
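The caching behaviour described above can be sketched without any framework. This is a minimal illustration, not the code from this PR: `generate_anchors_sketch`, `compute_calls`, and the anchor layout are hypothetical, and the only assumption carried over from the description is that results are cached per set of spatial shapes. It relies on the documented behaviour of `functools.lru_cache`, which builds its cache key from the call arguments and therefore needs them to be hashable.

```python
from functools import lru_cache

compute_calls = 0  # illustration only: counts uncached computations


@lru_cache(maxsize=32)
def generate_anchors_sketch(spatial_shapes, grid_size=0.05):
    """Hypothetical, framework-free stand-in for an anchor generator.

    `spatial_shapes` must be hashable, e.g. a tuple of (height, width) tuples,
    because lru_cache builds its key from the call arguments.
    """
    global compute_calls
    compute_calls += 1
    anchors = []
    for level, (height, width) in enumerate(spatial_shapes):
        box_size = grid_size * (2.0 ** level)
        for y in range(height):
            for x in range(width):
                # Normalized cell center plus a level-dependent box size.
                anchors.append(((x + 0.5) / width, (y + 0.5) / height, box_size, box_size))
    return tuple(anchors)


shapes = ((2, 2), (1, 1))
first = generate_anchors_sketch(shapes)
second = generate_anchors_sketch(shapes)  # same key: served from the cache
```

Because the target dtype and device are not part of the cache key in this sketch, any conversion would happen after the cached lookup, so one cached result can serve every precision. Passing a list instead of a tuple raises `TypeError`, since lists are unhashable.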

Who can review?

HuggingFaceDocBuilderDev · 2024-06-27T20:04:51Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

SangbumChoi

Thanks for handling this!
(Did you also fine-tune RT-DETR with bfloat16? Is it okay?)

qubvel · 2024-06-28T09:23:04Z

Did you also fine-tune RT-DETR with bfloat16? Is it okay?

No, I didn't. I tried bfloat16 only for inference 🙂

amyeroberts

Thanks for fixing!

amyeroberts · 2024-06-28T10:47:25Z

src/transformers/models/rt_detr/modeling_rt_detr.py

@@ -1656,7 +1656,10 @@ def unfreeze_backbone(self):
            param.requires_grad_(True)

    @lru_cache(maxsize=32)
-    def generate_anchors(self, spatial_shapes=None, grid_size=0.05, dtype=torch.float32, device="cpu"):
+    def generate_anchors(self, spatial_shapes=None, grid_size=0.05):
+        # We always generate anchors in float32 to preserve original model code


Let's update this to reflect the true reason: preserving equivalence between dynamic and static anchor inference. We don't really care whether our code matches the original model's, just that equivalent logic is used.
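To illustrate why computing in full precision and casting once can give a different result from computing directly in a low-precision dtype, here is a small standard-library sketch. It is an assumption-laden illustration, not the model's code: the function names are hypothetical, half precision (float16) is emulated with the `struct` module's `"e"` format, and the logit transform is used only as an example of a multi-step computation.

```python
import math
import struct


def to_half(value):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", value))[0]


def logit_full_then_cast(x, width):
    # Compute in full (double) precision and round once at the end.
    center = (x + 0.5) / width
    return to_half(math.log(center / (1.0 - center)))


def logit_stepwise_half(x, width):
    # Round after every operation, as a low-precision computation would.
    center = to_half(to_half(x + 0.5) / to_half(width))
    complement = to_half(1.0 - center)
    ratio = to_half(center / complement)
    return to_half(math.log(ratio))


full = logit_full_then_cast(79, 80)
stepwise = logit_stepwise_half(79, 80)
```

For this input the two results are not equal, because the rounding error introduced at each intermediate step is amplified near the edge of the grid. Generating in one fixed precision and converting afterwards keeps both code paths on the same values.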

qubvel added 4 commits June 27, 2024 17:56

Fix cache and type conversion

0032193

Add test

32c515c

Fixup

8a684d5

nit

5a64e29

SangbumChoi approved these changes Jun 28, 2024

qubvel added the run-slow label Jun 28, 2024

qubvel added 4 commits June 28, 2024 08:52

[run slow] rt_detr

15b6c99

Fix test

99a228c

Fixup

a7342c8

[run slow] rt_detr

8b51afa

qubvel requested a review from amyeroberts June 28, 2024 09:33

qubvel mentioned this pull request Jul 1, 2024

Fix RT-DETR weights initialization #31724

Merged

amyeroberts approved these changes Jul 1, 2024

qubvel commented Jul 3, 2024

src/transformers/models/rt_detr/modeling_rt_detr.py (outdated)

Update src/transformers/models/rt_detr/modeling_rt_detr.py

a59e985

qubvel merged commit b975216 into huggingface:main Jul 3, 2024
21 checks passed
