[BUG] Loading state dict in a feature extraction network #2227

ioangatop · 2024-06-26T13:22:44Z

ioangatop
Jun 26, 2024

Describe the bug

Hi Ross! I'm facing a small issue with the features extractor, here are some details:

The function create_model supports the argument of checkpoint_path which allows to load custom model weights. However, when we want to load a model as feature extractor, the model is wrapped around the FeatureGetterNet class, and the loading fails as the keys do not much anymore; the FeatureGetterNet stores the model under self.model so in order to work, the state dict keys should have a prefix model., for example class_token -> model.class_token

Additionally, one workaround is to do the loading of the model after the initialisation, but this also fails as some networks, like vision transformer, prune some layers and thus the state_dict has extra keys

To Reproduce

from urllib import request

from timm.models import _helpers
import timm


# download weights
request.urlretrieve("https://dl.fbaipublicfiles.com/dino/dino_deitsmall16_pretrain/dino_deitsmall16_pretrain.pth", "dino_deitsmall16_pretrain.pth")

# build and load model -- works as expected
model = timm.create_model(
    model_name="vit_small_patch16_224",
    num_classes=0,
    checkpoint_path="dino_deitsmall16_pretrain.pth",
)

# RuntimeError: Error(s) in loading state_dict for FeatureGetterNet:
#   Missing key(s) in state_dict: "model.cls_token", "model.pos_embed", ...
#  Unexpected key(s) in state_dict: "cls_token", "pos_embed", ...
backbone = timm.create_model(
    model_name="vit_small_patch16_224",
    num_classes=0,
    features_only=True,
    checkpoint_path="dino_deitsmall16_pretrain.pth",
)

# RuntimeError: Error(s) in loading state_dict for VisionTransformer:
#   Unexpected key(s) in state_dict: "norm.weight", "norm.bias". 
backbone = timm.create_model(
    model_name="vit_small_patch16_224",
    num_classes=0,
    features_only=True,
)
_helpers.load_checkpoint(backbone.model, "dino_deitsmall16_pretrain.pth")

As always, thanks a lot 🙏

rwightman · 2024-06-27T17:54:21Z

rwightman
Jun 27, 2024
Maintainer

@ioangatop if you want classifier weights loaded into feature extraction wrapped models, you need to load weights as 'pretrained' so that they are loaded before the model is mutated.

See related discussion, should work with >= 0.9 timm version https://github.com/hugginface/pytorch-image-models/discussions/1941

Although, example in that discussion should be a bit differentl, use the 'overlay' arg as in the train script

pytorch-image-models/train.py

Lines 463 to 468 in d4ef0b4

    
           if args.pretrained_path: 
        
               # merge with pretrained_cfg of model, 'file' has priority over 'url' and 'hf_hub'. 
        
               factory_kwargs['pretrained_cfg_overlay'] = dict( 
        
                   file=args.pretrained_path, 
        
                   num_classes=-1,  # force head adaptation 
        
               )

The overlay dict is merged with the models normal pretrained_cfg, the pretrained_cfg arg fully overrides it.

Alternative to using the file key in the pretrained_cfg override dict, you can also use url to download from somewhere else, or hf_hub_id for a HF hub location.

0 replies

ioangatop · 2024-07-10T21:27:26Z

ioangatop
Jul 10, 2024
Author

@rwightman thanks for the response! from your reply, here is the complete solution that worked for me:

feature_extractor = timm.create_model(
  model_name="vit_small_patch16_224",
  pretrained=True,
  pretrained_cfg={
    "url": "https://dl.fbaipublicfiles.com/dino/dino_deitsmall16_pretrain/dino_deitsmall16_pretrain.pth",
    "num_classes": 0
  },
  out_indices=1,
  features_only=True,
)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Loading state dict in a feature extraction network #2227

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 2 comments

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

[BUG] Loading state dict in a feature extraction network #2227

ioangatop Jun 26, 2024

Replies: 2 comments

rwightman Jun 27, 2024 Maintainer

ioangatop Jul 10, 2024 Author

ioangatop
Jun 26, 2024

rwightman
Jun 27, 2024
Maintainer

ioangatop
Jul 10, 2024
Author