Fine-Tuning CNN models #3182

Answered by andsteing
IMvision12 asked this question in General
Jul 5, 2023 · 1 comment · 3 replies

Answer copied from google-research/vision_transformer#274:

When you fine-tune a ViT model at a different image size than it was pre-trained on, you'll need to adjust the position embeddings accordingly, because a different image size yields a different number of patches. Section 3.2 of the ViT paper proposes 2D interpolation of the pre-trained position embeddings.

This is supported in this codebase when loading a checkpoint:

https://github.com/google-research/vision_transformer/blob/297866ab49341257e6f657d7f1068164c8eaf338/vit_jax/checkpoint.py#L192-L201

This is done automatically when you call checkpoint.load_pretrained() and provide both init_params that expect a certain image size (e.g. 128 in your example), and load from a checkpoint that has weights that were tr…
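For illustration, here is a minimal sketch of the 2D interpolation idea: split off the class token, reshape the grid embeddings back to 2D, resize, and flatten again. This uses jax.image.resize for brevity; the function name and the assumption that the embedding layout is [1, 1 + grid**2, dim] with a leading class token are mine, and the actual vit_jax checkpoint code may differ in details.

```python
import numpy as np
import jax
import jax.numpy as jnp


def interpolate_posemb(posemb, new_grid_size):
    """2D-interpolate ViT position embeddings to a new grid size.

    Assumes posemb has shape [1, 1 + old_grid**2, dim], where the first
    token is the class-token embedding (kept unchanged).
    """
    cls_tok, grid = posemb[:, :1], posemb[:, 1:]
    old_grid_size = int(np.sqrt(grid.shape[1]))
    dim = grid.shape[-1]
    # Restore the 2D patch grid, resize it bilinearly, then flatten again.
    grid = grid.reshape(1, old_grid_size, old_grid_size, dim)
    grid = jax.image.resize(
        grid, (1, new_grid_size, new_grid_size, dim), method='bilinear')
    grid = grid.reshape(1, new_grid_size * new_grid_size, dim)
    return jnp.concatenate([cls_tok, grid], axis=1)
```

For example, going from 224px pre-training to 128px fine-tuning with 16px patches means resizing a 14×14 embedding grid down to 8×8.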

Answer selected by IMvision12