[ViT] Support fine-tuning with different image resolution #5025

yiwen-song · 2021-12-03T06:23:44Z

As discussed in #4594, we should be able to interpolate embeddings from one resolution to a different one when training ViT models.
This PR adds the support for it.

References: ClassyVision Implementation

Experiments:

Launching Command:

PYTHONPATH=$PYTHONPATH:`pwd` python -u ~/workspace/scripts/run_with_submitit.py --timeout 3000 --ngpus 8 --nodes 4 --partition train --model vit_b_16 --batch-size 16 --epochs 8 --opt sgd --lr 0.01 --wd 0 --lr-scheduler cosineannealinglr --amp --mixup-alpha 0.2 --auto-augment ra --data-path /datasets01_ontap/imagenet_full_size/061417/ --clip-grad-norm 1 --cutmix-alpha 1.0 --resume /checkpoints/sallysyw/experiments/8022/model_299.pth --train-crop-size 384 --val-crop-size 384

cc @datumbox

facebook-github-bot · 2021-12-03T06:23:52Z

💊 CI failures summary and remediations

As of commit 73093f0 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

datumbox

Thanks for the PR @sallysyw, let me know your thoughts.

torchvision/prototype/models/vision_transformer.py

datumbox

@sallysyw I pushed to your branch a change to fix the typing issues. The problem here is that the OrderedDict is not subscriptable. Using quotes will do the trick.

I'm approving to unblock your work, but it's important to follow up with another PR that adds some tests to cover the method.

torchvision/prototype/models/vision_transformer.py

github-actions · 2021-12-09T22:51:03Z

Hey @sallysyw!

You merged this PR, but no labels were added. The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

…5025) Summary: * add from_checkpoint method for vit * remove useless change * Making interpolate_embeddings a utility function * remove logging * fix type hint * fix return type check * ad retuurns in docsting & unify type hint * remove useless import * fix issue: 'type' object is not subscriptable * Fixing typing issues * Making interpolation mode configurable * formatting Reviewed By: prabhat00155 Differential Revision: D33253466 fbshipit-source-id: 79bf6855f2dcee3c2fef6c05c243a0dc8dfee25e Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

add from_checkpoint method for vit

b981d99

pytorch-probot bot added the ciflow/default label Dec 3, 2021

facebook-github-bot added the cla signed label Dec 3, 2021

remove useless change

2b0640c

datumbox reviewed Dec 6, 2021

View reviewed changes

torchvision/prototype/models/vision_transformer.py Outdated Show resolved Hide resolved

torchvision/prototype/models/vision_transformer.py Outdated Show resolved Hide resolved

yiwen-song requested a review from fmassa December 7, 2021 23:09

yiwen-song and others added 5 commits December 7, 2021 17:24

Merge branch 'pytorch:main' into checkpoint

eeaceb1

Making interpolate_embeddings a utility function

48843f5

remove logging

910ea10

fix type hint

3f9e06f

fix return type check

637fba8

datumbox reviewed Dec 8, 2021

View reviewed changes

torchvision/prototype/models/vision_transformer.py Outdated Show resolved Hide resolved

torchvision/prototype/models/vision_transformer.py Show resolved Hide resolved

torchvision/prototype/models/vision_transformer.py Outdated Show resolved Hide resolved

yiwen-song and others added 4 commits December 8, 2021 22:47

ad retuurns in docsting & unify type hint

e332730

remove useless import

808e017

fix issue: 'type' object is not subscriptable

6798f1c

Fixing typing issues

7df5492

datumbox approved these changes Dec 9, 2021

View reviewed changes

torchvision/prototype/models/vision_transformer.py Outdated Show resolved Hide resolved

yiwen-song and others added 3 commits December 9, 2021 21:37

Making interpolation mode configurable

a301187

formatting

117363d

Merge branch 'main' into checkpoint

73093f0

yiwen-song merged commit 1b14829 into pytorch:main Dec 9, 2021

yiwen-song deleted the checkpoint branch December 9, 2021 23:10

datumbox added enhancement module: models labels Dec 9, 2021

yiwen-song linked an issue Dec 10, 2021 that may be closed by this pull request

Adding Vision Transformer to torchvision/models #4593

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ViT] Support fine-tuning with different image resolution #5025

[ViT] Support fine-tuning with different image resolution #5025

yiwen-song commented Dec 3, 2021 •

edited by pytorch-probot bot

Loading

facebook-github-bot commented Dec 3, 2021 •

edited

Loading

datumbox left a comment

datumbox left a comment •

edited

Loading

github-actions bot commented Dec 9, 2021

[ViT] Support fine-tuning with different image resolution #5025

[ViT] Support fine-tuning with different image resolution #5025

Conversation

yiwen-song commented Dec 3, 2021 • edited by pytorch-probot bot Loading

facebook-github-bot commented Dec 3, 2021 • edited Loading

💊 CI failures summary and remediations

datumbox left a comment

Choose a reason for hiding this comment

datumbox left a comment • edited Loading

Choose a reason for hiding this comment

github-actions bot commented Dec 9, 2021

yiwen-song commented Dec 3, 2021 •

edited by pytorch-probot bot

Loading

facebook-github-bot commented Dec 3, 2021 •

edited

Loading

datumbox left a comment •

edited

Loading