Support Use of Joint and GDumb with Pre-Trained Models #362

wistuba · 2023-08-04T11:48:01Z

Joint and Gdumb reset the models on_model_update_start. This results in a non-desired behavior when working with pre-trained models.
There is also no option to not reset the model at all.

The change introduces a new flag reset which allows to control whether the model will be reset. Furthermore, in case of a reset, the pre-trained model will be reloaded instead of using an untrained model.

The integration tests are expected to break for Joint and GDumb due to a different initialization of the model. Before, the workflow was creating the model and resetting it. Now, it only creates the model without reset.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

prabhuteja12

Documenting an alternative to this approach:

The resetting is method + network specific. Thus an alternative is to define a reset_parameters for RenateBenchmarkingModule and overload them only for specific modules (ViT, Text transformers etc). The current on_model_training_start would invoke the this reset_parameters to reset it.

github-actions · 2023-08-04T13:03:53Z

Coverage report

The coverage rate went from 85.68% to 84.95% ⬇️

0% of new lines are covered.

Diff Coverage details (click to unfold)

src/renate/cli/parsing_functions.py

0% of new lines are covered (79.2% of the complete file).
Missing lines: 457, 462

wistuba added 4 commits August 4, 2023 13:43

no reset for gdumb and joint

3dbfe1c

update gdumb numbers

369a2b9

first number fixed for joint

fe3a371

2nd number updated

e373fc0

wistuba requested a review from prabhuteja12 August 4, 2023 12:04

wistuba assigned prabhuteja12 Aug 4, 2023

wistuba added 2 commits August 4, 2023 14:38

remove unit test checking for successful reset

3f3d112

cleanup test and flake

1ac924e

prabhuteja12 reviewed Aug 4, 2023

View reviewed changes

prabhuteja12 approved these changes Aug 4, 2023

View reviewed changes

wistuba merged commit c0612c0 into dev Aug 4, 2023

wistuba deleted the mw-joint-no-reset branch August 4, 2023 17:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Use of Joint and GDumb with Pre-Trained Models #362

Support Use of Joint and GDumb with Pre-Trained Models #362

wistuba commented Aug 4, 2023 •

edited

Loading

prabhuteja12 left a comment

github-actions bot commented Aug 4, 2023

src/renate/cli/parsing_functions.py

Support Use of Joint and GDumb with Pre-Trained Models #362

Support Use of Joint and GDumb with Pre-Trained Models #362

Conversation

wistuba commented Aug 4, 2023 • edited Loading

prabhuteja12 left a comment

Choose a reason for hiding this comment

github-actions bot commented Aug 4, 2023

Coverage report

src/renate/cli/parsing_functions.py

wistuba commented Aug 4, 2023 •

edited

Loading