Skip to content

Latest commit

 

History

History
57 lines (41 loc) · 1.84 KB

File metadata and controls

57 lines (41 loc) · 1.84 KB

Merging Models of Homogeneous Tasks

Data

3 datasets were used in this experiment:

Run

cd ./homogeneous_tasks

Training models

python -m training_script.cifar_resnet20

Evaluating merging methods

  • Evaluating the basic performance (i.e. the original models, ensemble of the original models)
python -m base_model_concept_merging --config-name=cifar50_resnet20
  • Evaluating the merging method
python -m mudsc_concept_merging --config-name=cifar50_resnet20 --suffix=$SUFFIX
  • Note that $SUFFIX can be:

    _avg: Direct average of the original models

    _act: An equivalent implementation of Zipit without partial zip. For the models without group structure (i.e. ViT, ResnetGN), we test them with the original Zipit. For the model with group structure (i.e. ViT, ResnetGN), we test them with our implementation.

    _act_useperm: Activation-based alignment (A. Align)

    _useperm: An equivalent implementation of Git Rebasin. For the models without group structure, we test them with a pytorch implement of Git Rebasin. For the model with group structure (i.e. ViT, ResnetGN), we test them with our implementation.

    "": Weight-based Zip(W. Zip)

    _act_iws_fs_useperm: Alignment-based MuDSC

    _act_iws_fs_useperm_train: Alignment-based MuDSC tested on train dataset (for searching balanced factor)

    _act_iws_fs: Zip-based MuDSC

    _act_iws_fs_train: Zip-based MuDSC tested on train dataset (for searching balanced factor)