
Add extra arguments to hubert pretrain factory functions #2345

Closed · wants to merge 2 commits

Conversation

@nateanl (Member) commented on Apr 22, 2022

In different pre-training and fine-tuning settings, the values of `mask_prob`, `mask_channel_prob`, and `mask_channel_length` differ. For example, the fairseq [pre-training](https://github.com/pytorch/fairseq/blob/main/examples/hubert/config/pretrain/hubert_base_librispeech.yaml#L70) and [fine-tuning](https://github.com/pytorch/fairseq/blob/main/examples/hubert/config/finetune/base_10h.yaml#L69-L73) configs use different values. The motivation is to avoid overfitting when fine-tuning on a small dataset (example: [fine-tuning on 10 minutes of audio](https://github.com/pytorch/fairseq/blob/main/examples/wav2vec/config/finetuning/vox_10m.yaml#L57-L59)).
This PR adds these arguments to the factory functions so that they can be tuned separately for pre-training and fine-tuning. `mask_length` is set to `10` by default in all cases, hence it is not included in the factory functions.
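A minimal sketch of how the new arguments might be passed, assuming the `hubert_pretrain_base` factory signature from this PR; the fine-tuning values below are illustrative, taken loosely from the fairseq `base_10h` config linked above, not defaults of the function:

```python
import torchaudio

# Pre-training: the defaults added by this PR already match the fairseq
# pre-training config (mask_prob=0.8, mask_channel_prob=0.0, mask_channel_length=10).
pretrain_model = torchaudio.models.hubert_pretrain_base()

# Fine-tuning on a small dataset: heavier channel masking to reduce overfitting.
# Illustrative values only, roughly following the fairseq base_10h config.
finetune_model = torchaudio.models.hubert_pretrain_base(
    mask_prob=0.65,
    mask_channel_prob=0.5,
    mask_channel_length=64,
)
```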

@facebook-github-bot (Contributor) commented:

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@carolineechen (Contributor) left a comment:

The factory functions here support the mask_channel_length parameter, unlike what the PR summary describes -- which one should it be?

torchaudio/models/wav2vec2/model.py (outdated, resolved)
@@ -1096,6 +1114,9 @@ def hubert_pretrain_xlarge(
encoder_ff_interm_dropout: float = 0.0,
encoder_dropout: float = 0.0,
encoder_layer_drop: float = 0.0,
mask_prob: float = 0.8,
mask_channel_prob: float = 0.0,
mask_channel_length: float = 10,
) -> HuBERTPretrainModel:
# Overriding the signature so that the return type is correct on Sphinx
"""hubert_pretrain_xlarge(encoder_projection_dropout: float = 0.0, encoder_attention_dropout: float = 0.0, encoder_ff_interm_dropout: float = 0.0, encoder_dropout: float = 0.0, encoder_layer_drop: float = 0.0) -> torchaudio.models.HuBERTPretrainModel
@carolineechen (Contributor) commented:

Same as above -- add the new params to the Sphinx signature override.
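The comment refers to the single-line signature string that the torchaudio docstring uses to override what Sphinx renders. A sketch of how that line might read once the new parameters are added (illustrative only; the exact merged text may differ):

```python
# Hypothetical updated Sphinx signature override for hubert_pretrain_xlarge;
# the merged docstring may differ in wording.
"""hubert_pretrain_xlarge(encoder_projection_dropout: float = 0.0, encoder_attention_dropout: float = 0.0, encoder_ff_interm_dropout: float = 0.0, encoder_dropout: float = 0.0, encoder_layer_drop: float = 0.0, mask_prob: float = 0.8, mask_channel_prob: float = 0.0, mask_channel_length: float = 10) -> torchaudio.models.HuBERTPretrainModel"""
```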

torchaudio/models/wav2vec2/model.py (resolved)
@nateanl (Member, Author) commented on Apr 22, 2022:

mask_channel_length is the one that is included. Since the mask_length value doesn't change between pre-training and fine-tuning, it is not included in the argument list.
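To make the distinction concrete, here is a rough, hypothetical simplification of the resulting factory-function shape (the real torchaudio factory builds and returns a HuBERTPretrainModel and forwards many more parameters):

```python
def hubert_pretrain_base_sketch(
    mask_prob: float = 0.8,
    mask_channel_prob: float = 0.0,
    mask_channel_length: float = 10,
) -> dict:
    # mask_length stays fixed at 10 and is deliberately not exposed, since it
    # does not change between the pre-training and fine-tuning recipes.
    return {
        "mask_length": 10,
        "mask_prob": mask_prob,
        "mask_channel_prob": mask_channel_prob,
        "mask_channel_length": mask_channel_length,
    }
```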

@facebook-github-bot (Contributor) commented:

@nateanl has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@carolineechen (Contributor) left a comment:

Looks good, thanks

xiaohui-zhang pushed a commit to xiaohui-zhang/audio that referenced this pull request May 4, 2022
Summary:
In different pre-training and fine-tuning settings, the `mask_prob`, `mask_channel_prob`, and `mask_channel_length` are different. For example, the settings in [pre-training](https://github.com/pytorch/fairseq/blob/main/examples/hubert/config/pretrain/hubert_base_librispeech.yaml#L70) and [fine-tuning](https://github.com/pytorch/fairseq/blob/main/examples/hubert/config/finetune/base_10h.yaml#L69-L73) are different. The motivation is to avoid overfitting when fine-tuning on a small dataset (example: [fine-tune on 10 minutes of audio](https://github.com/pytorch/fairseq/blob/main/examples/wav2vec/config/finetuning/vox_10m.yaml#L57-L59)).
This PR adds the required arguments in the factory functions to make them tunable for pre-training and fine-tuning. `mask_length` is set to `10` by default for all cases, hence it's not included in the factory function.

Pull Request resolved: pytorch#2345

Reviewed By: carolineechen, xiaohui-zhang

Differential Revision: D35845117

Pulled By: nateanl

fbshipit-source-id: 0cbb74d09535d189b8258aa8ee0f88779bdb77e7