Introduce random teacher layer sets #35

Merged: 2 commits merged into princeton-nlp:main on Nov 7, 2022

Conversation

zhangzhenyu13 (Contributor)

I find that a fixed set of teacher layers might not be a good choice for CoFi, so introducing random selection of the teacher layer set should make the method more robust.
See: [2109.10164] RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation (arxiv.org)
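
As a rough illustration of the RAIL-KD idea referenced above (not the code in this PR), here is a minimal sketch of random intermediate-layer mapping for distillation. It assumes a PyTorch setup, hypothetical lists of per-layer hidden states for a 12-layer teacher and a smaller student, and hidden sizes that already match.

import random
import torch

def random_layer_distill_loss(teacher_hiddens, student_hiddens):
    """Sketch of RAIL-KD-style layer distillation.

    At each step, randomly pick as many teacher layers as there are student
    layers, sort them to preserve layer order, and match hidden states
    layer-by-layer with MSE. Both arguments are lists of
    [batch, seq, hidden] tensors, one per layer (illustrative names).
    """
    sampled = sorted(random.sample(range(len(teacher_hiddens)), len(student_hiddens)))
    loss = sum(
        torch.nn.functional.mse_loss(student_hiddens[i], teacher_hiddens[t])
        for i, t in enumerate(sampled)
    )
    return loss / len(sampled)

Because a fresh teacher subset is drawn at every step, the student is not tied to one fixed layer mapping, which is the robustness argument made above.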

Commit: add more version inspired by rail-kd
@xiamengzhou merged commit 756f67b into princeton-nlp:main on Nov 7, 2022
if self.additional_args.layer_distill_version > 4:
    specified_teacher_layers = [i for i in range(12)]
    if self.additional_args.layer_distill_version == 5:
        # Randomly sample 4 of the 12 teacher layers and keep them in layer order.
        specified_teacher_layers = sorted(random.sample(specified_teacher_layers, 4))


The "random" module is not imported.

zhangzhenyu13 (Contributor, Author)

Oh, yes, you're right. Thanks. Anyone who uses this will need to import the random module.
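
For completeness, a minimal self-contained sketch of the selection logic above with the missing import in place. The function name and keyword parameters are illustrative; the values 12 and 4 mirror the hard-coded numbers in the reviewed snippet.

import random

def pick_specified_teacher_layers(layer_distill_version, num_teacher_layers=12, num_sampled=4):
    specified_teacher_layers = None
    if layer_distill_version > 4:
        # Versions above 4 start from all teacher layers (0..11 for a 12-layer teacher).
        specified_teacher_layers = list(range(num_teacher_layers))
        if layer_distill_version == 5:
            # Version 5: a fresh, sorted random subset of teacher layers per call.
            specified_teacher_layers = sorted(random.sample(specified_teacher_layers, num_sampled))
    return specified_teacher_layers

# Example: prints something like [0, 3, 7, 10]; a different sorted subset on each call.
print(pick_specified_teacher_layers(layer_distill_version=5))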
