This document lists the steps to reproduce the results of the PyTorch BERT distillation examples. For the original BERT documentation, please refer to BERT README and README.
Python 3.6 or a higher version is recommended. Install the required packages with:
pip install -r requirements.txt
Below are example NLP tasks for distilling a task-specific fine-tuned large model into a smaller model.
Distillation requires a pre-trained task-specific teacher model, such as csarron/bert-base-uncased-squad-v1
from the Hugging Face Hub.
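As a quick check that the teacher checkpoint is accessible, it can be loaded with the transformers library. This snippet is only an illustration; the example script handles model loading itself.

from transformers import AutoModelForQuestionAnswering, AutoTokenizer

# Teacher: BERT-base fine-tuned on SQuAD v1; student: DistilBERT to be distilled.
teacher = AutoModelForQuestionAnswering.from_pretrained("csarron/bert-base-uncased-squad-v1")
student = AutoModelForQuestionAnswering.from_pretrained("distilbert-base-uncased")
tokenizer = AutoTokenizer.from_pretrained("csarron/bert-base-uncased-squad-v1")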
The distillation configuration is specified in a YAML file, i.e. distillation.yaml. Run distillation with the following command:
python run_qa_no_trainer_distillation.py \
--dataset_name squad --model_name_or_path distilbert-base-uncased \
--teacher_model_name_or_path csarron/bert-base-uncased-squad-v1 --do_distillation \
--learning_rate 1e-5 --num_train_epochs 4 --output_dir /path/to/output_dir \
--loss_weights 0 1 --temperature 2 --run_teacher_logits \
--pad_to_max_length --seed 5143
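For reference, the --temperature and --loss_weights flags control a standard knowledge-distillation objective that combines the hard-label loss with a soft-label term computed from the teacher's logits. The sketch below illustrates the general idea only; it is not the script's actual implementation, and the function and variable names are illustrative.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      loss_weights=(0.0, 1.0), temperature=2.0):
    # Hard-label term: ordinary cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    # Soft-label term: KL divergence between temperature-softened distributions,
    # scaled by temperature**2 to keep gradient magnitudes comparable.
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    # --loss_weights 0 1 corresponds to using only the soft (teacher) term.
    return loss_weights[0] * hard_loss + loss_weights[1] * soft_loss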