Make an eval script for SQuaD #285

chenmoneygithub · 2022-08-07T00:42:57Z

From a high level it is just a classification task, but there are some details to handle. The whole workflow can be described as:

Data
1. We can use the SQuaD dataset from tensorflow dataset: link.
2. Preferably just use V2?
Data preprocessing
1. Add [CLS] token at the start, and [SEP] token between context and answer.
2. From the given answer_start field, calculate the answer_end value, representing the index of start and end in the context.
3. Calculate the start and end index in the tokenized context.
4. Set the three labels: start token index, end token index, impossible (means impossible to find the answer).
Classification Head
1. Takes in the pretrained model output, and outputs 3 things: start token logits/prob (shape=[sequence_length, ]), end token logits/prob (shape=[sequence_length, ]), impossible logits/prob (shape=[1,]).
2. The structure should be one simple dense layer.
Training config
1. Optimizer: TBD
2. Learning rate: TBD
3. Should we use KerasTuner? TBD

The text was updated successfully, but these errors were encountered:

mattdangerw · 2022-08-08T14:28:10Z

Is this issue specifically for the BERT example?

chenmoneygithub · 2022-08-08T18:14:51Z

It's for general purpose, I am thinking we should have some components for eval purposes.

mattdangerw · 2022-08-08T20:15:18Z

It might be good to build the script specifically for BERT right now, and then shuffle of the components as we get a better understanding of what we need.

mattdangerw · 2022-09-02T18:58:22Z

Assigning this to @aflah02 as I think you are the one actively working on this.

aflah02 · 2022-09-02T18:59:56Z

yup thanks!

chenmoneygithub added type:Bug Something isn't working and removed type:Bug Something isn't working labels Aug 7, 2022

chenmoneygithub self-assigned this Aug 10, 2022

mattdangerw assigned aflah02 Sep 2, 2022

mattdangerw closed this as completed Oct 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make an eval script for SQuaD #285

Make an eval script for SQuaD #285

chenmoneygithub commented Aug 7, 2022

mattdangerw commented Aug 8, 2022 •

edited

Loading

chenmoneygithub commented Aug 8, 2022

mattdangerw commented Aug 8, 2022

mattdangerw commented Sep 2, 2022

aflah02 commented Sep 2, 2022

Make an eval script for SQuaD #285

Make an eval script for SQuaD #285

Comments

chenmoneygithub commented Aug 7, 2022

mattdangerw commented Aug 8, 2022 • edited Loading

chenmoneygithub commented Aug 8, 2022

mattdangerw commented Aug 8, 2022

mattdangerw commented Sep 2, 2022

aflah02 commented Sep 2, 2022

mattdangerw commented Aug 8, 2022 •

edited

Loading