-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How shall I create the file "test.src"? #5
Comments
@mriganktiwari I guess i t's from the part preparing (tokenize and binarize) the inputs for QG model that corresponds to P(Q|Y). So I thought Correct me if I'm wrong =] |
@mriganktiwari I found that I need to tokenize and binarize I guess the annotation of P(Q|Y) was a good idea to avoid misleading the readers of what QAGS measuring, but, to reproduce, this leaves a question mark. Shouldn't it be P(Q|Y;A) ? @W4ngatang |
Have you figured out it? When reproducing this work, I was also confused by this problem |
Do you mean that the test_w_10ans.txt is the only file that need to be tokenized? |
Any new information on this one? I also dont know how to generate this file |
I don't really remember the details of the code but I succeeded reproducing it after writing the comment now I'm referring to. Feel sorry that I cannot share actual code (which is lost) I've run for the experiment. But if you want to reproduce it, the original author's code is worth reading if you already read the paper. It was a good starting point for me and took not too long to fill the gap. If you replace the generation models in this work with recent language models, it will definitely work better. IMHO, if I need to revisit this work, I wouldn't bother myself to train small models as the original work did but just adapt the instruct-tuned LLMs instead with some good instructions. @dlaredo @Zhou-Zoey |
How shall I create the file "test.src" at question generation step? And does it contain?
The text was updated successfully, but these errors were encountered: