
Release of training code for QKConv #179

Open
robinsongh381 opened this issue Dec 28, 2022 · 11 comments

Comments

@robinsongh381

@christineaa Thanks for sharing this nice work!

Do you have any plans to release the training code?

@christineaa
Contributor

Thanks for your attention to this work!
We have no plan to release the training code.

@robinsongh381
Author

@christineaa I have one more question regarding the experiments in the QKConv paper.
When you trained and evaluated on the QReCC task, did you use the 14K conversations or the 80K question-answer pairs as the training dataset?

And similarly for the test dataset: did you use conversation-level or question-answer-level data?

Thank you!

@christineaa
Contributor

christineaa commented Jun 13, 2023

We used the question-answer pairs as the training/dev/test dataset, with 60.4K, 3.1K, and 16.4K samples respectively.

@dhx20150812

Hi, @christineaa. Thanks for your nice work.

I have one more question: how should I build the BM25 index for the QReCC task? I noticed you posted a link to the ml-qrecc repo. Should I download the web pages from both the Common Crawl and the Wayback Machine and then build the BM25 index?

@christineaa
Contributor

Thanks for your attention to this work!
Yes, you should download the web pages from both sources and follow the indexing instructions in the ml-qrecc repo.
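For anyone else landing here: below is a minimal, in-memory BM25 retrieval sketch to show what the index is used for, not the authors' pipeline. It assumes the passages have already been extracted into a JSONL file per the ml-qrecc instructions; the file name and the `id`/`contents` field names are hypothetical. The full ~54M-passage collection would realistically need a Lucene-based index (e.g., Anserini/Pyserini) as the ml-qrecc repo describes.

```python
# Minimal BM25 retrieval sketch (illustrative only, not the authors' setup).
# Assumes passages were extracted to JSONL following the ml-qrecc instructions;
# the path and field names below are hypothetical.
import json
from rank_bm25 import BM25Okapi  # pip install rank-bm25

passages, ids = [], []
with open("qrecc_passages.jsonl") as f:          # hypothetical path
    for line in f:
        rec = json.loads(line)
        ids.append(rec["id"])                    # hypothetical field name
        passages.append(rec["contents"])         # hypothetical field name

# Whitespace tokenization keeps the sketch simple; real setups use better analyzers.
tokenized = [p.lower().split() for p in passages]
bm25 = BM25Okapi(tokenized)

query = "who won the 2018 world cup"
scores = bm25.get_scores(query.lower().split())
top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:10]
print([(ids[i], round(float(scores[i]), 2)) for i in top])
```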

@dhx20150812

Thanks for your reply.

@robinsongh381
Author

@christineaa I have further questions regarding the training and evaluation of the QKConv model on the QReCC dataset.

  1. In your paper, the third footnote states "We remove conversations without truth responses." What does this mean? Did you apply this to the training dataset, the test dataset, or both? If available, please point me to the code for this processing, because I cannot find anything relevant in the QKConv inference or dataset code.

  2. For the QReCC evaluation in Table 2 of the QKConv paper, did you use the above "removed" version of qrecc-test.json or the plain qrecc-test.json? I mean the qrecc-test.json from here.

@robinsongh381
Author

Also, when you reported Table 2, did you exclude test examples which do not have gold knowledge?

@christineaa
Contributor

christineaa commented Sep 8, 2023

@robinsongh381 Thanks for your attention to this work!

  1. We remove samples whose "Truth_answer"/"Answer" field is an empty string, for both the training set (57,946 samples left) and the test set (15,024 samples left); see the sketch after this list.
  2. We use the "removed" version of the test set, as reflected in the evaluation code.
  3. We include samples without golden knowledge in Table 2, since the absence of golden knowledge does not affect the response generation evaluation; we exclude them only from the knowledge selection evaluation.
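A rough sketch of the filtering described in point 1, for other readers; this is not the authors' script, and the file names are placeholders. It assumes the QReCC files are JSON lists of turn dicts with a "Truth_answer" or "Answer" field, as mentioned above.

```python
# Sketch of the "remove samples with empty answers" step (illustrative only).
# File names are placeholders; field names follow the comment above.
import json

def filter_empty_answers(in_path, out_path, answer_keys=("Truth_answer", "Answer")):
    with open(in_path) as f:
        samples = json.load(f)                      # assumed: a JSON list of turn dicts
    # Keep a sample if any of the answer fields is a non-empty string.
    kept = [s for s in samples if any(s.get(k, "").strip() for k in answer_keys)]
    with open(out_path, "w") as f:
        json.dump(kept, f, ensure_ascii=False, indent=2)
    print(f"{in_path}: kept {len(kept)} of {len(samples)} samples")

filter_empty_answers("qrecc_train.json", "qrecc_train_filtered.json")
filter_empty_answers("qrecc_test.json", "qrecc_test_filtered.json")
```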

@robinsongh381
Author

@christineaa Thank you for your kind response.

I have a further question about point 3.

The absence of gold knowledge indicates that the essential piece of information does not exist within the knowledge pool, so a factually correct, knowledge-grounded response cannot be obtained.

For this reason, I have found that previous works on QReCC evaluation, such as DPR-IHN [1] and CONQRR [2], excluded such cases (i.e., examples without gold-knowledge annotation) in their evaluation.

What is your opinion on this?

Thank you

[1] Saving Dense Retriever from Shortcut Dependency in Conversational Search

[2] CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning

@christineaa
Contributor

@robinsongh381
The knowledge base for QReCC contains 54M passages, and, more often than not, there is knowledge relevant to the questions. We demonstrate how the model utilizes incorrectly retrieved knowledge in Table 5 and Table 6.

However, DPR-IHN and CONQRR excluding samples without golden knowledge is a different case: they report knowledge-selection Recall metrics as their main results, and Recall cannot be computed without golden knowledge.
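To make the last point concrete, here is a toy sketch (not from the paper or its code) of why recall-style knowledge-selection metrics are undefined when a sample has no gold passages, so such samples must be skipped for that metric while they can still count toward response generation.

```python
# Toy example of knowledge-selection Recall@k. Samples with no golden passages
# have to be excluded: the denominator (number of gold passages) would be zero.
def recall_at_k(retrieved_ids, gold_ids, k=10):
    if not gold_ids:                     # no golden knowledge -> metric undefined
        return None
    hits = len(set(retrieved_ids[:k]) & set(gold_ids))
    return hits / len(gold_ids)

# Hypothetical retrieval results: (retrieved passage ids, gold passage ids)
samples = [
    (["p3", "p7", "p1"], ["p1"]),        # gold present -> recall 1.0
    (["p2", "p5", "p9"], []),            # no gold knowledge -> skipped
]
scores = [s for s in (recall_at_k(r, g, k=3) for r, g in samples) if s is not None]
print(sum(scores) / len(scores))         # averages only the evaluable samples
```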
