We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi,
For the warm-up step, I see a regular dense retrieval model training on the triples.small data provided by MSMarco.
But I don't find any code introducing bm25 index and bm25 sampling. I guess you are treating triples.small data's negatives as bm25 negs already?
What does bm25 warm up mean? How is that introduced?
Thanks
The text was updated successfully, but these errors were encountered:
Hi, For the warm-up step, I see a regular dense retrieval model training on the triples.small data provided by MSMarco. But I don't find any code introducing bm25 index and bm25 sampling. I guess you are treating triples.small data's negatives as bm25 negs already? What does bm25 warm up mean? How is that introduced? Thanks
Yeah, I also can't find the BM25 index. Have you found the answer to it?
Sorry, something went wrong.
+1
I believe @tangzhy is correct (at least on MSMARCO), the triples.train.small.tsv were generated by the MSMARCO dataset itself, and they refer to generating the triplets using BM25 in the raw text of the README, hence why there's no reference to BM25 in this repo.
triples.train.small.tsv
No branches or pull requests
Hi,
For the warm-up step, I see a regular dense retrieval model training on the triples.small data provided by MSMarco.
But I don't find any code introducing bm25 index and bm25 sampling.
I guess you are treating triples.small data's negatives as bm25 negs already?
What does bm25 warm up mean? How is that introduced?
Thanks
The text was updated successfully, but these errors were encountered: