How long to prepare the instruction data? #13

Xnhyacinth · 2025-01-09T03:30:04Z

Hi, @Hannibal046 How long did it take to perform this retrieval on all the training data?

Hannibal046 · 2025-01-09T03:42:30Z

Thanks for your interest in our work! For retrieval, we use a multi-vector retrieval model ColBERT-v2 to build index and search over wikipedia dump. The overall duration for indexing and searching takes about ten-plus hours.

Xnhyacinth · 2025-01-09T05:21:56Z

Sure, I used pyserini for retrieval and found that it requires 200+ hours. Can the processed data be provided directly via git lfs or some cloud service?

Xnhyacinth · 2025-01-09T07:04:31Z

Done. I use the server to speed up the process

Xnhyacinth closed this as completed Jan 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How long to prepare the instruction data? #13

How long to prepare the instruction data? #13

Xnhyacinth commented Jan 9, 2025

Hannibal046 commented Jan 9, 2025

Xnhyacinth commented Jan 9, 2025

Xnhyacinth commented Jan 9, 2025

How long to prepare the instruction data? #13

How long to prepare the instruction data? #13

Comments

Xnhyacinth commented Jan 9, 2025

Hannibal046 commented Jan 9, 2025

Xnhyacinth commented Jan 9, 2025

Xnhyacinth commented Jan 9, 2025