GitHub - maylad31/multihop-rag

Connect with me on linkedin if you have an intersting project/common interests. https://www.linkedin.com/in/mayankladdha31/

How minor adjustments in query preprocessing and prompt refinement could enhance retrieval and final outcomes: MultiHop-RAG is a QA dataset to evaluate retrieval and reasoning across documents with metadata in the #RAG pipelines. It contains queries, with evidence for each query distributed across 2 to 4 documents. I first tried a simple retrieval. For inference_query type the results were not that bad. But for other query types(comparison and temporal), the results were quite poor. Then, I tried to see if we can improve for other query types by minor query preprocessing(trying to get more relevant chunks by breaking the query into relevant phrases) and by tweaking the prompt a bit.

I observed a notable improvement. While some responses were incorrect, the overall improvement from the previous version was significant. We can try to tweak the prompt, use a better model(gpt4), experimenting with different strategies (make better use of metadata, try different chunking methods), may be create a knowledge graph. My aim was not to get the best accuracy but to see whether minor adjustments in query preprocessing and prompt refinement could enhance retrieval and final outcomes. And it does.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
Multihop_rag.ipynb		Multihop_rag.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

License

maylad31/multihop-rag

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages