Analyse Retriever performance #1

dasgoutam · 2024-03-20T07:55:32Z

In a standard RAG pipeline, the first 2 steps usually involve 'Storage' and 'Retrieval' -

Storage - Load data from a source --> split data into chunks --> create embeddings --> Store in a data store(Vector DB)
Retrieval - On a given query, retrieve chunks from the data store

The performance of the retrieval process should depend on the following parameters -

creating chunks in accordance to source data
type of embedding model used
retrieval mechanism used in the vector db

Using a single data source - 'muenchen-en', show results of how/whether performance varies on manipulating these parameters, and formulate an acceptable success criteria.

dasgoutam added the analysis Analyse/comparative study of features label Mar 20, 2024

dasgoutam self-assigned this Mar 20, 2024

svenseeberg added this to the Answer Retrieval milestone May 22, 2024

svenseeberg added the component:chat Chat Back End label Oct 4, 2024

svenseeberg modified the milestones: v3 Basic Answer Retrieval, v3.2 Improve LLM perfomance Oct 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Analyse Retriever performance #1

Analyse Retriever performance #1

dasgoutam commented Mar 20, 2024

Analyse Retriever performance #1

Analyse Retriever performance #1

Comments

dasgoutam commented Mar 20, 2024