This repository contains code for a project focused on Retrieval Augmented Generation using a Large Language Model (LLM). The project utilizes Python 3.8.19 and includes a Jupyter notebook with the following functionalities:
- Extracting data from a specified PDF file.
- Storing the extracted information in a Chroma DB Vector Store.
- Enforcing an LLM (GPT3.5 Turbo) to answer user queries based on relevant information from the Vector Store.
To install the required Python packages, follow these steps:
-
Make sure you have Python 3.8.19 installed. If not, you can download it from the official Python website.
-
Clone this repository to your local machine.
-
Navigate to the project directory in your terminal.
-
Install the required packages using pip:
pip install -r requirements.txt
This command will install all the necessary dependencies listed in the
requirements.txt
file. -
To download the Menu card on which I experimented, run:
mkdir data curl -O data/https://losteria.net/fileadmin/user_upload/losteria_ernaehrungsfibel_032022_interim_AT_website.pdf
Once the required packages are installed, you can explore the functionalities provided in the Jupyter notebook. Open the notebook using Jupyter Notebook or JupyterLab and execute the cells to interact with the project. There is also extensive instructions, information and references provided inside the Notebook to follow along.