PDF Query RAG-Based LLM APP

License: (MIT), Copyright (C) 2024, Author Phil Chen
- This is a example application the author of this repository is not liable for damages or losses arising from your use or inability to use the code.

Description

This repo provides an example of a Generative AI PDF query web application that uses a retrieval-augmented generation (RAG) process architecture. The PDF query web application will go through your PDF’s and provide you responses to your questions based on what is in your PDF’s. The importance of using RAG is the ability to scope the results of the generated response from the LLM in our case Claude 3.5 Sonnet with up-to-date, accurate, reliable responses. RAG allows for domain-specific contextually relevant responses tailored to your data rather than static training data.

The PDF query web application will leverage Facebook AI Similarity Search (FAISS) and Amazon Titan Embeddings to create vector representations of unstructured text and the storage/search of those embeddings. LangChain is utilized for the prompt template guiding the models response, RetrievalQA for pertinent data, and various PDF processing tools. We will use Amazon Bedrock to access Claude 3.5 Sonnet and Amazon Titan Embeddings.

Prerequisites for macOS laptop local setup

Amazon Web Services Account
Enable Amazon Bedrock Access (Specifically Amazon Titan Embeddings and Claude 3.5 Sonnet) see: Manage access to Amazon Bedrock foundation models
AWS CLI installed
AWS CLI IAM user with Full Amazon Bedrock Access
Verified on Python 3.10, 3.11, 3.12
Anaconda or Miniconda installed
AWS Default Region is set to us-east-1 you can change the region in the pdf_query_rag_llm_app.py file under region_name='us-east-1'

Prerequisites for EC2 Ubuntu instance setup

Amazon Web Services Account
Enable Amazon Bedrock Access (Specifically Amazon Titan Embeddings and Claude 3.5 Sonnet) see: Manage access to Amazon Bedrock foundation models
EC2 Instance Role with AmazonBedrockFullAccess Policy Attached (note you can make this more secure by making a custom policy)
Verified on EC2 Instance Ubuntu 22.04 and Ubuntu 24.04
Verified on Python 3.10, 3.11, 3.12
Virtualenv
AWS Default Region is set to us-east-1 you can change the region in the pdf_query_rag_llm_app.py file under region_name='us-east-1'

AWS Resource Cost

As with most AWS services you will incur costs for usage.

Pricing:
- https://aws.amazon.com/bedrock/pricing/
- https://aws.amazon.com/ec2/pricing/on-demand/

macOS laptop local setup

conda create -n "pdf-query-rag-llm-app" python=3.11.0

git clone git@github.com:nethacker/pdf-query-rag-llm-app.git

cd pdf-query-rag-llm-app

pip install -r requirements.txt

Run macOS laptop local setup

To run text PDF Query RAG LLM Application

streamlit run pdf_query_rag_llm_app.py

You can reach the app at http://localhost:8501/. Please put your PDF's that you want to query in the data directory and click New Data Update before querying.

EC2 Ubuntu instance setup steps

(This example assumes you have a ubuntu user with /home/ubuntu)

Install some dependencies

sudo apt -y update

sudo apt -y install build-essential openssl

sudo apt -y install libpq-dev libssl-dev libffi-dev zlib1g-dev

sudo apt -y install python3-pip python3-dev

sudo apt -y install nginx

sudo apt -y install virtualenvwrapper

Clone the GIT repository

cd /home/ubuntu

git clone https://github.com/nethacker/pdf-query-rag-llm-app.git

Setup the Python environment

virtualenv pdf-query-rag-llm-app_env

source pdf-query-rag-llm-app_env/bin/activate

Install the PDF Query RAG LLM application package dependencies

cd /home/ubuntu/pdf-query-rag-llm-app

pip install -r requirements.txt

Setup systemd to daemonize and bootstrap the PDF Query RAG-Based LLM application (Port 8080)

sudo cp systemd/pdf-query-rag-llm-app.service /etc/systemd/system/

sudo systemctl start pdf-query-rag-llm-app

sudo systemctl enable pdf-query-rag-llm-app.service

Install NGINX to help scale and handle connections (Port 80)

sudo cp nginx/nginx_pdf-query-rag-llm-app.conf /etc/nginx/sites-available/nginx_pdf-query-rag-llm-app.conf

sudo rm /etc/nginx/sites-enabled/default

sudo ln -s /etc/nginx/sites-available/nginx_pdf-query-rag-llm-app.conf /etc/nginx/sites-enabled

sudo systemctl restart nginx

You can reach the app at http://{yourhost}. Please put your PDF's that you want to query in the data directory on the instance and click New Data Update before querying.

Notes

Make sure to open up port 80 in your EC2 Security Group associated to the instance.
For HTTPS (TLS) you can use AWS ALB or AWS CloudFront
Depending on how many PDF’s you have, how big the PDF’s are, and your CPU specifications using the New Data Update button can take awhile as it builds your vector embeddings.
Any time you add PDF’s or change them make sure to click “New Data Update” to update/build your vector embeddings.
This application does not take into consideration security controls, that is your responsibility.
Please read Amazon Bedrock FAQ's for general questions about AWS LLM resources used.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github/workflows		.github/workflows
data		data
nginx		nginx
systemd		systemd
.gitignore		.gitignore
.pylintrc		.pylintrc
LICENSE		LICENSE
README.md		README.md
flow-diagram.svg		flow-diagram.svg
pdf_query_rag_llm_app.py		pdf_query_rag_llm_app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF Query RAG-Based LLM APP

Description

Prerequisites for macOS laptop local setup

Prerequisites for EC2 Ubuntu instance setup

AWS Resource Cost

macOS laptop local setup

Run macOS laptop local setup

EC2 Ubuntu instance setup steps

Install some dependencies

Clone the GIT repository

Setup the Python environment

Install the PDF Query RAG LLM application package dependencies

Setup systemd to daemonize and bootstrap the PDF Query RAG-Based LLM application (Port 8080)

Install NGINX to help scale and handle connections (Port 80)

Notes

About

Releases

Packages

Languages

License

nethacker/pdf-query-rag-llm-app

Folders and files

Latest commit

History

Repository files navigation

PDF Query RAG-Based LLM APP

Description

Prerequisites for macOS laptop local setup

Prerequisites for EC2 Ubuntu instance setup

AWS Resource Cost

macOS laptop local setup

Run macOS laptop local setup

EC2 Ubuntu instance setup steps

Install some dependencies

Clone the GIT repository

Setup the Python environment

Install the PDF Query RAG LLM application package dependencies

Setup systemd to daemonize and bootstrap the PDF Query RAG-Based LLM application (Port 8080)

Install NGINX to help scale and handle connections (Port 80)

Notes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages