Adding chat history to RAG app and refactor to better utilize LangChain #648

alpha-amundson · 2024-05-03T23:12:04Z

See commit log for full description. tl;dr: added chat history to rag-frontend app.

…eep track of and retrieve chat history from Cloud SQL. main.py - removed old langchain and logic to retrieve context. replaced with new chain from rag_chain.py. Introduced browser session with 30 minute ttl. Storing session ID in the session cookie. Session ID is then used to retrieve chat history. Chat history is cleared when timeout is reached. cloud_sql.py - now includes a method to create a PostgresEngine for storing and retrieving history, plus a CustomVectorStore to perform the query embedding and vector search. Old code paths no longer needed were removed. rag_chain.py - contains helper method create_chain to create, update and delete the end-to-end RAG chain with history. various tf files: increased max input and total tokens on HF TGI for mistral. threadded through some parameters needed to instantiate the PostgresEngine. requirements.txt - added some dependencies needed for langchain

applications/rag/frontend/container/main.py

imreddy13 · 2024-05-03T23:43:56Z

/gcbrun

Reverted breaking change to env var

alpha-amundson · 2024-05-06T22:33:59Z

/gcbrun

alpha-amundson · 2024-05-06T23:30:49Z

/gcbrun

* Working on improvements for rag application: - Working on missing TODO - Fixing issue with credentials - Refactoring vector_storages so you can add different vector storages TODO: Vector Storage factory - Unit test will be added on future PR * Updating changes with db * refactoring app so can be executed using gunicorn * refactory of the code as flask application package * Fixing Bugs - Reviewing issue with IPtypes, currently the fix is to validate if there's an development environment so a public cloud_sql instance can be use. - Fixing issue with Flask App Factory

german-grandas · 2024-07-12T20:52:09Z

/gcbrun

* Working on improvements for rag application: - Working on missing TODO - Fixing issue with credentials - Refactoring vector_storages so you can add different vector storages TODO: Vector Storage factory - Unit test will be added on future PR * Updating changes with db * refactoring app so can be executed using gunicorn * refactory of the code as flask application package * Fixing Bugs - Reviewing issue with IPtypes, currently the fix is to validate if there's an development environment so a public cloud_sql instance can be use. - Fixing issue with Flask App Factory * Working on Custom HuggingFace interface - Adding a custom chat model to send request to HuggingFace TGI API - Applying formatting to code.

applications/rag/frontend/container/main.py

* Working on improvements for rag application: - Working on missing TODO - Fixing issue with credentials - Refactoring vector_storages so you can add different vector storages TODO: Vector Storage factory - Unit test will be added on future PR * Updating changes with db * refactoring app so can be executed using gunicorn * refactory of the code as flask application package * Fixing Bugs - Reviewing issue with IPtypes, currently the fix is to validate if there's an development environment so a public cloud_sql instance can be use. - Fixing issue with Flask App Factory * Working on Custom HuggingFace interface - Adding a custom chat model to send request to HuggingFace TGI API - Applying formatting to code. * Improving the CloudSQL vector vector_storage

applications/rag/frontend/container/main.py

main.py

german-grandas · 2024-07-31T16:18:22Z

/gcbrun

german-grandas · 2024-08-01T14:55:43Z

/gcbrun

german-grandas · 2024-08-01T16:48:53Z

/gcbrun

german-grandas · 2024-08-09T16:05:12Z

Some prompt answer examples using meta-llama/Llama-2-7b-hf

Some prompt answer examples using meta-llama/Llama-2-7b-chat-hf

gongmax · 2024-08-15T17:35:06Z

/gcbrun

applications/rag/frontend/container/application/__init__.py

applications/rag/frontend/container/application/models/vector_embeddings.py

gongmax · 2024-08-15T22:47:21Z

applications/rag/frontend/container/application/utils/cloud_sql_utils.py

+    level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s"
+)
+
+ENVIRONMENT = os.environ.get("ENVIRONMENT")


Is this an environment that flask set?

@gongmax it's just for local development purposes, the variable was added because an issue on line 66.

I mean where is the ENVIRONMENT env get set?

cloudbuild.yaml

applications/rag/frontend/container/application/rag_langchain/rag_chain.py

applications/rag/frontend/container/application/vector_storages/cloud_sql.py

gongmax · 2024-08-16T22:16:51Z

applications/rag/frontend/container/application/rag_langchain/rag_chain.py

+        )
+
+        chain = setup_and_retrieval | prompt | model
+        chain_with_history = RunnableWithMessageHistory(


Can you add some comments around how the chain.invoke works with this chain_with_history and user input? Especially around how setup_and_retrieval component works.

applications/rag/frontend/container/application/models/vector_embeddings.py

gongmax · 2024-08-16T22:29:49Z

Please also resolve the conflicts

first commit

…tor (#769) * Refactor: create module for workload identity service account Change-Id: I29e985e77a1ff2d5f4a8d9493c1e65907c89c100 * fix: add todo Change-Id: I3357e8f9dd16c7958dff0f0cf0f990fce980f474 --------- Co-authored-by: Gen Lu <genlu@google.com>

german-grandas · 2024-08-20T20:43:08Z

/gcbrun

gongmax · 2024-08-22T17:09:47Z

applications/rag/frontend/container/application/utils/cloud_sql_utils.py

+    level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s"
+)
+
+ENVIRONMENT = os.environ.get("ENVIRONMENT")


I mean where is the ENVIRONMENT env get set?

gongmax · 2024-08-22T17:13:20Z

cloudbuild.yaml

@@ -269,7 +269,7 @@ steps:
        kubectl exec -it -n rag-$SHORT_SHA-$_BUILD_ID jupyter-admin -c notebook -- jupyter nbconvert --to script /data/rag-kaggle-ray-sql-interactive.ipynb
        kubectl exec -it -n rag-$SHORT_SHA-$_BUILD_ID jupyter-admin -c notebook -- ipython /data/rag-kaggle-ray-sql-interactive.py

-        python3 ./applications/rag/tests/test_rag.py "http://127.0.0.1:8081/prompt"
+        # python3 ./applications/rag/tests/test_rag.py "http://127.0.0.1:8081/prompt" Ignoring while the test approach is reviewed


Can we add this back to ensure the e2e test pass?

gongmax · 2024-08-22T17:30:27Z

applications/rag/frontend/main.tf

@@ -23,6 +23,14 @@ locals {
  })
 }

+resource "random_string" "application_secret_key" {
+  length  = var.project_id


length should be a number, and CI complains on this line:

Error: Incorrect attribute value type on frontend/main.tf line 27, in resource "random_string" "application_secret_key": 27: length = var.project_id ├──────────────── │ var.project_id is "gke-ai-eco-dev" Inappropriate value for attribute "length": a number is required.

alpha-amundson mentioned this pull request May 3, 2024

Adding chat history to RAG frontend app #586

Closed

github-advanced-security bot found potential problems May 3, 2024

View reviewed changes

applications/rag/frontend/container/main.py Fixed Show fixed Hide fixed

alpha-amundson requested a review from imreddy13 May 3, 2024 23:15

alpha-amundson changed the title ~~Also introduced a basic session history mechanism in the browser to k…~~ Adding chat history to RAG app and refactor to better utilize LangChain May 3, 2024

tflint formatting fixes

5cc85b9

nstogner and others added 3 commits May 6, 2024 14:55

TPU Provisioner: JobSet related fixes (#645)

6898666

Updated image to use code in this branch

1d6c052

Reverted breaking change to env var

making tflint happy

981e777

github-advanced-security bot found potential problems Jul 22, 2024

View reviewed changes

applications/rag/frontend/container/main.py Dismissed Show dismissed Hide dismissed

github-advanced-security bot found potential problems Jul 29, 2024

View reviewed changes

applications/rag/frontend/container/main.py Dismissed Show dismissed Hide dismissed

Fixing issues and updating chat history on frontend

e750d12

github-advanced-security bot found potential problems Jul 31, 2024

View reviewed changes

main.py Fixed Show fixed Hide fixed

main.py Fixed Show fixed Hide fixed

Fixing files on working tree

a000c46

Ignoring test rag, to review how the rag application is working

0d853ea

ignoring unit test to review cloud build process

386c437

german-grandas added 2 commits August 6, 2024 12:01

refactoring cloud sql connection helper

be1839d

Merge branch 'main' into rag-langchain-chat-history

7f081ff

german-grandas requested a review from roberthbailey August 9, 2024 23:07

german-grandas requested a review from gongmax August 9, 2024 23:07

gongmax reviewed Aug 16, 2024

View reviewed changes

Bslabe123 and others added 9 commits August 20, 2024 15:11

Change TPU Metrics Source for Autoscaling (#770)

35f67e4

first commit

updating branch

48f655b

fixing conflicts with remote branch

a9895d6

fixing conflicts with remote branch

cd95c98

fixing conflicts with remote branch

bc8d745

fixing conflicts applying rebase

e9beeef

Updating files based on reviewer comments

eb9ab02

reverting change on cloudbuild.yaml file

dff8d94

german-grandas requested a review from gongmax August 20, 2024 20:43

gongmax reviewed Aug 22, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding chat history to RAG app and refactor to better utilize LangChain #648

Adding chat history to RAG app and refactor to better utilize LangChain #648

alpha-amundson commented May 3, 2024

imreddy13 commented May 3, 2024

alpha-amundson commented May 6, 2024

alpha-amundson commented May 6, 2024

german-grandas commented Jul 12, 2024

german-grandas commented Jul 31, 2024

german-grandas commented Aug 1, 2024

german-grandas commented Aug 1, 2024

german-grandas commented Aug 9, 2024

gongmax commented Aug 15, 2024

gongmax Aug 15, 2024

german-grandas Aug 20, 2024

gongmax Aug 22, 2024

gongmax Aug 16, 2024

gongmax commented Aug 16, 2024

german-grandas commented Aug 20, 2024

gongmax Aug 22, 2024

gongmax Aug 22, 2024

gongmax Aug 22, 2024

Adding chat history to RAG app and refactor to better utilize LangChain #648

Are you sure you want to change the base?

Adding chat history to RAG app and refactor to better utilize LangChain #648

Conversation

alpha-amundson commented May 3, 2024

imreddy13 commented May 3, 2024

alpha-amundson commented May 6, 2024

alpha-amundson commented May 6, 2024

german-grandas commented Jul 12, 2024

german-grandas commented Jul 31, 2024

german-grandas commented Aug 1, 2024

german-grandas commented Aug 1, 2024

german-grandas commented Aug 9, 2024

gongmax commented Aug 15, 2024

gongmax Aug 15, 2024

Choose a reason for hiding this comment

german-grandas Aug 20, 2024

Choose a reason for hiding this comment

gongmax Aug 22, 2024

Choose a reason for hiding this comment

gongmax Aug 16, 2024

Choose a reason for hiding this comment

gongmax commented Aug 16, 2024

german-grandas commented Aug 20, 2024

gongmax Aug 22, 2024

Choose a reason for hiding this comment

gongmax Aug 22, 2024

Choose a reason for hiding this comment

gongmax Aug 22, 2024

Choose a reason for hiding this comment