-
Notifications
You must be signed in to change notification settings - Fork 145
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* graphRAG dataprep llama-index validated w openai endpoints Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com> * llama-index graphRAG retrieval validated with openai models Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * validated code usng TGI/TEI instead of openai Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * compose.yaml for dataprep validated with neo4j, TGI/TEI, openai Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * graphRAG retriever validated and full compose.yaml Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * minor fix Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * add graphRAGGateway WIP Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * graphragGateway working in E2E Example Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * fix schedule in orchestrator to support ChatCompletionRequest input Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * change default to TGI instead of openAI and add test code for neo4jretriever Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * test code for dataprep-neo4j microservice Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * improved READMES Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * update docker build path for tests Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * minor fix Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * fix typo in container name Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * resolve image name conflict for hub publishing Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * add tgi validation to miicroservice tests Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * rename test script to trigger cicd correctly w hpu Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * rename test containers for cicd Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * update HF_TOKEN in TGI/TEI test scripts Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * swaped larger model so the graph isnt empty in ci test Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * set 4 hpu for 70B model in ci test Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * add extra time for large model loading cicd Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * fix tgi gaudi shard args Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * switch to chat cause chat template is needed Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * enable logs in test Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * use locally downloaded model in CI machine Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * use local model path and reduce wait time Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * clear ports before ci run Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * fix cache model access Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * fix cache model access Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * incrased wait time for tgi shards ready Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * wait until tgi connected Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * switch back to small model for testing Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * minor readability fixes Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> * README fixes Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> --------- Signed-off-by: rbrugaro <rita.brugarolas.brufau@intel.com> Signed-off-by: Rita Brugarolas <rita.brugarolas.brufau@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
- Loading branch information
1 parent
00abba2
commit 19330ea
Showing
24 changed files
with
1,870 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -59,6 +59,7 @@ | |
VideoQnAGateway, | ||
VisualQnAGateway, | ||
MultimodalQnAGateway, | ||
GraphragGateway, | ||
AvatarChatbotGateway, | ||
) | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,39 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
FROM python:3.11-slim | ||
|
||
ENV LANG=C.UTF-8 | ||
|
||
ARG ARCH="cpu" | ||
|
||
RUN apt-get update -y && apt-get install -y --no-install-recommends --fix-missing \ | ||
build-essential \ | ||
default-jre \ | ||
libgl1-mesa-glx \ | ||
libjemalloc-dev \ | ||
vim | ||
|
||
RUN useradd -m -s /bin/bash user && \ | ||
mkdir -p /home/user && \ | ||
chown -R user /home/user/ | ||
|
||
USER user | ||
|
||
COPY comps /home/user/comps | ||
|
||
RUN pip install --no-cache-dir --upgrade pip setuptools && \ | ||
if [ ${ARCH} = "cpu" ]; then pip install --no-cache-dir torch torchvision --index-url https://download.pytorch.org/whl/cpu; fi && \ | ||
pip install --no-cache-dir -r /home/user/comps/dataprep/neo4j/llama_index/requirements.txt | ||
|
||
ENV PYTHONPATH=$PYTHONPATH:/home/user | ||
|
||
USER root | ||
|
||
RUN mkdir -p /home/user/comps/dataprep/neo4j/llama_index/uploaded_files && chown -R user /home/user/comps/dataprep/neo4j/llama_index/uploaded_files | ||
|
||
USER user | ||
|
||
WORKDIR /home/user/comps/dataprep/neo4j/llama_index | ||
|
||
ENTRYPOINT ["python", "extract_graph_neo4j.py"] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,94 @@ | ||
# Dataprep Microservice with Neo4J | ||
|
||
This dataprep microservice ingests the input files and uses LLM (TGI or OpenAI model when OPENAI_API_KEY is set) to extract entities, relationships and descriptions of those to build a graph-based text index. | ||
|
||
### Setup Environment Variables | ||
|
||
```bash | ||
# Manually set private environment settings | ||
export host_ip=${your_hostname IP} # local IP | ||
export no_proxy=$no_proxy,${host_ip} # important to add {host_ip} for containers communication | ||
export http_proxy=${your_http_proxy} | ||
export https_proxy=${your_http_proxy} | ||
export NEO4J_URI=${your_neo4j_url} | ||
export NEO4J_USERNAME=${your_neo4j_username} | ||
export NEO4J_PASSWORD=${your_neo4j_password} # should match what was used in NEO4J_AUTH when running the neo4j-apoc | ||
export PYTHONPATH=${path_to_comps} | ||
export OPENAI_KEY=${your_openai_api_key} # optional, when not provided will use smaller models TGI/TEI | ||
export HUGGINGFACEHUB_API_TOKEN=${your_hf_token} | ||
# set additional environment settings | ||
source ./set_env.sh | ||
``` | ||
|
||
## 🚀Start Microservice with Docker | ||
|
||
### 1. Build Docker Image | ||
|
||
```bash | ||
cd ../../../../ | ||
docker build -t opea/dataprep-neo4j-llamaindex:latest --build-arg no_proxy=$no_proxy --build-arg https_proxy=$https_proxy --build-arg http_proxy=$http_proxy -f comps/dataprep/neo4j/llama_index/Dockerfile . | ||
``` | ||
|
||
### 2. Setup Environment Variables | ||
|
||
```bash | ||
# Set private environment settings | ||
export host_ip=${your_hostname IP} # local IP | ||
export no_proxy=$no_proxy,${host_ip} # important to add {host_ip} for containers communication | ||
export http_proxy=${your_http_proxy} | ||
export https_proxy=${your_http_proxy} | ||
export NEO4J_URI=${your_neo4j_url} | ||
export NEO4J_USERNAME=${your_neo4j_username} | ||
export NEO4J_PASSWORD=${your_neo4j_password} | ||
export PYTHONPATH=${path_to_comps} | ||
export OPENAI_KEY=${your_openai_api_key} # optional, when not provided will use smaller models TGI/TEI | ||
export HUGGINGFACEHUB_API_TOKEN=${your_hf_token} | ||
# set additional environment settings | ||
source ./set_env.sh | ||
``` | ||
|
||
### 3. Run Docker with Docker Compose | ||
|
||
Docker compose will start 4 microservices: dataprep-neo4j-llamaindex, neo4j-apoc, tgi-gaudi-service and tei-embedding-service. The reason TGI and TEI are needed is because dataprep relies on LLM to extract entities and relationships from text to build the graph and Neo4j Property Graph Index. Neo4j database supports embeddings natively so we do not need a separate vector store. Checkout the blog [Introducing the Property Graph Index: A Powerful New Way to Build Knowledge Graphs with LLMs](https://www.llamaindex.ai/blog/introducing-the-property-graph-index-a-powerful-new-way-to-build-knowledge-graphs-with-llms) for a better understanding of Property Graph Store and Index. | ||
|
||
```bash | ||
cd comps/dataprep/neo4j/llama_index | ||
docker compose -f compose.yaml up -d | ||
``` | ||
|
||
## Invoke Microservice | ||
|
||
Once document preparation microservice for Neo4J is started, user can use below command to invoke the microservice to convert the document to embedding and save to the database. | ||
|
||
```bash | ||
curl -X POST \ | ||
-H "Content-Type: multipart/form-data" \ | ||
-F "files=@./file1.txt" \ | ||
http://${host_ip}:6004/v1/dataprep | ||
``` | ||
|
||
You can specify chunk_size and chunk_size by the following commands. | ||
|
||
```bash | ||
curl -X POST \ | ||
-H "Content-Type: multipart/form-data" \ | ||
-F "files=@./file1.txt" \ | ||
-F "chunk_size=1500" \ | ||
-F "chunk_overlap=100" \ | ||
http://${host_ip}:6004/v1/dataprep | ||
``` | ||
|
||
We support table extraction from pdf documents. You can specify process_table and table_strategy by the following commands. "table_strategy" refers to the strategies to understand tables for table retrieval. As the setting progresses from "fast" to "hq" to "llm," the focus shifts towards deeper table understanding at the expense of processing speed. The default strategy is "fast". | ||
|
||
Note: If you specify "table_strategy=llm" TGI service will be used. | ||
|
||
For ensure the quality and comprehensiveness of the extracted entities, we recommend to use `gpt-4o` as the default model for parsing the document. To enable the openai service, please `export OPENAI_KEY=xxxx` before using this services. | ||
|
||
```bash | ||
curl -X POST \ | ||
-H "Content-Type: multipart/form-data" \ | ||
-F "files=@./your_file.pdf" \ | ||
-F "process_table=true" \ | ||
-F "table_strategy=hq" \ | ||
http://localhost:6004/v1/dataprep | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,97 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
version: "3.8" | ||
services: | ||
neo4j-apoc: | ||
image: neo4j:latest | ||
container_name: neo4j-apoc | ||
volumes: | ||
- /$HOME/neo4j/logs:/logs | ||
- /$HOME/neo4j/config:/config | ||
- /$HOME/neo4j/data:/data | ||
- /$HOME/neo4j/plugins:/plugins | ||
ipc: host | ||
environment: | ||
- NEO4J_AUTH=${NEO4J_USERNAME}/${NEO4J_PASSWORD} | ||
- NEO4J_PLUGINS=["apoc"] | ||
- NEO4J_apoc_export_file_enabled=true | ||
- NEO4J_apoc_import_file_enabled=true | ||
- NEO4J_apoc_import_file_use__neo4j__config=true | ||
- NEO4J_dbms_security_procedures_unrestricted=apoc.\* | ||
ports: | ||
- "7474:7474" | ||
- "7687:7687" | ||
restart: always | ||
tei-embedding-service: | ||
image: ghcr.io/huggingface/text-embeddings-inference:cpu-1.5 | ||
container_name: tei-embedding-server | ||
ports: | ||
- "6006:80" | ||
volumes: | ||
- "./data:/data" | ||
shm_size: 1g | ||
environment: | ||
no_proxy: ${no_proxy} | ||
NO_PROXY: ${no_proxy} | ||
http_proxy: ${http_proxy} | ||
https_proxy: ${https_proxy} | ||
HUGGING_FACE_HUB_TOKEN: ${HUGGINGFACEHUB_API_TOKEN} | ||
ipc: host | ||
command: --model-id ${EMBEDDING_MODEL_ID} --auto-truncate | ||
tgi-gaudi-service: | ||
image: ghcr.io/huggingface/tgi-gaudi:2.0.5 | ||
container_name: tgi-gaudi-server | ||
ports: | ||
- "6005:80" | ||
volumes: | ||
- "./data:/data" | ||
environment: | ||
no_proxy: ${no_proxy} | ||
NO_PROXY: ${no_proxy} | ||
http_proxy: ${http_proxy} | ||
https_proxy: ${https_proxy} | ||
HUGGING_FACE_HUB_TOKEN: ${HUGGINGFACEHUB_API_TOKEN} | ||
HF_HUB_DISABLE_PROGRESS_BARS: 1 | ||
HF_HUB_ENABLE_HF_TRANSFER: 0 | ||
HABANA_VISIBLE_DEVICES: all | ||
OMPI_MCA_btl_vader_single_copy_mechanism: none | ||
ENABLE_HPU_GRAPH: true | ||
LIMIT_HPU_GRAPH: true | ||
USE_FLASH_ATTENTION: true | ||
FLASH_ATTENTION_RECOMPUTE: true | ||
runtime: habana | ||
cap_add: | ||
- SYS_NICE | ||
ipc: host | ||
command: --model-id ${LLM_MODEL_ID} --max-input-length 2048 --max-total-tokens 4096 | ||
dataprep-neo4j-llamaindex: | ||
image: opea/dataprep-neo4j-llamaindex:latest | ||
container_name: dataprep-neo4j-server | ||
depends_on: | ||
- neo4j-apoc | ||
- tgi-gaudi-service | ||
- tei-embedding-service | ||
ports: | ||
- "6004:6004" | ||
ipc: host | ||
environment: | ||
no_proxy: ${no_proxy} | ||
http_proxy: ${http_proxy} | ||
https_proxy: ${https_proxy} | ||
host_ip: ${host_ip} | ||
NEO4J_URL: ${NEO4J_URL} | ||
NEO4J_USERNAME: ${NEO4J_USERNAME} | ||
NEO4J_PASSWORD: ${NEO4J_PASSWORD} | ||
TGI_LLM_ENDPOINT: ${TGI_LLM_ENDPOINT} | ||
TEI_EMBEDDING_ENDPOINT: ${TEI_EMBEDDING_ENDPOINT} | ||
OPENAI_API_KEY: ${OPENAI_API_KEY} | ||
OPENAI_EMBEDDING_MODEL: ${OPENAI_EMBEDDING_MODEL} | ||
OPENAI_LLM_MODEL: ${OPENAI_LLM_MODEL} | ||
EMBEDDING_MODEL_ID: ${EMBEDDING_MODEL_ID} | ||
LLM_MODEL_ID: ${LLM_MODEL_ID} | ||
LOGFLAG: ${LOGFLAG} | ||
restart: unless-stopped | ||
networks: | ||
default: | ||
driver: bridge |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
# Copyright (C) 2024 Intel Corporation | ||
# SPDX-License-Identifier: Apache-2.0 | ||
|
||
import os | ||
|
||
host_ip = os.getenv("host_ip") | ||
# Neo4J configuration | ||
NEO4J_URL = os.getenv("NEO4J_URL", f"bolt://{host_ip}:7687") | ||
NEO4J_USERNAME = os.getenv("NEO4J_USERNAME", "neo4j") | ||
NEO4J_PASSWORD = os.getenv("NEO4J_PASSWORD", "neo4jtest") | ||
|
||
# LLM/Embedding endpoints | ||
TGI_LLM_ENDPOINT = os.getenv("TGI_LLM_ENDPOINT", f"http://{host_ip}:6005") | ||
TEI_EMBEDDING_ENDPOINT = os.getenv("TEI_EMBEDDING_ENDPOINT ", f"http://{host_ip}:6006") | ||
|
||
OPENAI_API_KEY = os.getenv("OPENAI_API_KEY") | ||
OPENAI_EMBEDDING_MODEL = os.getenv("OPENAI_EMBEDDING_MODEL", "text-embedding-3-small") | ||
OPENAI_LLM_MODEL = os.getenv("OPENAI_LLM_MODEL", "gpt-4o") |
Oops, something went wrong.