Advanced RAG (Retrieval Augmented Generation) Service

Advanced RAG AI Service running in Docker, it can be used as

Quick MVP/POCm playground to verify which Index type can exactly address the specific (usually related to accuracy) LLM RAG use case.
OpenAPI interface and Gradio REST API interface are supported.

Http Clients, Power platform workflow, and MS Office Word/Outlook Add-In can easily use proper vector index types to search doc info and generate LLM response from the service.

Introduction

The service can help developers quickly verify different RAG indexing techniques (about accuracy and performance) for their own user cases, from Index Generation to verify the output through Chat Mode and Proofreading mode.

It can run as local Docker or put it to Azure Container App, build and perform queries on multiple important Index types. Below is the info about index types and how the project implements them:

Azure AI Search : Azure AI Search Python SDK + Hybrid Semantic Search + Sub Question Query
MS GraphRAG Local : REST APIs Provided by GraphRAG accelerator
MS GraphRAG Global : REST APIs Provided by GraphRAG accelerator
Knowledge Graph : LlamaIndex
Recursive Retriever : LlamaIndex
Summary Index : LlamaIndex
CSV Query Engine: LlamaIndex

Latest Features Update

Read in Change Log

Source Code & Development

https://github.com/freistli/AdvancedRAG_SVC

Quick Start

Download the repo:

git clone https://github.com/freistli/AdvancedRAG.git

IMPORTANT: Rename .env.sample to .env, input necessary environment variables.

Azure OpenAI resource and Azure Document Intelligence resource are must required.

Azure AI Search is optional if you don't build Azure AI Search index or use it.

MS GRAPHRAG is optional. If you want to use it please go through these steps to create GraphRAG backend service on Azure: https://github.com/azure-samples/graphrag-accelerator

Check Parameters for more details.
Build docker image

docker build -t docaidemo .

Run it locally

docker run -p 8000:8000 docaidemo

Access it locally

http://localhost:8000

Run it on Azure Container App

Publish your image to Azure Container Registry or Docker Hub
Create Azure Container App, choose the docker image you published, and then deploy the revision pod without any extra command.

Note: If you don't use .env in image, can set environment variables in the Azure Container App.

Build Index

Click one Index Build tab
Upload a file to file section

Note: The backend is Azure Document Intelligence to read the content, in general it supports lots of file formats, recommend to use PDF (print it as PDF) if the content is complicated
Input an Index name, click Submit
Wait till the index building completed, the right pane will update the status in real time.
After you see this words, means it is completed:

2024-06-13T10:04:54.120027: Index is persisted in /tmp/index_cache/yourindexname

/tmp/index_cache/yourindexname or yourindexname can be used as your Index name.

You can download the index files to local by clicking the Download Index button, so that can use it in your own docker image

(Optional) Setup Your Own Index in the Docker Image

Developers can use their own indexes folders for the docker image:

To make it work:

Move to the solution folder which contains the AdvancedRAG dockerfile
Create a folder to keep the index, for example, index123
Extract the index zip file you get from the step 6 in the "Build Index" section, save index files you downloaded into ./index123
Build the docker image again.

After this, you can use index123 as index name in the Chat mode.

NOTE: You can use the same way to store CSV files, build them with docker if you want to try CSV Query Engine without file uploading.

Call ADVRAGSVC through REST API CALL

Endpoint 1: Chat API

https://{BASEURL}/advchatbot/run/chat

METHOD

POST

HEADER

Content-Type: application/json

Sample Data

{
  "data": [
    "When did the Author convince his farther",  <----- Prompt
    "", <--- History Object, don't change it
    "Azure AI Search",  <------ Index Type
    "azuresearch_0",   <------- Index Name or Folder
    "You are a friendly AI Assistant",    <----- System Message
    false   <---- Streaming flag, true or false
  ]
}

Endpoint 2: Proofread Addin API

https://{BASEURL}/proofreadaddin/run/predict

NOTE: "rules" Index Name is predefined for Knowledge Graph Index of proofread addin. Please save Default knowledge graph index into ./rules/storage/rules_original

METHOD

POST

HEADER

Content-Type: application/json

Sample Data

{
"data": [    
 "今回は半導体製造装置セクターの最近の動きを分析します。" , <---- Proofread Content
 false <--- Streaming flag, true or false
]
}

Endpoint 3: CSV Query Engine API

https://{BASEURL}/csvqueryengine/run/chat

METHOD

POST

HEADER

Content-Type: application/json

Sample Data

{
"data": [
 "how many records does it have",   
 "", 
 "CSV Query Engine",   
 "./rules/files/sample.csv",    
 "You are a helpful AI assistant. You can help users with a variety of tasks, such as answering questions, providing recommendations, and assisting with tasks. You can also provide information on a wide range of topics from the retieved documents. If you are unsure about something, you can ask for clarification. Please be polite and professional at all times. If you have any questions, feel free to ask." ,
 false   
]
}

Consume the service in Office Add-In (use Proofread Addin use case)

Check: https://github.com/freistli/ProofreadAddin

Integrate to Copilot Studio

Step by Step: Integrate Advanced RAG Service with Your Own Data into Copilot Studio

Try your index in Chat Mode

Click Chat Mode
Choose the Index type
Put the index path or Azure AI Search Index name to Index Name text field
Now you can try to chat with the document with various system message if need.

Proofread mode

This is specific for Proofread scenario, especially for non-English Languages. It needs you generate Knowlege Graph Index.

Generating Knowledge Graph Index steps are the same as others.

View Knowledge Graph Index

Click the View Knowlege Graph Index tab
Put you Knowledge Graph Index name, and click Submit
Wait a while, click Download Knowledge Graph View to see the result.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
blogs		blogs
.dockerignore		.dockerignore
.env.sample		.env.sample
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Parameters.md		Parameters.md
README.md		README.md
deploy_acr_app.sh		deploy_acr_app.sh
deploy_aoai.sh		deploy_aoai.sh
deploy_storage.sh		deploy_storage.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Advanced RAG (Retrieval Augmented Generation) Service

Introduction

Latest Features Update

Source Code & Development

Quick Start

Run it on Azure Container App

Build Index

(Optional) Setup Your Own Index in the Docker Image

Call ADVRAGSVC through REST API CALL

Endpoint 1: Chat API

METHOD

HEADER

Sample Data

Endpoint 2: Proofread Addin API

METHOD

HEADER

Sample Data

Endpoint 3: CSV Query Engine API

METHOD

HEADER

Sample Data

Consume the service in Office Add-In (use Proofread Addin use case)

Integrate to Copilot Studio

Try your index in Chat Mode

Proofread mode

View Knowledge Graph Index

About

Releases

Packages

Languages

License

freistli/AdvancedRAG

Folders and files

Latest commit

History

Repository files navigation

Advanced RAG (Retrieval Augmented Generation) Service

Introduction

Latest Features Update

Source Code & Development

Quick Start

Run it on Azure Container App

Build Index

(Optional) Setup Your Own Index in the Docker Image

Call ADVRAGSVC through REST API CALL

Endpoint 1: Chat API

METHOD

HEADER

Sample Data

Endpoint 2: Proofread Addin API

METHOD

HEADER

Sample Data

Endpoint 3: CSV Query Engine API

METHOD

HEADER

Sample Data

Consume the service in Office Add-In (use Proofread Addin use case)

Integrate to Copilot Studio

Try your index in Chat Mode

Proofread mode

View Knowledge Graph Index

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages