Skip to content

jedt/ReplyCaddyTray

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

test-recording.webm

ReplyCaddy Tray

This is a RAG implementation for Front end ReactJS and Flask backend

Requirements:

  • MacOS
  • Python 3.11
  • Ollama
    • llama3.1:8b
    • mxbai-embed-large
  • HuggingFace
    • MBZUAI/LaMini-GPT-124M (for perplexity)

How it works

Basic RAG + reranker

  • First, it reads your text and PDF files inside your Downloads and Documents for context. Then it saves them to Chromadb.
  • During ETL, the app will check any tokens that exceeded 0.3 of normalized perplexity
    • normalized = (perplexity - 1) / 100
    • This will help ignore noise. Text from log dumps and unreadable sections of pdf files. These happen to generate spam in the retrieval process because they always appear in the results. Like when we enter a query on google and all it returns are spam websites.
  • Then we chunk the data into 512 tokens with an overlap of 128
  • websockets for the chat ui

About

RAG React.js Frontend using python-ollama

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published