Implements speech-to-text (STT) and retrieval-augmented generation (RAG) to assist live sales calls.
- STT with whisper.cpp and local LLM inference with llama.cpp
- Custom embeddings for your text corpus using SentenceTransformers
- Indexing documents + embeddings with Elasticsearch
- Getting Started
- Creating Custom Embeddings
- Indexing with Elasticsearch
- Interface with Gradio
- Next Steps
This demo assumes you have:
- docker and docker-compose installed
- Familiarity with RAG and its applications
Make sure to convert your Llama model to GGUF format with llama.cpp, following their conversion instructions. Then save the converted model in a local directory named models/
Launch with `docker-compose up` and navigate to http://localhost:8090
By fine-tuning a model with SentenceTransformers, we can generate text embeddings locally and match incoming queries against documents in our Elasticsearch index.
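The matching step itself reduces to nearest-neighbor search over embedding vectors. Below is a minimal sketch of cosine-similarity ranking; the `docs` and `query` vectors are tiny hand-written stand-ins for real SentenceTransformers output, used only for illustration:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy 3-d "embeddings"; in the demo these would come from the fine-tuned model.
docs = {
    "pricing tiers": [0.9, 0.1, 0.0],
    "refund policy": [0.1, 0.8, 0.2],
}
query = [0.85, 0.15, 0.05]

# Rank documents by similarity to the query embedding.
ranked = sorted(docs, key=lambda name: cosine(query, docs[name]), reverse=True)
print(ranked[0])  # best-matching document
```

In practice Elasticsearch performs this ranking server-side over a `dense_vector` field, so the client only ships the query embedding.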
The scraper/main.py script scrapes a list of sites to index; update the links in scraper/config.json to change what gets scraped.
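The internals of scraper/main.py aren't reproduced here, but a scraper like it needs two pieces: reading the link list and stripping fetched HTML down to indexable text. The sketch below shows both using only the standard library; the `{"links": [...]}` config shape is an assumption for illustration, not necessarily the demo's actual schema:

```python
import json
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collects visible text, skipping script/style contents."""
    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.parts.append(data.strip())

def extract_text(html: str) -> str:
    """Reduce an HTML page to whitespace-joined visible text."""
    parser = TextExtractor()
    parser.feed(html)
    return " ".join(parser.parts)

def load_links(config_text: str) -> list:
    # Assumed config shape: {"links": ["https://...", ...]}
    return json.loads(config_text)["links"]

print(extract_text("<html><script>x=1</script><body><p>Quarterly pricing</p></body></html>"))
```

Fetching each link (e.g. with `urllib.request.urlopen`) and passing the response body through `extract_text` yields the text to embed and index.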
Using Elasticsearch, we can index and tag documents, enabling filtering and custom relevance scoring at query time. The scraper/main.py script handles indexing after scraping.
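As a sketch of what tag-based filtering looks like, here are the kind of request bodies involved: a mapping with text, keyword tags, and a `dense_vector` field, plus a query that restricts full-text matches to tagged documents. The field names (`text`, `tags`, `embedding`) and the 384-dim size are assumptions; with the official Python client these dicts would be passed to `indices.create` and `search`:

```python
import json

# Assumed index mapping -- align field names with what scraper/main.py writes.
mapping = {
    "mappings": {
        "properties": {
            "text": {"type": "text"},
            "tags": {"type": "keyword"},
            "embedding": {"type": "dense_vector", "dims": 384},
        }
    }
}

def tagged_query(text: str, tags):
    """Full-text match restricted to documents carrying any of the given tags."""
    return {
        "query": {
            "bool": {
                "must": [{"match": {"text": text}}],
                "filter": [{"terms": {"tags": list(tags)}}],
            }
        }
    }

q = tagged_query("pricing objection", ["sales", "pricing"])
print(json.dumps(q))
```

The `filter` clause is applied without affecting scoring, so relevance is still driven by the `match` on the document text.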
With Gradio, you press a button to start transcription and read suggestions in the chatbox.
The app/app.py script contains the logic to run Whisper for speech-to-text, query the Elasticsearch index, and launch the front-end.
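The exact flow in app/app.py isn't shown here, but the glue between the three pieces is essentially prompt assembly: the Whisper transcript plus the top Elasticsearch hits become the input to the LLM. The `build_prompt` helper and its template below are hypothetical, included only to illustrate that step:

```python
def build_prompt(transcript: str, snippets) -> str:
    """Assemble a RAG prompt from the latest transcript and retrieved snippets.

    `transcript` stands in for Whisper output; `snippets` for Elasticsearch
    hits. The wording of the template is an illustrative assumption.
    """
    context = "\n".join(f"- {s}" for s in snippets)
    return (
        "You are assisting a live sales call.\n"
        f"Relevant notes:\n{context}\n"
        f"Caller said: {transcript}\n"
        "Suggest a concise response for the salesperson:"
    )

prompt = build_prompt(
    "What does the premium tier cost?",
    ["Premium is $49/mo", "Annual billing saves 20%"],
)
print(prompt)
```

The resulting string would be sent to the llama.cpp server, and its completion streamed into the Gradio chatbox as a suggestion.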
- Fine-tune an LLM for your use case
- Add additional indices for query/retrieval
- Try a container orchestrator like Kubernetes for robust distributed deployments