LlamaSearch

LlamaSearch is a conversational search engine powered by multiple LLM providers, delivering intelligent, context-aware responses with enhanced answer quality through Retrieval-Augmented Generation (RAG). It supports various search backends and provides detailed insights into model performance via Langfuse, offering a comprehensive and transparent search experience.

Best of all, by combining Ollama for local LLM processing and DuckDuckGo for open-source search, users can run a powerful, privacy-respecting search engine entirely on their local machines without any external dependencies or API keys. It is an extension of the search_with_lepton project by LeptonAI.

Features

Built-in support for multiple LLM providers:
- OpenAI
- TogetherAI
- HuggingFace Text Generation Interface (HF_TGI)
- Ollama
Integrated search engine capabilities
Customizable and responsive UI interface
Retrieval-Augmented Generation (RAG) for improved answer quality
Related questions generation
Support for multiple search backends:
- Google
- Serper
- DuckDuckGo
LLM observability with Langfuse

Run Locally Without API Keys

One of the main advantages of LlamaSearch is the ability to run it entirely on your local machine without requiring any external API keys. This setup ensures privacy, reduces costs, and allows for complete control over the search and language model processes.

Local LLM with Ollama

LlamaSearch supports Ollama, which allows you to run large language models locally on your own hardware. This means you can have a powerful LLM running on your machine without relying on external API services.

To use Ollama:

Install Ollama on your system following their installation guide.
Set the CLIENT environment variable to "OLLAMA" in your .env file.
Specify the Ollama model you want to use (e.g., "llama2") in the configuration.

Open-Source Search with DuckDuckGo

LlamaSearch integrates with DuckDuckGo, an open-source search engine that doesn't require an API key. This allows you to perform web searches without relying on proprietary search APIs.

To use DuckDuckGo:

Set the SEARCH_BACKEND environment variable to "DUCKDUCKGO" in your .env file.
No additional configuration is needed - it works out of the box!

By combining Ollama for local LLM processing and DuckDuckGo for open-source search, you can run a powerful, privacy-respecting search engine entirely on your local machine without any external dependencies or API keys.

LLM Observability with Langfuse

LlamaSearch integrates Langfuse for LLM observability, allowing you to monitor, debug, and optimize your language model interactions.

To set up Langfuse:

Sign up for a Langfuse account at langfuse.com
Obtain your Langfuse API keys (public and secret)

Add the following environment variables to your .env file:

langfuse_secret_key=your_secret_key_here
langfuse_public_key=your_public_key_here
langfuse_cloud=https://cloud.langfuse.com  # or your self-hosted Langfuse URL

Once configured, LlamaSearch will automatically send LLM interaction data to Langfuse, allowing you to monitor performance, debug issues, optimize prompts, and track costs associated with LLM API calls.

Prerequisites

Python 3.10+
Node.js
npm

Installation

Clone the repository:

git clone https://github.com/yourusername/LlamaSearch.git
cd LlamaSearch

Set up the environment variables:
```
cp .env_example .env
```
Edit the .env file and fill in the required configurations.
Install Python dependencies:
```
pip install -r requirements.txt
```
Install and build the web interface:
```
cd web
npm install
npm run build
```

Usage

Start the server:
```
cd ..
python thesearch.py
```
Open your web browser and navigate to http://localhost:8282 to use LlamaSearch.

Configuration

You can configure LlamaSearch by modifying the following environment variables in the .env file:

SEARCH_BACKEND: Choose between "GOOGLE", "SERPER", or "DUCKDUCKGO"
LLM_PROVIDER: Set to "OPENAI", "TOGETHER", "HF_TGI", or "OLLAMA"
OPENAI_API_KEY: Your OpenAI API key (if using OpenAI)
TOGETHER_API_KEY: Your TogetherAI API key (if using TogetherAI)
HF_API_KEY: Your Hugging Face TGI server API key (if using HuggingFace Text Generation Interface over a self hosted server)
GOOGLE_SEARCH_API_KEY: Your Google Search API key (if using Google search)
GOOGLE_SEARCH_CX: Your Google Search engine ID
SERPER_SEARCH_API_KEY: Your Serper API key (if using Serper search)
langfuse_secret_key: Your Langfuse secret key
langfuse_public_key: Your Langfuse public key
langfuse_cloud: Langfuse cloud URL or self-hosted instance URL

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Acknowledgments

Lepton AI for the search_with_lepton project
OpenAI for their language models
TogetherAI for their AI services
Hugging Face for their Transformers library
Ollama for local LLM support
DuckDuckGo for their open-source search engine
Langfuse for LLM observability

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
llm		llm
prompt		prompt
rag		rag
retrieval		retrieval
utils		utils
web		web
.env_example		.env_example
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
thesearch.py		thesearch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LlamaSearch

Features

Run Locally Without API Keys

Local LLM with Ollama

Open-Source Search with DuckDuckGo

LLM Observability with Langfuse

Prerequisites

Installation

Usage

Configuration

Contributing

Acknowledgments

About

Releases

Packages

Languages

abhibisht89/llamasearch

Folders and files

Latest commit

History

Repository files navigation

LlamaSearch

Features

Run Locally Without API Keys

Local LLM with Ollama

Open-Source Search with DuckDuckGo

LLM Observability with Langfuse

Prerequisites

Installation

Usage

Configuration

Contributing

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages