GitHub - poggiolabs/ragas: Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Documentation | Installation | Quickstart | Community | Open Analytics | Hugging Face

🚀 Dedicated solutions to evaluate, monitor, and improve the performance of LLM & RAG applications in production, including custom models for production quality monitoring. Talk to founders

Ragas is a framework that helps you evaluate your Retrieval Augmented Generation (RAG) pipelines. RAG denotes a class of LLM applications that use external data to augment the LLM’s context. There are existing tools and frameworks that help you build these pipelines, but evaluating them and quantifying their performance can be hard. This is where Ragas (RAG Assessment) comes in.

Ragas provides tools, based on the latest research on evaluating LLM-generated text, that give you insight into your RAG pipeline. Ragas can be integrated with your CI/CD pipeline to provide continuous performance checks.

🛡️ Installation

pip install ragas

If you want to install from source:

git clone https://github.com/explodinggradients/ragas && cd ragas
pip install -e .

🔥 Quickstart

This is a small example program you can run to see ragas in action!

from ragas import evaluate
from datasets import Dataset
import os

os.environ["OPENAI_API_KEY"] = "your-openai-key"

# Prepare your Hugging Face dataset in the following format:
# Dataset({
#     features: ['question', 'contexts', 'answer', 'ground_truths'],
#     num_rows: 25
# })

dataset: Dataset  # placeholder: assign your prepared dataset here

results = evaluate(dataset)
# {'ragas_score': 0.860, 'context_precision': 0.817,
# 'faithfulness': 0.892, 'answer_relevancy': 0.874}
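The dataset format in the comment above can be assembled from per-example records. The sketch below is one possible way to do this and is not part of ragas: `rows_to_columns` and `REQUIRED_COLUMNS` are hypothetical names, the sample row is made up for illustration, and the value types (a string for `question` and `answer`, lists of strings for `contexts` and `ground_truths`) are an assumption, since this README only lists the column names.

```python
# Hypothetical helper, not part of ragas: turn per-example records into the
# column layout listed in the quickstart comment.
REQUIRED_COLUMNS = ("question", "contexts", "answer", "ground_truths")


def rows_to_columns(rows):
    """Convert a list of row dicts into a dict of equal-length column lists."""
    columns = {name: [] for name in REQUIRED_COLUMNS}
    for index, row in enumerate(rows):
        missing = [name for name in REQUIRED_COLUMNS if name not in row]
        if missing:
            raise ValueError(f"row {index} is missing columns: {missing}")
        for name in REQUIRED_COLUMNS:
            columns[name].append(row[name])
    return columns


# Made-up sample row; the value types are an assumption (see the note above).
rows = [
    {
        "question": "What does RAG stand for?",
        "contexts": ["RAG is short for Retrieval Augmented Generation."],
        "answer": "Retrieval Augmented Generation.",
        "ground_truths": ["Retrieval Augmented Generation"],
    }
]
columns = rows_to_columns(rows)

# With the `datasets` package installed, the columns can then be wrapped as:
#     from datasets import Dataset
#     dataset = Dataset.from_dict(columns)
```

Validating the column names up front means a missing field surfaces as a clear error before any evaluation call is made.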

Refer to our documentation to learn more.
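For the CI/CD use mentioned earlier, one simple pattern is to fail a build when any score drops below a minimum. This is a sketch under assumptions: `find_regressions` and the threshold values are hypothetical, and the scores are assumed to be available as a plain mapping like the example output above.

```python
def find_regressions(scores, minimums):
    """Return {metric: (score, minimum)} for every metric below its minimum.

    A metric that is absent from `scores` is treated as a failure.
    """
    failures = {}
    for metric, minimum in minimums.items():
        score = scores.get(metric)
        if score is None or score < minimum:
            failures[metric] = (score, minimum)
    return failures


# Scores copied from the example output above.
scores = {
    "ragas_score": 0.860,
    "context_precision": 0.817,
    "faithfulness": 0.892,
    "answer_relevancy": 0.874,
}

# Hypothetical thresholds; choose values that suit your own pipeline.
minimums = {"faithfulness": 0.85, "answer_relevancy": 0.90}

failures = find_regressions(scores, minimums)
for metric, (score, minimum) in failures.items():
    print(f"{metric}: {score} is below the minimum of {minimum}")
```

In a CI job, exiting with a non-zero status when `failures` is non-empty (for example with `sys.exit(1)`) would mark the run as failed.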

🫂 Community

If you want to get more involved with Ragas, check out our Discord server. It's a fun community where we geek out about LLMs, retrieval, production issues, and more.

🔍 Open Analytics

We track very basic usage metrics to help us figure out what our users want, what is working, and what is not. As a young startup, we have to be brutally honest about this, which is why we track these metrics. But as an Open Startup, we open-source all the data we collect. You can read more about this here. Ragas does not track any information that can be used to identify you or your company. You can see exactly what we track in the code.

To disable usage tracking, set the RAGAS_DO_NOT_TRACK flag to true.
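One way to set the flag from Python is sketched below. It assumes that RAGAS_DO_NOT_TRACK is read as an environment variable and that it is checked when ragas is imported or first used; this README only calls it a flag, so treat both points as assumptions.

```python
import os

# Assumption: RAGAS_DO_NOT_TRACK is read from the process environment.
# Set it before importing ragas so the setting is in place when it is checked.
os.environ["RAGAS_DO_NOT_TRACK"] = "true"
```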

