Skip to content

Commit

Permalink
Add README for running OPEA ragas using HF endpoint on Gaudi (#137)
Browse files Browse the repository at this point in the history
* minimized required fields/columns in user data

Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* add bench-target as the prefix of output folder (#133)

Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* remove examples. (#135)

Co-authored-by: root <root@idc708073.jf.intel.com>
Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* minor naming correction to maintain consistency

Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* Add hyperlinks and paths validation. (#132)

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* adding README for OPEA ragas

Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* adding python3 syntax to README

Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: aasavari <aasavari.dhananjay.kakne@intel.com>
Signed-off-by: Yingchun Guo <yingchun.guo@intel.com>
Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: Ying Chun Guo <yingchun.guo@intel.com>
Co-authored-by: lkk <33276950+lkk12014402@users.noreply.github.com>
Co-authored-by: root <root@idc708073.jf.intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
  • Loading branch information
6 people authored Sep 24, 2024
1 parent f1593ea commit 0dff0d3
Show file tree
Hide file tree
Showing 2 changed files with 41 additions and 4 deletions.
39 changes: 39 additions & 0 deletions evals/metrics/ragas/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
# OPEA adaption of ragas (LLM-as-a-judge evaluation of Retrieval Augmented Generation)
OPEA's adaption of [ragas](https://github.com/explodinggradients/ragas) allows you to use [ragas](https://github.com/explodinggradients/ragas) on Intel's Gaudi AI accelerator chips.

## User data
Please wrap your input data in `datasets.Dataset` class.
```python3
from datasets import Dataset

example = {
"question": "Who is wife of Barak Obama",
"contexts": [
"Michelle Obama, wife of Barak Obama (former President of the United States of America) is an attorney",
"Barak and Michelle Obama have 2 daughters - Malia and Sasha",
],
"answer": "Michelle Obama",
"ground_truth": "Wife of Barak Obama is Michelle Obama",
}
dataset = Dataset.from_list([example])
```

## Launch HuggingFace endpoint on Intel's Gaudi machines
Please follow instructions mentioned in [TGI Gaudi repo](https://github.com/huggingface/tgi-gaudi) with your desired LLM such as `meta-llama/Meta-Llama-3.1-70B-Instruct`.

## Run OPEA ragas pipeline using your desired list of metrics
```python3
# note - if you wish to use answer relevancy metric, please set the embedding parameter
from langchain_community.embeddings import HuggingFaceBgeEmbeddings

embeddings = HuggingFaceBgeEmbeddings(model_name="BAAI/bge-base-en-v1.5")

from ragas import RagasMetric

ragas_metric = RagasMetric(threshold=0.5, model="<set your HF endpoint URL>", embeddings=embeddings)
print(ragas_metric.measure(dataset))
```
That's it!

## Troubleshooting
Please allow few minutes for HuggingFace endpoint to download model weights and load them. Larger models may take few more minutes. For any other issue, please file an issue and we will get back to you.
6 changes: 2 additions & 4 deletions evals/metrics/ragas/ragas.py
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ def __init__(
"context_recall",
"faithfulness",
"context_utilization",
"reference_free_rubrics_score",
# "reference_free_rubrics_score",
]

async def a_measure(self, test_case: Dict):
Expand All @@ -55,7 +55,6 @@ def measure(self, test_case: Dict):
context_recall,
context_utilization,
faithfulness,
reference_free_rubrics_score,
)
except ModuleNotFoundError:
raise ModuleNotFoundError("Please install ragas to use this metric. `pip install ragas`.")
Expand All @@ -71,7 +70,7 @@ def measure(self, test_case: Dict):
"context_recall": context_recall,
"faithfulness": faithfulness,
"context_utilization": context_utilization,
"reference_free_rubrics_score": reference_free_rubrics_score,
# "reference_free_rubrics_score": reference_free_rubrics_score,
}
# Set LLM model
openai_key = os.getenv("OPENAI_API_KEY", None)
Expand Down Expand Up @@ -118,7 +117,6 @@ def measure(self, test_case: Dict):
]
# Find necessary input fields using the given metrics
_required_columns = set()
is_latest = faithfulness
column_map = { # this column maps new naming style in ragas to their old naming style
"user_input": "question",
"response": "answer",
Expand Down

0 comments on commit 0dff0d3

Please sign in to comment.