Social Bias Evaluation for Large Language Models Requires Prompt Variations

This repository contains the code for evaluating social bias in LLMs using prompt variations. The evaluation dataset is BBQ. The prompt variations cover three perspectives:

1. task instruction and prompt
- see data/template.tsv
2. few-shot examples
- see data/BBQ_few_shot.jsonl
3. debias-prompts
- see data/debias_prompts.json
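
As a rough illustration of how the three perspectives could combine into distinct evaluation settings, here is a minimal sketch. It assumes the perspectives are fully crossed, which the repository may or may not do. The template names, shot counts, and debias-prompt keys are hypothetical stand-ins, not the contents of the files listed above.

```python
from itertools import product

# Hypothetical stand-ins for the three perspectives; the real values live in
# data/template.tsv, data/BBQ_few_shot.jsonl, and data/debias_prompts.json.
templates = ["template_a", "template_b"]
few_shot_settings = [0, 4]
debias_prompts = [None, "debias_1"]  # None means no debias-prompt


def enumerate_variations(templates, few_shot_settings, debias_prompts):
    """Return every combination of the three perspectives (assumed fully crossed)."""
    return [
        {"template": t, "n_shots": k, "debias_prompt": d}
        for t, k, d in product(templates, few_shot_settings, debias_prompts)
    ]


variations = enumerate_variations(templates, few_shot_settings, debias_prompts)
print(len(variations))  # 2 * 2 * 2 = 8 settings
```

Each returned dictionary describes one prompt setting under which the same BBQ instances would be evaluated.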

How to Use

Run the experiments with the commands in the following steps.

Dataset Preparation

Before inference, prepare the variations of (1) task instruction and prompt and (2) few-shot settings:

    python3 data/convert_format.py

Inference

Run inference for each LLM:

    export PYTHONPATH="$PWD/src:$PYTHONPATH"
    python3 src/pred.py --model <model_name> --file <file_name> --debias_prompt <debias_prompt>

model_name: the name of a model checkpoint on Hugging Face.
file_name: the file of target evaluation instances (for example, data/jsonl/eval_prompt_no_taskinst.jsonl).
debias_prompt: the key of a debias-prompt (see data/debias_prompts.json above). To evaluate without debias-prompts, omit this argument.
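
To run inference over several settings, the command above can be assembled programmatically. The sketch below uses only the flags documented here (--model, --file, --debias_prompt). The helper names build_pred_command and build_env, the model name example-org/example-model, and the key example_key are hypothetical placeholders, and the actual subprocess call is left commented out.

```python
import os
import shlex
from typing import Dict, List, Optional


def build_pred_command(model_name: str, file_name: str,
                       debias_prompt: Optional[str] = None) -> List[str]:
    """Build the src/pred.py command; omit --debias_prompt when no key is given."""
    cmd = ["python3", "src/pred.py", "--model", model_name, "--file", file_name]
    if debias_prompt is not None:
        cmd += ["--debias_prompt", debias_prompt]
    return cmd


def build_env(repo_root: str) -> Dict[str, str]:
    """Copy the current environment and prepend <repo_root>/src to PYTHONPATH."""
    env = dict(os.environ)
    src_dir = os.path.join(repo_root, "src")
    existing = env.get("PYTHONPATH", "")
    env["PYTHONPATH"] = src_dir + (os.pathsep + existing if existing else "")
    return env


# Hypothetical model name and debias-prompt key, for illustration only.
for debias_key in [None, "example_key"]:
    command = build_pred_command(
        "example-org/example-model",
        "data/jsonl/eval_prompt_no_taskinst.jsonl",
        debias_key,
    )
    print(shlex.join(command))
    # To actually launch inference from the repository root:
    # import subprocess
    # subprocess.run(command, env=build_env(os.getcwd()), check=True)
```

Passing the command as a list rather than a single string avoids shell quoting issues when model or file names contain special characters.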

Evaluation

Calculate the task performance and social bias of the LLMs:

    python3 evaluation/eval_bbq.py --result_dir <result_folder>
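
For intuition about what "task performance" and "social bias" can mean on BBQ, here is a minimal sketch of accuracy and the bias scores as defined in the original BBQ paper. It is an assumption that these definitions are relevant here; evaluation/eval_bbq.py may compute its metrics differently. The Prediction record and its label strings are hypothetical and do not reflect the repository's result format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Prediction:
    # Hypothetical record; labels are "biased", "counter_biased", or "unknown".
    predicted: str
    gold: str


def accuracy(preds: List[Prediction]) -> float:
    """Fraction of predictions that match the gold label."""
    return sum(p.predicted == p.gold for p in preds) / len(preds)


def bias_score_disambiguated(preds: List[Prediction]) -> float:
    """BBQ s_DIS = 2 * (biased answers / non-unknown answers) - 1, in [-1, 1]."""
    non_unknown = [p for p in preds if p.predicted != "unknown"]
    if not non_unknown:
        return 0.0  # no committed answers, so no measurable bias direction
    n_biased = sum(p.predicted == "biased" for p in non_unknown)
    return 2 * n_biased / len(non_unknown) - 1


def bias_score_ambiguous(preds: List[Prediction]) -> float:
    """BBQ s_AMB = (1 - accuracy) * s_DIS, computed over ambiguous contexts."""
    return (1 - accuracy(preds)) * bias_score_disambiguated(preds)


# Toy ambiguous-context predictions: the correct answer is always "unknown".
ambiguous_preds = [
    Prediction("biased", "unknown"),
    Prediction("biased", "unknown"),
    Prediction("unknown", "unknown"),
    Prediction("counter_biased", "unknown"),
]
print(accuracy(ambiguous_preds))
print(bias_score_ambiguous(ambiguous_preds))
```

A score near 0 indicates no systematic preference, while positive values indicate answers that align with the stereotype more often than against it.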

rem-h4/llm_socialbias_prompts
