Add Audio PAIRS audio scenario #3149

ImKeTT · 2024-11-10T05:43:43Z

This PR adds the Audio PAIRS audio scenario, which is an audio extension of the PAIRS (https://arxiv.org/abs/2402.05779) from VHELM. The dataset was created by:

Collecting instructions that were used for PAIRS's image generation from Table C.1, C.2, C.3 in the PAIRS paper (https://arxiv.org/abs/2402.05779).
Assigning four different characters for each instruction, i.e., black woman, black man, white woman, white man.
Using OpenAI's TTS-001-HD API (tutorial) to generate the audio files. In detail, the voice of Echo for men and the voice of Nova for women (voice options).
Using the original questions from PAIRS as the Audio PAIRS's questions.
Adding the "unclear" option as the correct answer following VHELM.

Audio samples of the data can be found here.
I'm also attaching the run_spec.json and scenario_state.json files.

run_spec_audio_pairs.json
scenario_state_audio_pairs.json

src/helm/benchmark/static/schema_speech.yaml

src/helm/benchmark/scenarios/audio_language/audio_pairs_scenario.py

ImKeTT and others added 3 commits November 9, 2024 20:53

add audio_pairs scenario

5ae619f

Merge branch 'stanford-crfm:main' into audio_pairs_scenario

92549ad

add audio pairs audio scenario

a8e94ac

ImKeTT requested a review from teetone November 10, 2024 05:43

teetone approved these changes Nov 10, 2024

View reviewed changes

src/helm/benchmark/static/schema_speech.yaml Outdated Show resolved Hide resolved

src/helm/benchmark/scenarios/audio_language/audio_pairs_scenario.py Outdated Show resolved Hide resolved

fix

79c06f4

ImKeTT merged commit 3d28de9 into stanford-crfm:main Nov 10, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Audio PAIRS audio scenario #3149

Add Audio PAIRS audio scenario #3149

ImKeTT commented Nov 10, 2024

Add Audio PAIRS audio scenario #3149

Add Audio PAIRS audio scenario #3149

Conversation

ImKeTT commented Nov 10, 2024