Skip to content

Repository for Aya Expedition Project : Reward Model Multilingual

License

Notifications You must be signed in to change notification settings

for-ai/aya_rm_multilingual

Repository files navigation

Expedition Aya: Reward Model Multilingual

In this project, we evaluate reward models on their multilingual capabilities. We translated RewardBench into 23 diverse languages and evaluated several open-source and multilingual LLMs on their chat, safety, and reasoning capabilities. This project is part of Cohere for AI's Expedition Aya 2024, a 6-week open build challenge.

🤗 Dataset | 💬 Presentation | 📚 Documentation | 📄 Paper (coming soon!)

News

  • [2024-08-28] We won Silver Prize in Expedition Aya 2024! We're also releasing the v1 of the multilingual RewardBench on HuggingFace.

Setup and installation

We recommend installing the dependencies inside a virtual environment:

# Create and activate the virtual environment
python -m venv venv
source venv/bin/activate
# Install the dependencies (within venv context)
pip install -r requirements.txt

Note that the rewardbench package requires Python 3.10 and above.

Testing and Development

This codebase contains minimal tests, mostly we test functions that were added or patched from RewardBench. First, you need to install all the development dependencies:

pip install -r requirements-dev.txt

Then, you can run the tests by:

pytest tests/ -v --capture=no
pytest tests/ -m "not api" -v --capture=no  # to ignore tests that make use of third-party APIs

When developing, we format the code using black and isort, to be consistent with the RewardBench codebase. You can automatically format your code by running:

make style

Team Members

The team is composed of Srishti Gureja (@srishti-git1110), Shayekh Bin Islam, (@ShayekhBinIslam), Rishabh Maheshwary (@RishabhMaheshwary), Drishti Sushma (@DrishtiShrrrma), Gusti Winata (@sanggusti), and Lj Miranda (@ljvmiranda921).

About

Repository for Aya Expedition Project : Reward Model Multilingual

Resources

License

Stars

Watchers

Forks

Contributors 4

  •  
  •  
  •  
  •  

Languages