LMSYS - Chatbot Arena Human Preference Predictions

Requirements

Hardware

A100 SXM 80G x4

Software

Base Image

nvcr.io/nvidia/pytorch:24.04-py3

Packages

detectron2==0.6
transformers==4.43.3
datasets==2.19.0
flash-attn==2.6.2
optimi==0.2.1
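
A quick way to confirm that an environment matches the pins above is to compare them against what is installed. The following is a minimal sketch, not part of this repo: `check_versions` is a hypothetical helper, and it assumes the distribution names reported by pip match the names in the list (they may differ, and some builds report a local suffix such as `+cu...` that would show up as a mismatch).

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the package list above. The distribution names are an
# assumption; adjust them if pip reports different names in your environment.
PINNED = {
    "detectron2": "0.6",
    "transformers": "4.43.3",
    "datasets": "2.19.0",
    "flash-attn": "2.6.2",
    "optimi": "0.2.1",
}


def check_versions(pinned=PINNED, get_version=version):
    """Return {name: (expected, installed_or_None)} for every mismatch."""
    mismatches = {}
    for name, expected in pinned.items():
        try:
            installed = get_version(name)
        except PackageNotFoundError:
            # Package is not installed under this distribution name.
            installed = None
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in check_versions().items():
        print(f"{name}: expected {expected}, found {installed}")
```

An empty result means every listed package is installed at the pinned version.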

Training

The directory structure should be as follows.

├── data
│   ├── train.csv
│   └── test.csv
├── artifacts
│   ├── dtrainval.csv
│   ├── lmsys-33k-deduplicated.csv
│   ├── ...
│   ├── stage1
│   ├── ...
│   └── stage3
└── src  # this repo
    ├── configs
    ├── human_pref
    └── main.py
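
The layout above can be checked before starting a run. This is a minimal sketch under stated assumptions: `missing_paths` is a hypothetical helper, not part of this repo. It checks only the entries named explicitly in the tree (entries elided with `...` and the stage directories are skipped), and some `artifacts` files may only exist after the preparation steps have been run.

```python
from pathlib import Path

# Paths taken from the tree above, relative to the directory that contains
# `data`, `artifacts`, and `src`. Elided ("...") entries are not checked.
EXPECTED = [
    "data/train.csv",
    "data/test.csv",
    "artifacts/dtrainval.csv",
    "artifacts/lmsys-33k-deduplicated.csv",
    "src/configs",
    "src/human_pref",
    "src/main.py",
]


def missing_paths(root, expected=EXPECTED):
    """Return the expected relative paths that do not exist under root."""
    root = Path(root)
    return [p for p in expected if not (root / p).exists()]


if __name__ == "__main__":
    # Assumes the script is run from the directory that contains `src`.
    for path in missing_paths("."):
        print(f"missing: {path}")
```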

1. python scripts/prepare_dataset.py, and download the 21k external data from abdullahmeda
2. stage1
3. make pseudo labels
4. stage2
5. stage3

Inference

Reference scripts for converting checkpoints for inference:

python scripts/prepare_gemma2_for_submission.py
python scripts/prepare_llama3_for_submission.py

Kaggle Notebook
