LMSYS - Chatbot Arena Human Preference Predictions

Requirements

Hardware

A100 SXM 80G x4

Software

Base Image

nvcr.io/nvidia/pytorch:24.04-py3

Packages

detectron2==0.6
transformers==4.43.3
datasets==2.19.0
flash-attn==2.6.2
optimi==0.2.1
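
As a quick sanity check before training, the pins above can be compared against what is installed in the container. This is a minimal sketch and not part of the repository. It assumes each package is registered under the distribution name listed above, which may not hold for every entry (the installed distribution name can differ from the listed name, and source builds may report a version with a local suffix). The helper name check_versions is hypothetical.

```python
from importlib import metadata

# Pins copied from the package list above.
# Assumption: each key is also the installed distribution name.
PINNED = {
    "detectron2": "0.6",
    "transformers": "4.43.3",
    "datasets": "2.19.0",
    "flash-attn": "2.6.2",
    "optimi": "0.2.1",
}


def check_versions(pinned, get_version=metadata.version):
    """Return {name: (expected, found)} for packages that do not match.

    `found` is None when the distribution is not installed under that name.
    """
    mismatches = {}
    for name, expected in pinned.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    # Print one line per mismatch; print nothing if everything matches.
    for name, (expected, found) in check_versions(PINNED).items():
        print(f"{name}: expected {expected}, found {found}")
```

A package reported as missing here may still be importable under a different distribution name, so treat the output as a hint, not a verdict.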

Training

The directory structure should be as follows:

├── data
│   ├── train.csv
│   └── test.csv
├── artifacts
│   ├── dtrainval.csv
│   ├── lmsys-33k-deduplicated.csv
│   ├── ...
│   ├── stage1
│   ├── ...
│   └── stage3
└── src  # this repo
    ├── configs
    ├── human_pref
    └── main.py
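
The layout above can be checked with a short script before starting a run. This is a sketch under stated assumptions, not something shipped with the repository. It checks only the paths named explicitly in the tree and ignores the entries elided with "...". It also leaves out the stage1 and stage3 directories, on the assumption that they are outputs of training. The README does not say which files under artifacts exist before the preparation step, so a missing entry there may simply mean that step has not run yet. The helper name missing_paths is hypothetical.

```python
from pathlib import Path

# Paths named explicitly in the tree above, relative to the project root.
# Elided entries ("...") and the stage directories are deliberately omitted.
EXPECTED = [
    "data/train.csv",
    "data/test.csv",
    "artifacts/dtrainval.csv",
    "artifacts/lmsys-33k-deduplicated.csv",
    "src/configs",
    "src/human_pref",
    "src/main.py",
]


def missing_paths(root, expected=EXPECTED):
    """Return the expected relative paths that do not exist under `root`."""
    root = Path(root)
    return [p for p in expected if not (root / p).exists()]


if __name__ == "__main__":
    # Assumes the script is run from the directory that contains data/,
    # artifacts/ and src/.
    for path in missing_paths("."):
        print(f"missing: {path}")
```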

Run the steps in this order:

1. Run python scripts/prepare_dataset.py and download the 21k external dataset from abdullahmeda.
2. Train stage1.
3. Make pseudo labels.
4. Train stage2.
5. Train stage3.

Inference

The following scripts serve as a reference for converting checkpoints for inference.

python scripts/prepare_gemma2_for_submission.py
python scripts/prepare_llama3_for_submission.py

Kaggle Notebook

tascj/kaggle-lmsys-chatbot-arena
