A100 SXM 80G x4
Base Image
nvcr.io/nvidia/pytorch:24.04-py3
Packages
detectron2==0.6
transformers==4.43.3
datasets==2.19.0
flash-attn==2.6.2
optimi==0.2.1
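A quick way to confirm the environment matches the pins above is to compare them with the installed versions. The sketch below is not part of the repo. `check_pins` is a hypothetical helper, and the names in `PINS` are copied verbatim from the list, so they are assumed to match the installed distribution names (this may not hold for every package, depending on how it was installed).

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the package list above. Assumption: these strings are
# also the installed distribution names; adjust them if a lookup fails.
PINS = {
    "detectron2": "0.6",
    "transformers": "4.43.3",
    "datasets": "2.19.0",
    "flash-attn": "2.6.2",
    "optimi": "0.2.1",
}


def check_pins(pins, lookup=version):
    """Return {name: (expected, found)} for every pin that does not match.

    `found` is None when the distribution is not installed.
    """
    problems = {}
    for name, expected in pins.items():
        try:
            found = lookup(name)
        except PackageNotFoundError:
            found = None
        if found != expected:
            problems[name] = (expected, found)
    return problems


if __name__ == "__main__":
    for name, (expected, found) in check_pins(PINS).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

Running the file prints one line per mismatch and nothing when every pin matches.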
Directory structure should be as follows.
├── data
│   ├── train.csv
│   └── test.csv
├── artifacts
│   ├── dtrainval.csv
│   ├── lmsys-33k-deduplicated.csv
│   ├── ...
│   ├── stage1
│   ├── ...
│   └── stage3
└── src  # this repo
    ├── configs
    ├── human_pref
    └── main.py
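The layout above can be checked before training starts. This is a minimal sketch, not a script shipped with the repo. `missing_paths` is a hypothetical helper, and `EXPECTED` lists only the entries named explicitly in the tree (the `...` entries are left out). Some of these, such as the stage directories, may only exist after later steps have run.

```python
from pathlib import Path

# Entries named explicitly in the tree above, relative to the common parent
# of data/, artifacts/ and src/. The elided "..." entries are not checked.
EXPECTED = [
    "data/train.csv",
    "data/test.csv",
    "artifacts/dtrainval.csv",
    "artifacts/lmsys-33k-deduplicated.csv",
    "artifacts/stage1",
    "artifacts/stage3",
    "src/configs",
    "src/human_pref",
    "src/main.py",
]


def missing_paths(root, expected=EXPECTED):
    """Return the expected relative paths that do not exist under `root`."""
    root = Path(root)
    return [p for p in expected if not (root / p).exists()]


if __name__ == "__main__":
    # Assumption: the script is run from inside src/, so the root is "..".
    for path in missing_paths(".."):
        print("missing:", path)
```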
python scripts/prepare_dataset.py
and download 21k external data from abdullahmeda
- stage1
- make pseudo labels
- stage2
- stage3
Reference scripts to convert checkpoints for inference.
python scripts/prepare_gemma2_for_submission.py
python scripts/prepare_llama3_for_submission.py