StanfordRibonanza2023

This repo contains my code for the 5th place solution to the Stanford Ribonanza RNA Folding competition (2023). All training was done on a single RTX 4090. Some pieces are still missing, but for now I hope it serves as a readable reference. The 5-fold ensemble on its own achieves a leaderboard (LB) score of 0.14034 public and 0.1423 private.

Summary

A 5-fold ensemble is trained on the competition training data, then used to create pseudo-labels by running inference on the test data. These checkpoints are then discarded.
The pseudo-labels are filtered, then used to train a single model that serves as the pretrained checkpoint for the next step.
A 5-fold ensemble is trained on the competition training data, starting from the pretrained checkpoint. These are the final models.
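The pseudo-labeling step (averaging the fold models' test predictions, then filtering) can be sketched roughly as follows. This is a minimal illustration, not the code in this repo: the filter criterion used here (average disagreement between folds, with a `max_mean_std` threshold) is an assumption, since the actual filtering rule is not described above, and all function names are hypothetical.

```python
from statistics import mean, pstdev


def ensemble_pseudo_labels(fold_predictions):
    """Combine per-position predictions from several fold models.

    fold_predictions: one list of floats per fold, all the same length
    (one value per sequence position).
    Returns (mean_per_position, std_per_position).
    """
    columns = list(zip(*fold_predictions))  # regroup values by position
    means = [mean(col) for col in columns]
    stds = [pstdev(col) for col in columns]
    return means, stds


def keep_sequence(stds, max_mean_std=0.1):
    # ASSUMPTION: keep a sequence only when the folds agree on average.
    # The real filter used for the solution may be entirely different.
    return mean(stds) <= max_mean_std


def build_pseudo_label_set(test_predictions, max_mean_std=0.1):
    """test_predictions: dict mapping sequence id -> list of per-fold predictions."""
    kept = {}
    for seq_id, fold_preds in test_predictions.items():
        means, stds = ensemble_pseudo_labels(fold_preds)
        if keep_sequence(stds, max_mean_std):
            kept[seq_id] = means  # the ensemble mean becomes the pseudo-label
    return kept
```

The resulting dictionary would then play the role of the training targets for the single pretraining model in the second step.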

Hyperparameters

Hyperparameters are set in trainer.py, which also serves as the entry point for training and then inference. The repo includes the hparams YAML files from the three stages above.
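To see what changed between stages, one option is to load two of the hparams files and compare them. The sketch below assumes the files are plain YAML mappings that `yaml.safe_load` (from PyYAML) can read, which may not hold if they contain custom tags; `load_hparams` and `diff_hparams` are hypothetical helpers, not part of this repo, and the keys in the usage example are made up.

```python
def load_hparams(path):
    """Load a YAML hparams file into a dict (requires PyYAML)."""
    import yaml  # imported lazily so diff_hparams works without PyYAML

    with open(path) as f:
        return yaml.safe_load(f) or {}


def diff_hparams(old, new):
    """Return {key: (old_value, new_value)} for every key whose value differs.

    A key that is absent on one side is reported with None on that side.
    """
    missing = object()  # sentinel to tell "absent" apart from stored values
    changes = {}
    for key in sorted(set(old) | set(new)):
        a = old.get(key, missing)
        b = new.get(key, missing)
        if a != b:
            changes[key] = (
                None if a is missing else a,
                None if b is missing else b,
            )
    return changes


# Usage with made-up keys (the real files may use different names):
example = diff_hparams({"lr": 1e-3, "epochs": 10}, {"lr": 5e-4, "epochs": 10})
print(example)
```

With the real files, the call would look like `diff_hparams(load_hparams("hparams_step_1.yaml"), load_hparams("hparams_step_3.yaml"))`.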

Quickstart

Install the dependencies listed in requirements.txt (for example, pip install -r requirements.txt)
Download the preprocessed data from this Kaggle dataset (WIP) and put it in data/
Preprocess the EternaFold BPP files (WIP)
Set the hyperparameters in trainer.py according to the desired training stage
Run: cd exp && chmod +x trainer.py && ./trainer.py

s-rog/StanfordRibonanza2023

Folders and files

Latest commit

History

Repository files navigation

StanfordRibonanza2023

Summary

Hyperparameters

Quickstart

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages