
minRLHF

A (somewhat) minimal library for finetuning language models with PPO on human feedback.
Primarily intended for educational purposes, but it can also be used to train models of up to 1B parameters.
Inspired by Andrej Karpathy's minGPT and OpenAI's Spinning Up.
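For orientation, the quantity such a library optimises is the clipped PPO surrogate objective. The snippet below is a generic PyTorch sketch of that objective, assuming token-level log-probabilities and advantages; the function name and tensor shapes are illustrative assumptions, not minRLHF's actual implementation.

```python
# Generic sketch of the clipped PPO surrogate loss (NOT minRLHF's code).
# Assumes per-token log-probs and advantage estimates of shape (batch, seq_len).
import torch

def ppo_clipped_loss(logprobs, old_logprobs, advantages, clip_ratio=0.2):
    # Importance weights between the current policy and the policy that sampled the data.
    ratio = torch.exp(logprobs - old_logprobs)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1 - clip_ratio, 1 + clip_ratio) * advantages
    # PPO maximises the minimum of the two terms; return a loss to minimise.
    return -torch.min(unclipped, clipped).mean()
```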

Clone the repository, install it locally (i.e. with pip install .), and see examples/huggingface_example.ipynb for how to get started.
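The example notebook works with Hugging Face models. As a rough sketch of the starting point (standard transformers calls, not minRLHF's own API; the choice of gpt2 is an assumption), loading a small causal LM to finetune might look like:

```python
# Minimal sketch: load a small Hugging Face causal LM and tokenizer to hand to
# an RLHF training loop. Uses the standard transformers API; "gpt2" is an
# arbitrary illustrative choice.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
```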

Produce stunning graphs like these! [example training plots not reproduced here]

Future work:

  • Produce a JAX version of the library.
  • Produce a demo showing how to finetune minGPT models for dependency-free RLHF.
