RobertKirk

Follow

Robert Kirk RobertKirk

Follow

PhD student at @ucl-dark. Interested in understanding LLM fine-tuning, AI safety and (super)alignment.

47 followers · 9 following

Achievements

Achievements

Highlights

Pro

Pinned Loading

facebookresearch/rlfh-gen-div facebookresearch/rlfh-gen-div Public

This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity

Python 40 7
tinystories-wrappers tinystories-wrappers Public

Code for the TinyStories experiments from "Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks".

Jupyter Notebook 5 1
facebookresearch/minihack facebookresearch/minihack Public

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Python 487 60
stanford_alpaca stanford_alpaca Public

Forked from tatsu-lab/stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python