Skip to content

mattosborn/caa-replication

Repository files navigation

./run.py baseline   # test refusal baseline
./run.py generate   # generate steering vectors
./run.py steering   # test steering vectors
├── caa/       # model wrapper, util code, etc
├── data/      # data output
├── datasets/  # input datasets

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published