Skip to content
/ ClamPy Public

Sparse AutoEncoders for Clamping LLM Behavior. Inspired by Anthropic.

Notifications You must be signed in to change notification settings

MDK8888/ClamPy

Repository files navigation

LIVE DEMO

This is Clampy, a project based off of Anthropic's paper 'Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet'.

About

Sparse AutoEncoders for Clamping LLM Behavior. Inspired by Anthropic.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published