The repository contains the code accompanying the paper You can remove GPT2's LayerNorm by fine-tuning. See our HuggingFace repository for models with removed LayerNorm. The code is based on karpathy/nanoGPT, with just some small changes to the model and training script.
forked from karpathy/nanoGPT
-
Notifications
You must be signed in to change notification settings - Fork 1
A GPT2 fine-tuning script to remove LayerNorm layers. Based on karpathy/nanoGPT
License
ApolloResearch/gpt2_noLN
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A GPT2 fine-tuning script to remove LayerNorm layers. Based on karpathy/nanoGPT
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Python 100.0%