Code for the paper "Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models".
The code is based on that of Wang et al.; I fixed some bugs, made inference more efficient, and added model parallelism so the model can run across multiple GPUs.
To use it, just run the Jupyter notebook.
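
For readers unfamiliar with model parallelism, here is a minimal sketch of the general idea: the model's layers are split into contiguous blocks and each block is placed on a different GPU. This is an illustration only, not the code in this repository; the function name `assign_layers_to_devices` and the device strings are hypothetical.

```python
from typing import Dict, List


def assign_layers_to_devices(num_layers: int, devices: List[str]) -> Dict[int, str]:
    """Map each layer index to a device in contiguous, near-equal blocks.

    Hypothetical helper for illustration; not part of this repository.
    """
    if num_layers < 0:
        raise ValueError("num_layers must be non-negative")
    if not devices:
        raise ValueError("at least one device is required")

    # Every device gets `base` layers; the first `extra` devices get one more.
    base, extra = divmod(num_layers, len(devices))
    placement: Dict[int, str] = {}
    layer = 0
    for i, device in enumerate(devices):
        count = base + (1 if i < extra else 0)
        for _ in range(count):
            placement[layer] = device
            layer += 1
    return placement


if __name__ == "__main__":
    # Example: a 12-layer model split over two GPUs (device names are assumed).
    plan = assign_layers_to_devices(12, ["cuda:0", "cuda:1"])
    for layer_idx, device in plan.items():
        print(f"layer {layer_idx} -> {device}")
```

In a PyTorch setup, each layer would then typically be moved to its assigned device with `.to(device)`, and the activations moved to the next device at each block boundary during the forward pass.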