Change the repository type filter
All
Repositories list
12 repositories
llm-past-tense
PublicDoes Refusal Training in LLMs Generalize to the Past Tense? [NeurIPS 2024 Safe Generative AI Workshop (Oral)]why-weight-decay
Publicllm-adaptive-attacks
Publicicl-alignment
PublicIs In-Context Learning Sufficient for Instruction Following in LLMs?sam-low-rank-features
Publicsgd-sparse-features
PublicSGD with large step sizes learns sparse features [ICML 2023]tml-epfl.github.io
Publicunderstanding-sam
PublicTowards Understanding Sharpness-Aware Minimization [ICML 2022]adv-training-corruptions
Public- Understanding and Improving Fast Adversarial Training [NeurIPS 2020]