Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs
Signal
78
Hype
15
In three linesStudy showing that unlearning in LLMs merely suppresses information at surface level—models recover original behavior through minimal fine-tuning. Authors introduce representation-level analysis framework (PCA, CKA, Fisher information) to assess genuine data erasure and identify four forgetting regimes based on reversibility and catastrophicity.Read source
Your take?
Summary generated by Claude — human-verified