arXiv cs.LG·1 June 2026

MAAT: Multi-phase Adapter-Aware Targeted Unlearning

Signal

Hype

In three lines5WBENCH, a balanced 5,000-sample benchmark across 5W categories, reveals unlearning methods fail on causal (Why) questions. MAAT, a three-phase framework operating on LoRA weights, combines gradient-projected ascent, SVD rank pruning, and KL-hidden-state repair to simultaneously achieve high forgetting and retention on causal knowledge.

Read source

Your take?

Fine-tuning AI safety Alignment Benchmarks Papers

Summary generated by Claude — human-verified

MAAT: Multi-phase Adapter-Aware Targeted Unlearning

Other angles on this story