ALU uses public data to suppress unlearning cost quadratically while characterizing distribution mismatch effects, enabling mass unlearning with maintained utility.
Snap: Unlearning selective knowledge in large language models with negative instructions.arXiv preprint arXiv:2406.12329
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it