The Utility and Complexity of in- and out-of-Distribution Machine Unlearning

Joshua Kazdan; Rachid Guerraoui; Sanmi Koyejo; Youssef Allouah

arxiv: 2412.09119 · v3 · pith:WQVZ67JMnew · submitted 2024-12-12 · 💻 cs.LG · cs.CR· math.OC

The Utility and Complexity of in- and out-of-Distribution Machine Unlearning

Youssef Allouah , Joshua Kazdan , Rachid Guerraoui , Sanmi Koyejo This is my paper

classification 💻 cs.LG cs.CRmath.OC

keywords dataunlearningcomplexityprivacytimeutilityaddressingdifferential

0 comments

read the original abstract

Machine unlearning, the process of selectively removing data from trained models, is increasingly crucial for addressing privacy concerns and knowledge gaps post-deployment. Despite this importance, existing approaches are often heuristic and lack formal guarantees. In this paper, we analyze the fundamental utility, time, and space complexity trade-offs of approximate unlearning, providing rigorous certification analogous to differential privacy. For in-distribution forget data -- data similar to the retain set -- we show that a surprisingly simple and general procedure, empirical risk minimization with output perturbation, achieves tight unlearning-utility-complexity trade-offs, addressing a previous theoretical gap on the separation from unlearning "for free" via differential privacy, which inherently facilitates the removal of such data. However, such techniques fail with out-of-distribution forget data -- data significantly different from the retain set -- where unlearning time complexity can exceed that of retraining, even for a single sample. To address this, we propose a new robust and noisy gradient descent variant that provably amortizes unlearning time complexity without compromising utility.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data
cs.LG 2026-05 unverdicted novelty 7.0

Asymmetric Langevin Unlearning uses public data to suppress unlearning noise costs by O(1/n_pub²), enabling practical mass unlearning with preserved utility under distribution mismatch.