Certified data removal from machine learning models

Chuan Guo, Tom Goldstein, Awni Hannun, Laurens van der Maaten · 2019 · arXiv 1911.03030

26 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 2 · baseline: 1

citation-polarity summary

background: 2 · baseline: 1

representative citing papers

Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning

cs.LG · 2024-04-08 · conditional · novelty 8.0

NPO enables stable unlearning of 50%+ training data in LLMs on TOFU by making collapse exponentially slower than gradient ascent, preserving sensible outputs where prior methods fail.

TRACER: Token ReAssignment for Concept ERasure in Generative Recommendation

cs.IR · 2026-06-05 · unverdicted · novelty 7.0

TRACER uses token reassignment for concept-related items plus a coherence regularizer to unlearn specific concepts in generative recommendation while preserving utility better than baselines.

Exact Unlearning in Reinforcement Learning

cs.LG · 2026-06-02 · unverdicted · novelty 7.0

For any ρ>0 there exists a ρ-TV-stable RL algorithm for tabular MDPs supporting exact unlearning at expected cost ρ√(ln T) of retraining from scratch, with regret O(H²√(SAT)+H³S²A+H^{2.5}S²A/ρ) and matching lower bound Ω(H√(SAT)+SAH/ρ).

Initialization is Half the Battle: Generating Diverse Images from a Guidance Potential Posterior

cs.CV · 2026-06-01 · unverdicted · novelty 7.0

DivIn samples initial noise from a guidance potential posterior via Langevin dynamics to improve diversity in class-to-image and text-to-image generation.

Interference-Aware Multi-Task Unlearning

cs.AI · 2026-05-18 · unverdicted · novelty 7.0

Introduces interference-aware multi-task unlearning with task-aware gradient projection and instance-level gradient orthogonalization, reducing interference scores by 30.3% and 52.9% on vision benchmarks.

Erase Persona, Forget Lore: Benchmarking Multimodal Copyright Unlearning in Large Vision Language Models

cs.CV · 2026-05-05 · unverdicted · novelty 7.0

CoVUBench is the first benchmark framework for evaluating multimodal copyright unlearning in LVLMs via synthetic data, systematic variations, and a dual protocol for forgetting efficacy and utility preservation.

Shape of Memory: a Geometric Analysis of Machine Unlearning in Second-Order Optimizers

cs.LG · 2026-04-24 · unverdicted · novelty 7.0

Second-order optimizers retain residual geometric memory in their state after unlearning that first-order metrics miss, and only controlled eigendecay perturbations fully erase it.

ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models

cs.LG · 2026-05-16 · unverdicted · novelty 6.0 · 3 refs

ZeroUnlearn reformulates machine unlearning as knowledge re-mapping via model editing, using multiplicative updates with closed-form solutions for efficient few-shot removal of sensitive representations while preserving utility.

Representation-Guided Parameter-Efficient LLM Unlearning

cs.CL · 2026-04-19 · unverdicted · novelty 6.0

REGLU guides LoRA-based unlearning via representation subspaces and orthogonal regularization to outperform prior methods on forget-retain trade-off in LLM benchmarks.

WIN-U: Woodbury-Informed Newton-Unlearning as a retain-free Machine Unlearning Framework

cs.LG · 2026-04-15 · unverdicted · novelty 6.0

WIN-U delivers a retain-free unlearning update that approximates the gold-standard retrained model via a Woodbury-informed Newton step using only forget-set curvature information.

PrivEraserVerify: Efficient, Private, and Verifiable Federated Unlearning

cs.LG · 2026-04-14 · unverdicted · novelty 6.0

PrivEraserVerify unifies efficiency via adaptive checkpointing, privacy via layer-adaptive DP, and verifiability via fingerprints in federated unlearning, claiming 2-3x faster performance than retraining with formal guarantees.

Label Leakage Attacks in Machine Unlearning: A Parameter and Inversion-Based Approach

cs.CR · 2026-04-08 · unverdicted · novelty 6.0

Parameter-difference and model-inversion attacks can identify forgotten classes after machine unlearning on standard image datasets.

Jellyfish: Zero-Shot Federated Unlearning Scheme with Knowledge Disentanglement

cs.CR · 2026-04-05 · unverdicted · novelty 6.0

Jellyfish enables zero-shot federated unlearning through synthetic proxy data generation, channel-restricted knowledge disentanglement, and a composite loss with repair to forget target data while retaining model utility.

Forget-It-All: Multi-Concept Machine Unlearning via Concept-Aware Neuron Masking

cs.CV · 2026-01-07 · unverdicted · novelty 6.0

FIA uses contrastive concept saliency and temporal-spatial neuron identification to build unified masks that erase multiple target concepts while preserving general generation quality in diffusion models.

POUR: A Provably Optimal Method for Unlearning Representations via Neural Collapse

cs.CV · 2025-11-24 · unverdicted · novelty 6.0

POUR derives a provably optimal forgetting operator by showing that orthogonal projections of simplex equiangular tight frames remain ETFs in lower dimensions, enabling representation-level unlearning with closed-form and distillation variants.

Exploring Nonlinear Pathway in Parameter Space for Machine Unlearning

cs.AI · 2025-05-16 · unverdicted · novelty 6.0

MCU applies mode connectivity to trace nonlinear unlearning pathways in parameter space, adds a parameter mask and adaptive penalty, and produces a range of unlearning models that plug into existing methods.

TOFU: A Task of Fictitious Unlearning for LLMs

cs.LG · 2024-01-11 · conditional · novelty 6.0

TOFU is a new benchmark with synthetic profiles and metrics demonstrating that existing unlearning algorithms for LLMs fail to achieve effective forgetting of targeted information.

SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation

cs.LG · 2023-10-19 · conditional · novelty 6.0

SalUn uses gradient-based weight saliency to achieve effective machine unlearning of data, classes, or concepts in image classification and generation, narrowing the gap to exact retraining.

TrustErase: Auditable Instant Machine Unlearning with Passport-Embedded Representations

cs.CR · 2026-06-15 · unverdicted · novelty 5.0

TrustErase uses passport-embedded representations for instant, data-free, and auditable machine unlearning through simple deactivation of adaptation layers.

Incentivizing User Data Contributions for LLM Improvement under Withdrawal Rights

cs.GT · 2026-05-08 · unverdicted · novelty 5.0

Withdrawal rights paired with centralized cost-based assignment prevent subsidy waste by collecting data only when the improvement threshold is sustainably reachable, turning infeasible cases into null outcomes.

Forgetting to Witness: Efficient Federated Unlearning and Its Visible Evaluation

cs.LG · 2026-04-06 · unverdicted · novelty 5.0

A complete pipeline for federated unlearning via knowledge distillation for efficient removal and a GAN-integrated classifier for visual evaluation of forgetting capacity.

Machine Unlearning on Pre-trained Models by Residual Feature Alignment Using LoRA

cs.LG · 2024-11-13 · unverdicted · novelty 5.0

A LoRA-based residual feature alignment method for efficient machine unlearning on pre-trained models by targeting zero residuals on retained data and shifted residuals on unlearned data.

AdaProb: Efficient Machine Unlearning via Adaptive Probability

cs.LG · 2024-11-04 · unverdicted · novelty 5.0

AdaProb performs machine unlearning by substituting final-layer output probabilities with optimized uniform pseudo-probabilities and updating model weights.

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

cs.AI · 2023-08-10 · accept · novelty 5.0

Survey organizes LLM trustworthiness into seven categories and 29 sub-categories, measures eight sub-categories on popular models, and finds that more aligned models generally score higher but with varying effectiveness.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Incentivizing User Data Contributions for LLM Improvement under Withdrawal Rights

cs.GT · 2026-05-08 · unverdicted · none · ref 17

Withdrawal rights paired with centralized cost-based assignment prevent subsidy waste by collecting data only when the improvement threshold is sustainably reachable, turning infeasible cases into null outcomes.

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

cs.AI · 2023-08-10 · accept · none · ref 203

Survey organizes LLM trustworthiness into seven categories and 29 sub-categories, measures eight sub-categories on popular models, and finds that more aligned models generally score higher but with varying effectiveness.
