pith. sign in

hub

Cer- tified data removal from machine learning models

26 Pith papers cite this work. Polarity classification is still indexing.

26 Pith papers citing it

hub tools

citation-role summary

background 2 baseline 1

citation-polarity summary

clear filters

representative citing papers

Exact Unlearning in Reinforcement Learning

cs.LG · 2026-06-02 · unverdicted · novelty 7.0

For any ρ>0 there exists a ρ-TV-stable RL algorithm for tabular MDPs supporting exact unlearning at expected cost ρ√(ln T) of retraining from scratch, with regret O(H²√(SAT)+H³S²A+H^{2.5}S²A/ρ) and matching lower bound Ω(H√(SAT)+SAH/ρ).

Interference-Aware Multi-Task Unlearning

cs.AI · 2026-05-18 · unverdicted · novelty 7.0

Introduces interference-aware multi-task unlearning with task-aware gradient projection and instance-level gradient orthogonalization, reducing interference scores by 30.3% and 52.9% on vision benchmarks.

ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models

cs.LG · 2026-05-16 · unverdicted · novelty 6.0 · 3 refs

ZeroUnlearn reformulates machine unlearning as knowledge re-mapping via model editing, using multiplicative updates with closed-form solutions for efficient few-shot removal of sensitive representations while preserving utility.

TOFU: A Task of Fictitious Unlearning for LLMs

cs.LG · 2024-01-11 · conditional · novelty 6.0

TOFU is a new benchmark with synthetic profiles and metrics demonstrating that existing unlearning algorithms for LLMs fail to achieve effective forgetting of targeted information.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Interference-Aware Multi-Task Unlearning cs.AI · 2026-05-18 · unverdicted · none · ref 23

    Introduces interference-aware multi-task unlearning with task-aware gradient projection and instance-level gradient orthogonalization, reducing interference scores by 30.3% and 52.9% on vision benchmarks.

  • Exploring Nonlinear Pathway in Parameter Space for Machine Unlearning cs.AI · 2025-05-16 · unverdicted · none · ref 13

    MCU applies mode connectivity to trace nonlinear unlearning pathways in parameter space, adds a parameter mask and adaptive penalty, and produces a range of unlearning models that plug into existing methods.

  • Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment cs.AI · 2023-08-10 · accept · none · ref 203

    Survey organizes LLM trustworthiness into seven categories and 29 sub-categories, measures eight sub-categories on popular models, and finds that more aligned models generally score higher but with varying effectiveness.