MAGIC: Near-optimal data attribution for deep learning.CoRR, abs/2504.16430

URL https://openreview · arXiv 2504.16430

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

How Faithful Is Trajectory-Based Data Attribution? Error Sources, Remedies, and Practical Guidelines

cs.LG · 2026-05-12 · conditional · novelty 7.0

The paper decomposes errors in trajectory-based data attribution into config, algorithm, and system levels, proposes AdamW-influence to fix optimizer mismatch, derives an error proxy for Taylor approximation, and unifies data selection under a K-step look-ahead framework.

Efficient Estimation of Kernel Surrogate Models for Task Attribution

cs.LG · 2026-02-03 · unverdicted · novelty 6.0

Kernel surrogate models with first-order gradient approximation achieve 25% higher correlation to leave-one-out ground truth for task attribution and 40% better downstream data selection than linear surrogates.

How to sketch a learning algorithm

cs.LG · 2026-04-08 · unverdicted · novelty 5.0

A sketching method based on higher-order derivatives enables efficient data deletion predictions for deep learning models under a stability assumption with near-linear overhead in error and failure parameters.

citing papers explorer

Showing 3 of 3 citing papers.

How Faithful Is Trajectory-Based Data Attribution? Error Sources, Remedies, and Practical Guidelines cs.LG · 2026-05-12 · conditional · none · ref 5
The paper decomposes errors in trajectory-based data attribution into config, algorithm, and system levels, proposes AdamW-influence to fix optimizer mismatch, derives an error proxy for Taylor approximation, and unifies data selection under a K-step look-ahead framework.
Efficient Estimation of Kernel Surrogate Models for Task Attribution cs.LG · 2026-02-03 · unverdicted · none · ref 5
Kernel surrogate models with first-order gradient approximation achieve 25% higher correlation to leave-one-out ground truth for task attribution and 40% better downstream data selection than linear surrogates.
How to sketch a learning algorithm cs.LG · 2026-04-08 · unverdicted · none · ref 10
A sketching method based on higher-order derivatives enables efficient data deletion predictions for deep learning models under a stability assumption with near-linear overhead in error and failure parameters.

MAGIC: Near-optimal data attribution for deep learning.CoRR, abs/2504.16430

fields

years

verdicts

representative citing papers

citing papers explorer