arXiv preprint arXiv:2302.06960 , year=

Data pruning, neural scaling laws: fundamental limitations of score-based algorithms , author= · 2023 · arXiv 2302.06960

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

On-Policy Self-Distillation with Sampled Demonstrations Reduces Output Diversity

cs.LG · 2026-06-24 · unverdicted · novelty 6.0

On-policy self-distillation with sampled demonstrations reduces rollout diversity by amplifying existing probability gaps in the base model, unlike ideal RL which preserves ratios among correct outputs.

Data Selection Through Iterative Self-Filtering for Vision-Language Settings

cs.CV · 2026-06-22 · unverdicted · novelty 5.0

An iterative bootstrapped self-filtering approach selects balanced clean and diverse subsets from noisy vision-language datasets to train improved CLIP models.

OrderDP: A Theoretically Guaranteed Lossless Dynamic Data Pruning Framework

cs.LG · 2026-06-07 · unverdicted · novelty 5.0

OrderDP is a plug-and-play data pruning method that selects a random subset then top-q samples to guarantee unbiased surrogate-loss training with convergence analysis and over 40% training cost reduction on CIFAR and ImageNet.

citing papers explorer

Showing 3 of 3 citing papers.

On-Policy Self-Distillation with Sampled Demonstrations Reduces Output Diversity cs.LG · 2026-06-24 · unverdicted · none · ref 212
On-policy self-distillation with sampled demonstrations reduces rollout diversity by amplifying existing probability gaps in the base model, unlike ideal RL which preserves ratios among correct outputs.
Data Selection Through Iterative Self-Filtering for Vision-Language Settings cs.CV · 2026-06-22 · unverdicted · none · ref 191
An iterative bootstrapped self-filtering approach selects balanced clean and diverse subsets from noisy vision-language datasets to train improved CLIP models.
OrderDP: A Theoretically Guaranteed Lossless Dynamic Data Pruning Framework cs.LG · 2026-06-07 · unverdicted · none · ref 79
OrderDP is a plug-and-play data pruning method that selects a random subset then top-q samples to guarantee unbiased surrogate-loss training with convergence analysis and over 40% training cost reduction on CIFAR and ImageNet.

arXiv preprint arXiv:2302.06960 , year=

fields

years

verdicts

representative citing papers

citing papers explorer