Benign overfitting in linear regression. Proceedings of the National Academy of Sciences, 117(48):30063–30070, 2020

Peter L Bartlett, Philip M Long, Gábor Lugosi, Alexander Tsigler · 2020

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

LoRA vs. Full Fine-Tuning: A Theoretical Perspective

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

In linear regression, LoRA can achieve lower excess risk than full fine-tuning when the pretraining-downstream difference is low-rank, and small LoRA ranks can improve generalization by acting as regularization.

Unveiling Memorization-Generalization Coexistence: A Case Study on Arithmetic Tasks with Label Noise

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

Experiments on modular arithmetic with heavy label noise show that over-parameterized networks form a distributed internal generalization structure that can be extracted via frequency methods to achieve high accuracy despite 80% noise.

citing papers explorer

Showing 2 of 2 citing papers.

LoRA vs. Full Fine-Tuning: A Theoretical Perspective · cs.LG · 2026-05-18 · unverdicted · none · ref 2
Unveiling Memorization-Generalization Coexistence: A Case Study on Arithmetic Tasks with Label Noise · cs.LG · 2026-05-18 · unverdicted · none · ref 27
