International Conference on Machine Learning , year=

A convergence theory for deep learning via over-parameterization , author=

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Unveiling High-Probability Generalization in Decentralized SGD

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

High-probability generalization bounds for D-SGD are derived at the optimal rate O(1/sqrt(mn) log(1/δ)) via pointwise uniform stability across convex and non-convex settings.

Rethinking the Rank Threshold for LoRA Fine-Tuning

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

For binary classification in the NTK regime, LoRA rank r=1 suffices and is often optimal under cross-entropy loss, reducing the prior sufficient condition from r>=12.

citing papers explorer

Showing 2 of 2 citing papers.

Unveiling High-Probability Generalization in Decentralized SGD cs.LG · 2026-05-11 · unverdicted · none · ref 121
High-probability generalization bounds for D-SGD are derived at the optimal rate O(1/sqrt(mn) log(1/δ)) via pointwise uniform stability across convex and non-convex settings.
Rethinking the Rank Threshold for LoRA Fine-Tuning cs.LG · 2026-05-05 · unverdicted · none · ref 6
For binary classification in the NTK regime, LoRA rank r=1 suffices and is often optimal under cross-entropy loss, reducing the prior sufficient condition from r>=12.

International Conference on Machine Learning , year=

fields

years

verdicts

representative citing papers

citing papers explorer