Kernel interpolation generalizes poorly

Yicheng Li, Haobo Zhang, Qian Lin · 2023 · DOI 10.1093/biomet/asad048

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

How to Scale Mixture-of-Experts: From muP to the Maximally Scale-Stable Parameterization

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

The authors derive a Maximally Scale-Stable Parameterization (MSSP) for MoE models that achieves robust learning-rate transfer and monotonic performance gains with scale across co-scaling regimes of width, experts, and sparsity.

Optimal differentially private kernel learning with random projection

stat.ML · 2025-07-23 · unverdicted · novelty 7.0

A random-projection differentially private kernel ERM method attains minimax-optimal excess risk bounds for squared and Lipschitz-smooth convex losses under local strong convexity, plus the first dimension-free bounds for objective-perturbation private linear ERM.

citing papers explorer

Showing 2 of 2 citing papers.

How to Scale Mixture-of-Experts: From muP to the Maximally Scale-Stable Parameterization cs.LG · 2026-05-13 · unverdicted · none · ref 144
The authors derive a Maximally Scale-Stable Parameterization (MSSP) for MoE models that achieves robust learning-rate transfer and monotonic performance gains with scale across co-scaling regimes of width, experts, and sparsity.
Optimal differentially private kernel learning with random projection stat.ML · 2025-07-23 · unverdicted · none · ref 36
A random-projection differentially private kernel ERM method attains minimax-optimal excess risk bounds for squared and Lipschitz-smooth convex losses under local strong convexity, plus the first dimension-free bounds for objective-perturbation private linear ERM.

Kernel interpolation generalizes poorly

fields

years

verdicts

representative citing papers

citing papers explorer