archive

Every paper Pith has read. Search by title, abstract, or pith.

2684 papers in stat.ML · page 2

cs.LG 2026-05-20 reviewed

MMD-balls as credal sets bound worst-case risk in test-time adaptation
MMD-Balls as Credal Sets: A PAC-Bayesian Framework for Epistemic Uncertainty in Test-Time Adaptation

Ahanaf Hasan Ariq
cs.LG 2026-05-20 reviewed

Only full-domain utilities make OCE risk measures PAC-learnable in RL
On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents

Oliver Mortensen +1
stat.ML 2026-05-20 reviewed

Support-aware method certifies ad reserve policies from logs
Support-aware offline policy selection for advertising marketplaces

Prashant Shekhar +1
cs.LG 2026-05-20 reviewed

Representation Gap is governed by task intrinsic dimension
Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective

David Perera +4
cs.LG 2026-05-20 reviewed

Dropout creates two scaling-law classes by activation type
Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos

Lucas Fernandez Sarmiento
stat.ME 2026-05-20 reviewed

Conformal sets identify root-cause stream with finite-sample coverage
Distribution-free root cause analysis

Rohan Hore +1
cs.LG 2026-05-20 reviewed

Amortized noise sampling cuts diffusion teacher variance 10x
Variance Reduction for Expectations with Diffusion Teachers

Jesse Bettencourt +4
cs.LG 2026-05-20 reviewed

Amortized resampling yields 2-3x compute gains for diffusion teachers
Variance Reduction for Expectations with Diffusion Teachers

Jesse Bettencourt +4
cs.LG 2026-05-20 reviewed

Embedding learning rate boost replicates muP transfer
Quantifying Hyperparameter Transfer and the Importance of Embedding Layer Learning Rate

Dayal Singh Kalra +1
physics.geo-ph 2026-05-20 reviewed

Per-cell dispersion cuts tail forecast error 12.5 percent
Neural Negative Binomial Regression for Weekly Seismicity Forecasting: Per-Cell Dispersion Estimation and Tail Risk Assessment

Alim Igilik
stat.ML 2026-05-20 reviewed

Models converge without recovering main latent factors
Memorisation, convergence and generalisation in generative models

Antoine Maillard +1
cs.LG 2026-05-20 reviewed

Transport maps to PDE measures are Hölder continuous
On the Regularity and Generalization of One-Step Wasserstein-guided Generative Models for PDE-Induced Measures

Likun Lin +3
math.ST 2026-05-20 reviewed

L2 over Wasserstein gives random measures Riemannian geometry
$L^2$ over Wasserstein: Statistical Analysis for Optimal Transport

Riccardo Passeggeri +2
stat.ML 2026-05-20 reviewed

Debiasing fixes bias in bilevel hypergradients
Semiparametric Efficient Bilevel Gradient Estimation

Fares El Khoury +4
stat.ML 2026-05-20 reviewed

Large learning rates alter transformer attractors to cycles and chaos
Large-Step Training Dynamics of a Two-Factor Linear Transformer Model

Krishnakumar Balasubramanian
stat.ML 2026-05-20 reviewed

Wasserstein bounds set tuning rules for annealed Langevin in SBI
Theoretical guidelines for annealed Langevin dynamics in compositional simulation-based inference

Camille Touron +3
stat.ML 2026-05-20 reviewed

Decomposition recovers shared LoRA subspace across clients
Federated LoRA Fine-Tuning for LLMs via Collaborative Alignment

Shuaida He +2
stat.ML 2026-05-20 reviewed

Adaptive batch scaling unlocks large-batch RL
Scalable Reinforcement Learning via Adaptive Batch Scaling

Jongchan Park
stat.ML 2026-05-20 reviewed

Gradient similarities unify measures of model complexity
A Rigorous, Tractable Measure of Model Complexity

Oskar Allerbo +1
cs.LG 2026-05-20 reviewed

Projection algorithm reduces constraint violations to O(log T)
Improved Guarantees for Constrained Online Convex Optimization via Self-Contraction

Dhruv Sarkar +1
cs.LG 2026-05-20 reviewed

Expectation consistency suffices for calibration under covariate shift
Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift

Jinzong Dong +2
cs.LG 2026-05-20 reviewed

Vector quantization builds local calibration maps for multiclass models
Divide et Calibra: Multiclass Local Calibration via Vector Quantization

Cesare Barbera +4
stat.ML 2026-05-20 reviewed

Diffusion link lets GPs condition on text or physics
Conditioning Gaussian Processes on Almost Anything

Henry Moss +7
stat.ML 2026-05-20 reviewed

Local boundary finds valid adjustment sets for causal effects
Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions

Zeyu Liu +5
math.PR 2026-05-20 reviewed

SA error tails range from sub-Gaussian to near-Pareto with Markov noise
Concentration of General Stochastic Approximation Under Heavy-Tailed Markovian Noise

Shubhada Agrawal +2
cs.CR 2026-05-20 reviewed

Frequency regularization lifts attack transfer to closed MLLMs
Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

Leitao Yuan +7
cs.LG 2026-05-20 reviewed

LOSCAR-SGD overlaps local steps with sparse delayed updates
LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging

Yassine Maziane +3
cs.LG 2026-05-20 reviewed

Bias correction cuts pretraining loss in AdamW and similar optimizers
Correcting Stochastic Update Bias in Preconditioned Language Model Optimizers

Nikhil Nayak +9
stat.ME 2026-05-20 reviewed

Conformal tests bound false discoveries for every possible threshold
Everywhere Valid Bounds on False Discovery Proportions in Conformal Inference

Ziang Song +2
cs.LG 2026-05-20 reviewed

Decision path flips raise random forest accuracy
Decision-Path Patterns as Tree Reliability Signals: Path-based Adaptive Weighting for Random Forest Classification

Youngjoon Park
cs.LG 2026-05-20 reviewed

Decision-path flips yield unbiased per-sample weights for random forests
Decision-Path Patterns as Tree Reliability Signals: Path-based Adaptive Weighting for Random Forest Classification

Youngjoon Park
cs.CL 2026-05-20 reviewed

Agreement screening yields clearer text features at full accuracy
Interpretable Discriminative Text Representations via Agreement and Label Disentanglement

Tong Wang +2
cs.LG 2026-05-20 reviewed

Localization method builds Transformers from local kernels
The General Theory of Localization Methods

Congwei Song
cs.LG 2026-05-20 reviewed

CDF inversion fixes uneven Pareto front sampling
SURF: Steering the Scalarization Weight to Uniformly Traverse the Pareto Front

Liuyuan Jiang +2
cs.LG 2026-05-20 reviewed

Unlearning by shifting erased points to retained semantic neighbors
Approximate Machine Unlearning through Manifold Representation Forgetting Guided by Self Mode Connectivity

Weiqi Wang +4
stat.ML 2026-05-20 reviewed

Adaptive kernels and LOOCV improve RBF KAN models
Adaptive RBF-KAN: A Comparative Evaluation of Dynamic Shape Parameters in Kolmogorov-Arnold Networks

Roberto Cavoretto +3
stat.ML 2026-05-19 reviewed

Overlapping nuclear norms recover subgroup low-rank geometry
Group-Aware Matrix Estimation and Latent Subspace Recovery

Hamza Golubovic +3
stat.ML 2026-05-19 reviewed

Bandits learn smooth graph payoffs scaling only with effective dimension
Spectral bandits for smooth graph functions with applications in recommender systems

Tom\'a\v{s} Koc\'ak +4
cs.LG 2026-05-19 reviewed

Learn image-space generators matching latent-process marginals
Latent Process Generator Matching

Lukas Billera +2
stat.ML 2026-05-19 reviewed

Transfer learning reaches O(m^(-(α+1)/d)) rate for d>3
Sample Complexity of Transfer Learning: An Optimal Transport Approach

Haoyang Cao +3
cs.LG 2026-05-19 reviewed

Geometric axioms explain neural network mechanisms
Axiomatizing Neural Networks via Pursuit of Subspaces

Mehmet Yamac +6
cs.LG 2026-05-19 reviewed

Neurons encode exact Maxwell solutions for fast sparse field reconstruction
Fast Reconstruction of Exact Maxwell Dynamics from Sparse Data

Dan DeGenaro +6
cs.LG 2026-05-19 reviewed

Min-gate fuses diffusion models to catch all four OOD shifts
Tippett-minimum Fusion of Representation-space Diffusion Models for Multi-Encoder Out-of-Distribution Detection

Neelkamal Bhuyan
cs.LG 2026-05-19 reviewed

Classifier uncertainty narrows conformal intervals by 39% for confident cases
CASCADE Conformal Prediction: Uncertainty-Adaptive Prediction Intervals for Two-Stage Clinical Decision Support

Ricardo Diaz-Rincon +3
stat.ML 2026-05-19 reviewed

Contradiction graph decides VC dimension threshold for any m
Contradiction Graphs Determine VC Dimension

Jesse Campbell +2

5 Piths
stat.AP 2026-05-19 reviewed

Negative random effects group shows 400x larger causal effects
Understanding Deterioration Random Effects for Causal Discovery in Infrastructure Management

Takato Yasuno
cs.LG 2026-05-19 reviewed

Scoring functions recover causal graphs with latent variables
Score-Based Causal Discovery of Latent Variable Causal Models

Ignavier Ng +5
cs.LG 2026-05-19 reviewed

Symmetrized cross-entropy produces unique convex multi-class unhinged loss
Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels

Alexandre Lemire Paquin +2
stat.ML 2026-05-19 reviewed

Importance sampling corrects ILA to recover true posteriors
Corrected Integrated Laplace Approximation for Bayesian Inference in Latent Gaussian Models

Jinlin Lai +2
stat.ML 2026-05-19 reviewed

Post-hoc calibration sharpens GP lower tails for optimization
Goal-Oriented Lower-Tail Calibration of Gaussian Processes for Bayesian Optimization

Aur\'elien Pion +1