pith.

archive

Every paper Pith has read. Search by title, abstract, or pith.

2684 papers in stat.ML · page 2

  1. cs.LG 2026-05-20 reviewed
    MMD-balls as credal sets bound worst-case risk in test-time adaptation

    MMD-Balls as Credal Sets: A PAC-Bayesian Framework for Epistemic Uncertainty in Test-Time Adaptation

    Ahanaf Hasan Ariq

  2. cs.LG 2026-05-20 reviewed
    Only full-domain utilities make OCE risk measures PAC-learnable in RL

    On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents

    Oliver Mortensen +1

  3. stat.ML 2026-05-20 reviewed
    Support-aware method certifies ad reserve policies from logs

    Support-aware offline policy selection for advertising marketplaces

    Prashant Shekhar +1

  4. cs.LG 2026-05-20 reviewed
    Representation Gap is governed by task intrinsic dimension

    Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective

    David Perera +4

  5. cs.LG 2026-05-20 reviewed
    Dropout creates two scaling-law classes by activation type

    Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos

    Lucas Fernandez Sarmiento

  6. stat.ME 2026-05-20 reviewed
    Conformal sets identify root-cause stream with finite-sample coverage

    Distribution-free root cause analysis

    Rohan Hore +1

  7. cs.LG 2026-05-20 reviewed
    Amortized noise sampling cuts diffusion teacher variance 10x

    Variance Reduction for Expectations with Diffusion Teachers

    Jesse Bettencourt +4

  8. cs.LG 2026-05-20 reviewed
    Amortized resampling yields 2-3x compute gains for diffusion teachers

    Variance Reduction for Expectations with Diffusion Teachers

    Jesse Bettencourt +4

  9. cs.LG 2026-05-20 reviewed
    Embedding learning rate boost replicates muP transfer

    Quantifying Hyperparameter Transfer and the Importance of Embedding Layer Learning Rate

    Dayal Singh Kalra +1

  10. physics.geo-ph 2026-05-20 reviewed
    Per-cell dispersion cuts tail forecast error 12.5 percent

    Neural Negative Binomial Regression for Weekly Seismicity Forecasting: Per-Cell Dispersion Estimation and Tail Risk Assessment

    Alim Igilik

  11. stat.ML 2026-05-20 reviewed
    Models converge without recovering main latent factors

    Memorisation, convergence and generalisation in generative models

    Antoine Maillard +1

  12. cs.LG 2026-05-20 reviewed
    Transport maps to PDE measures are Hölder continuous

    On the Regularity and Generalization of One-Step Wasserstein-guided Generative Models for PDE-Induced Measures

    Likun Lin +3

  13. math.ST 2026-05-20 reviewed
    L2 over Wasserstein gives random measures Riemannian geometry

    $L^2$ over Wasserstein: Statistical Analysis for Optimal Transport

    Riccardo Passeggeri +2

  14. stat.ML 2026-05-20 reviewed
    Debiasing fixes bias in bilevel hypergradients

    Semiparametric Efficient Bilevel Gradient Estimation

    Fares El Khoury +4

  15. stat.ML 2026-05-20 reviewed
    Large learning rates alter transformer attractors to cycles and chaos

    Large-Step Training Dynamics of a Two-Factor Linear Transformer Model

    Krishnakumar Balasubramanian

  16. stat.ML 2026-05-20 reviewed
    Wasserstein bounds set tuning rules for annealed Langevin in SBI

    Theoretical guidelines for annealed Langevin dynamics in compositional simulation-based inference

    Camille Touron +3

  17. stat.ML 2026-05-20 reviewed
    Decomposition recovers shared LoRA subspace across clients

    Federated LoRA Fine-Tuning for LLMs via Collaborative Alignment

    Shuaida He +2

  18. stat.ML 2026-05-20 reviewed
    Adaptive batch scaling unlocks large-batch RL

    Scalable Reinforcement Learning via Adaptive Batch Scaling

    Jongchan Park

  19. stat.ML 2026-05-20 reviewed
    Gradient similarities unify measures of model complexity

    A Rigorous, Tractable Measure of Model Complexity

    Oskar Allerbo +1

  20. cs.LG 2026-05-20 reviewed
    Projection algorithm reduces constraint violations to O(log T)

    Improved Guarantees for Constrained Online Convex Optimization via Self-Contraction

    Dhruv Sarkar +1

  21. cs.LG 2026-05-20 reviewed
    Expectation consistency suffices for calibration under covariate shift

    Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift

    Jinzong Dong +2

  22. cs.LG 2026-05-20 reviewed
    Vector quantization builds local calibration maps for multiclass models

    Divide et Calibra: Multiclass Local Calibration via Vector Quantization

    Cesare Barbera +4

  23. stat.ML 2026-05-20 reviewed
    Diffusion link lets GPs condition on text or physics

    Conditioning Gaussian Processes on Almost Anything

    Henry Moss +7

  24. stat.ML 2026-05-20 reviewed
    Local boundary finds valid adjustment sets for causal effects

    Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions

    Zeyu Liu +5

  25. math.PR 2026-05-20 reviewed
    SA error tails range from sub-Gaussian to near-Pareto with Markov noise

    Concentration of General Stochastic Approximation Under Heavy-Tailed Markovian Noise

    Shubhada Agrawal +2

  26. cs.CR 2026-05-20 reviewed
    Frequency regularization lifts attack transfer to closed MLLMs

    Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

    Leitao Yuan +7

  27. cs.LG 2026-05-20 reviewed
    LOSCAR-SGD overlaps local steps with sparse delayed updates

    LOSCAR-SGD: Local SGD with Communication-Computation Overlap and Delay-Corrected Sparse Model Averaging

    Yassine Maziane +3

  28. cs.LG 2026-05-20 reviewed
    Bias correction cuts pretraining loss in AdamW and similar optimizers

    Correcting Stochastic Update Bias in Preconditioned Language Model Optimizers

    Nikhil Nayak +9

  29. stat.ME 2026-05-20 reviewed
    Conformal tests bound false discoveries for every possible threshold

    Everywhere Valid Bounds on False Discovery Proportions in Conformal Inference

    Ziang Song +2

  30. cs.LG 2026-05-20 reviewed
    Decision path flips raise random forest accuracy

    Decision-Path Patterns as Tree Reliability Signals: Path-based Adaptive Weighting for Random Forest Classification

    Youngjoon Park

  31. cs.LG 2026-05-20 reviewed
    Decision-path flips yield unbiased per-sample weights for random forests

    Decision-Path Patterns as Tree Reliability Signals: Path-based Adaptive Weighting for Random Forest Classification

    Youngjoon Park

  32. cs.CL 2026-05-20 reviewed
    Agreement screening yields clearer text features at full accuracy

    Interpretable Discriminative Text Representations via Agreement and Label Disentanglement

    Tong Wang +2

  33. cs.LG 2026-05-20 reviewed
    Localization method builds Transformers from local kernels

    The General Theory of Localization Methods

    Congwei Song

  34. cs.LG 2026-05-20 reviewed
    CDF inversion fixes uneven Pareto front sampling

    SURF: Steering the Scalarization Weight to Uniformly Traverse the Pareto Front

    Liuyuan Jiang +2

  35. cs.LG 2026-05-20 reviewed
    Unlearning by shifting erased points to retained semantic neighbors

    Approximate Machine Unlearning through Manifold Representation Forgetting Guided by Self Mode Connectivity

    Weiqi Wang +4

  36. stat.ML 2026-05-20 reviewed
    Adaptive kernels and LOOCV improve RBF KAN models

    Adaptive RBF-KAN: A Comparative Evaluation of Dynamic Shape Parameters in Kolmogorov-Arnold Networks

    Roberto Cavoretto +3

  37. stat.ML 2026-05-19 reviewed
    Overlapping nuclear norms recover subgroup low-rank geometry

    Group-Aware Matrix Estimation and Latent Subspace Recovery

    Hamza Golubovic +3

  38. stat.ML 2026-05-19 reviewed
    Bandits learn smooth graph payoffs scaling only with effective dimension

    Spectral bandits for smooth graph functions with applications in recommender systems

    Tomáš Kocák +4

  39. cs.LG 2026-05-19 reviewed
    Learn image-space generators matching latent-process marginals

    Latent Process Generator Matching

    Lukas Billera +2

  40. stat.ML 2026-05-19 reviewed
    Transfer learning reaches O(m^(-(α+1)/d)) rate for d>3

    Sample Complexity of Transfer Learning: An Optimal Transport Approach

    Haoyang Cao +3

  41. cs.LG 2026-05-19 reviewed
    Geometric axioms explain neural network mechanisms

    Axiomatizing Neural Networks via Pursuit of Subspaces

    Mehmet Yamac +6

  42. cs.LG 2026-05-19 reviewed
    Neurons encode exact Maxwell solutions for fast sparse field reconstruction

    Fast Reconstruction of Exact Maxwell Dynamics from Sparse Data

    Dan DeGenaro +6

  43. cs.LG 2026-05-19 reviewed
    Min-gate fuses diffusion models to catch all four OOD shifts

    Tippett-minimum Fusion of Representation-space Diffusion Models for Multi-Encoder Out-of-Distribution Detection

    Neelkamal Bhuyan

  44. cs.LG 2026-05-19 reviewed
    Classifier uncertainty narrows conformal intervals by 39% for confident cases

    CASCADE Conformal Prediction: Uncertainty-Adaptive Prediction Intervals for Two-Stage Clinical Decision Support

    Ricardo Diaz-Rincon +3

  45. stat.ML 2026-05-19 reviewed
    Contradiction graph decides VC dimension threshold for any m

    Contradiction Graphs Determine VC Dimension

    Jesse Campbell +2

  46. stat.AP 2026-05-19 reviewed
    Negative random effects group shows 400x larger causal effects

    Understanding Deterioration Random Effects for Causal Discovery in Infrastructure Management

    Takato Yasuno

  47. cs.LG 2026-05-19 reviewed
    Scoring functions recover causal graphs with latent variables

    Score-Based Causal Discovery of Latent Variable Causal Models

    Ignavier Ng +5

  48. cs.LG 2026-05-19 reviewed
    Symmetrized cross-entropy produces unique convex multi-class unhinged loss

    Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels

    Alexandre Lemire Paquin +2

  49. stat.ML 2026-05-19 reviewed
    Importance sampling corrects ILA to recover true posteriors

    Corrected Integrated Laplace Approximation for Bayesian Inference in Latent Gaussian Models

    Jinlin Lai +2

  50. stat.ML 2026-05-19 reviewed
    Post-hoc calibration sharpens GP lower tails for optimization

    Goal-Oriented Lower-Tail Calibration of Gaussian Processes for Bayesian Optimization

    Aurélien Pion +1