archive

Every paper Pith has read. Search by title, abstract, or pith.

2684 papers in stat.ML · page 3

stat.AP 2026-05-19 reviewed

Semi-parametric BART separates covariates from epigenetic trees
Semi-Parametric Bayesian Additive Regression Trees for Risk Prediction with High-Dimensional Epigenetic Signatures and Low-Dimensional Covariates

Saurabh Bhandari +3
stat.ML 2026-05-19 reviewed

Grid sketch achieves optimal Wasserstein runtime for smooth laws
Optimizing Computational-Statistical Runtime for Wasserstein Distance Estimation

Peter Matthew Jacobs +1
stat.ML 2026-05-19 reviewed

Soft-log transform lets flow matching handle heavy tails
Tail Annealing for Heavy-Tailed Flow Matching

Jean Pachebat
stat.ME 2026-05-19 reviewed

Gated estimator cuts manifold density error by 22-36%
Variance-Reduced Manifold Sampling via Polynomial-Maximization Density Estimation

Serhii Zabolotnii
cs.LG 2026-05-19 reviewed

Domain cuts let neural operators handle PDE discontinuities
Smooth Piecewise Cutting for Neural Operator to Handle Discontinuities and Sharp Transitions

Ha Dang +2
cs.LG 2026-05-19 reviewed

Benchmark separates ML models on flux extrapolation via tail errors
FLUXtrapolation: A benchmark on extrapolating ecosystem fluxes

Anya Fries +4
cs.LG 2026-05-19 reviewed

Laplace diffusion generates long forecasts for irregular time series
Latent Laplace Diffusion for Irregular Multivariate Time Series

Zinuo You +2
cs.AI 2026-05-19 reviewed

Variance-aware regret bound proven optimal for logistic MDPs
Minimax Optimal Variance-Aware Regret Bounds for Multinomial Logistic MDPs

Pierre Boudart (SIERRA) +4
cs.AI 2026-05-19 reviewed

Benchmark shows attention models scale better than RNNs on sequences
CogScale: Scalable Benchmark for Sequence Processing

Yannis Bendi-Ouis (Mnemosyne) +2
stat.ML 2026-05-19 reviewed

Diffusion copula turns simultaneous crashes into expected events
Probabilistic Multivariate Time Series Forecasting with Diffusion Copulas

David Huk +2
stat.ML 2026-05-19 reviewed

Federated stochastic approximation gets explicit Gaussian error bounds
Gaussian Approximation and Multiplier Bootstrap for Federated Linear Stochastic Approximation

Ilya Levin +4
cs.LG 2026-05-19 reviewed

MiMuon reaches O(1/N) generalization bound for matrix models
MiMuon: Mixed Muon Optimizer with Improved Generalization for Large Models

Feihu Huang +2
stat.ML 2026-05-19 reviewed

Lévy B-spline posterior contracts near minimax rates in Besov spaces
Posterior Contraction of L\'evy Adaptive B-spline Regression in Besov Spaces

Jeunghun Oh +2
cs.LG 2026-05-19 reviewed

Order-book no-trades yield square-root regret in market making
Online Market Making and the Value of Observing the Order Book

Davide Maran +1
stat.ML 2026-05-19 reviewed

Density ratios enable adjustable post-hoc deferral
Density-Ratio Losses for Post-Hoc Learning to Defer

Alexander Soen +3
stat.ML 2026-05-19 reviewed

Tweedie formulae now cover non-Gaussian diffusions
Tweedie's Formulae and Diffusion Generative Models Beyond Gaussian

Wenpin Tang +3
cs.CL 2026-05-19 reviewed

Benchmark labels hallucinations via explicit reference worlds
HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models

Emmy Liu +6

5 Piths
q-bio.QM 2026-05-19 reviewed

Protein Thoughts ranks true binders at mean position 11.2
Protein Thoughts: Interpretable Reasoning with Tree of Thoughts and Embedding-Space Flow Matching for Protein-Protein Interaction Discovery

Kingsley Yeon +2
stat.ML 2026-05-19 reviewed

Method clusters subjects and learns their distinct causal graphs
A Unified Framework for Structure-Aware Clustering and Heterogeneous Causal Graph Learning

Honglin Du +2
stat.ML 2026-05-19 reviewed

Factor-augmented SGD converges with streaming high-dimensional data
Factor Augmented High-Dimensional SGD

Shubo Li +2
cs.LG 2026-05-19 reviewed

Trajectory selection beats sampling in delayed disambiguation
EviTrack: Selection over Sampling for Delayed Disambiguation

Omer Haq
cs.LG 2026-05-19 reviewed

Regime gate improves time series forecast accuracy under shifts
DeRegiME: Deep Regime Mixtures for Probabilistic Forecasting under Distribution Shift

Kieran Wood +2
stat.AP 2026-05-19 reviewed

RL on All of Us data prescribes steadier higher daily steps
Precision Physical Activity Prescription via Reinforcement Learning for Functional Actions

Gefei Lin +3
cond-mat.stat-mech 2026-05-18 reviewed

Thermodynamic bound sets optimal dataset size for linear regression
The Thermodynamic Costs of Simple Linear Regression

Samuel H. D'Ambrosia +3
stat.ML 2026-05-18 reviewed

Multi-head attention error falls as subspaces decorrelate
Multi-Head Attention as Ensemble Nadaraya-Watson Estimation: Variance Reduction, Decorrelation, and Optimal Head Diversity

Ernest Fokou\'e
stat.ML 2026-05-18 reviewed

Higher-order Langevin dynamics reduce memorization in diffusion models
Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

Benjamin Sterling +2
cs.LG 2026-05-18 reviewed

Wrapper gives pathwise risk control for updating LLMs
Conformal Selective Acting: Anytime-Valid Risk Control for RLVR-Trained LLMs

Hamed Khosravi +1

4 Piths
stat.ML 2026-05-18 reviewed

Total capacity of stationary physical systems predicts ML performance
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration

Rahul Uma Ramachandran +1
stat.ML 2026-05-18 reviewed

Total IPC of stationary systems bounds to readout count and predicts ML results
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration

Rahul Uma Ramachandran +1
cs.LG 2026-05-18 reviewed

Low-rank bandits recover drifting subspaces from scalar rewards
Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity

Hamed Khosravi +1

4 Piths
stat.ML 2026-05-18 reviewed

Dual-channel networks select tensor structures with finite-sample guarantees
Dual-Channel Tensor Neural Networks: Finite-Sample Theory and Conformal Structure Selection

Elynn Chen +3
stat.ME 2026-05-18 reviewed

Greedy method learns optimal integer clinical risk scores directly
Learning Interpretable Point-Based Clinical Risk Scores via Direct Optimization

Ying Cui +5
cs.LG 2026-05-18 reviewed

ScheduleFree+ beats WSD schedules on long LLM pretraining
ScheduleFree+: Scaling Learning-Rate-Free & Schedule-Free Learning to Large Language Models

Aaron Defazio
stat.ML 2026-05-18 reviewed

Learned multipliers achieve optimal Theta(s/sqrt(N)) rate
Provably Data-driven Lagrangian Relaxation for Mixed Integer Linear Programming

Tung Quoc Le +2

4 Piths
stat.ML 2026-05-18 reviewed

Beta law tracks conformal coverage under dependence
Conformal Prediction via Transported Beta Laws

Thiago R. Ramos +2
cs.LG 2026-05-18 reviewed

Transformer model lowers earnings forecast error by 32 percent at ten years
SAGA: A Sequence-Adaptive Generative Architecture for Multi-Horizon Probabilistic Forecasting with Adaptive Temporal Conformal Prediction

Gustav Olaf Yunus Laitinen-Fredriksson Lundstr\"om-Imanov +1
stat.ME 2026-05-18 reviewed

Categorical confounder makes causal effects identifiable from proxies or multiple tests
Causal Inference with Categorical Unobserved Confounder via Mixture Learning

Aytijhya Saha +2
stat.ML 2026-05-18 reviewed

Girsanov path weights recover exact particle SMC for diffusion guidance
SURGE: Approximation and Training Free Particle Filter for Diffusion Surrogate

Lifu Wei +3
math.OC 2026-05-18 reviewed

AdaGrad converges under heavy-tailed noise without knowing the tail index
Can Adaptive Gradient Methods Converge under Heavy-Tailed Noise? A Case Study of AdaGrad

Zijian Liu
stat.ML 2026-05-18 reviewed

FedNewton matches SGD accuracy with fewer rounds under privacy
Statistical Limits and Efficient Algorithms for Differentially Private Federated Learning

Arnab Auddy +2
stat.ME 2026-05-18 reviewed

Weighted DAG aggregation stabilizes causal discovery
Stable Causal Discovery via Directed Acyclic Graph Aggregation

Yunan Wu +3
cs.LG 2026-05-18 reviewed

Embeddings let federated clients achieve centralized Bayesian uncertainty
Federated Martingale Posterior Samping

Boning Zhang +4
cs.LG 2026-05-18 reviewed

Manifold probe reveals how models encode time and space
Probing for Representation Manifolds in Superposition

Alexander Modell
cs.CL 2026-05-18 reviewed

Continuous diffusion scales to 20x compute gap of autoregressive models
Continuous Diffusion Scales Competitively with Discrete Diffusion for Language

Zhihan Yang +7
stat.ML 2026-05-18 reviewed

Flow models gain per-sample confidence at standard sampling cost
Flowing with Confidence

Friso de Kruiff +3
stat.ML 2026-05-18 reviewed

Shallow ReLU^s networks beat random features below critical p
Shallow ReLU$^s$ Networks in $L^p$-Type and Sobolev Spaces: Approximation and Path-Norm Controlled Generalization

Weizhao Li +2
stat.ML 2026-05-18 reviewed

Path-norm ReLU^s nets match minimax regression rates
Shallow ReLU$^s$ Networks in $L^p$-Type and Sobolev Spaces: Approximation and Path-Norm Controlled Generalization

Weizhao Li +2
stat.ML 2026-05-18 reviewed

Path-norm ReLU nets hit minimax rates in regression
Shallow ReLU$^s$ Networks in $L^p$-Type and Sobolev Spaces: Approximation and Path-Norm Controlled Generalization

Weizhao Li +2
stat.ML 2026-05-18 reviewed

Markov Chain Decoders Fix Heavy-Tail Limits in VAEs
Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

Abdelhakim Ziani +2
cs.LG 2026-05-18 reviewed

Closed-form policy optimizes allocation in censored survival trials
Adaptive Experimentation for Censored Survival Outcomes

Yuxin Wang +5