archive

Every paper Pith has read. Search by title, abstract, or pith.

14903 papers in cs.LG · page 2

cs.CV 2026-05-22 reviewed

Bootstrapped GRTO unifies RL and tool training for segmentation
B-GRTO: Bootstrapped Group Relative Tool Optimization for Referring Segmentation

Mario Markov +5
cs.LG 2026-05-22 reviewed

Self-generated tests and code co-evolve to match RLVR results
CoSPlay: Cooperative Self-Play at Test-Time with Self-Generated Code and Unit Test

Zhangyi Hu +8
cs.LG 2026-05-22 reviewed

Non-normal operators flag neural network training instabilities
Non-normal spectral signatures of instability in neural network training dynamics

Souvik Ghosh
cs.LG 2026-05-22 reviewed

DSEBO switches subspace dimension on convergence
Automated Random Embedding for Practical Bayesian Optimization with Unknown Effective Dimension

Hong Qian +7
cs.LG 2026-05-22 reviewed

CBANet raises minority recall in aggressive driving detection
CBANet: A Compact Attention-Based CNN-BiLSTM Network for Aggressive Driving Event Detection

Hanadi Alhamdan +3
cs.LG 2026-05-22 reviewed

Static contexts make individual dynamics identifiable from single snapshots
Learning Individual Dynamics from Sparse Cross-Sectional Snapshots

Christian Lagemann +3
cs.LG 2026-05-22 reviewed

S³GNN cuts oversquashing errors up to 10x with 50% fewer parameters
S$^3$GNN: Efficient Global Mixing and Local Message Passing for Long-Range Graph Learning

Dai Shi +6
cs.LG 2026-05-22 reviewed

Time-varying transforms block model extraction in sharded training
Unextractable Protocol Models: Collaborative Training and Inference without Weight Materialization

Alexander Long +7
cs.LG 2026-05-22 reviewed

Hybrid augmentation raises migraine F1 average to 0.862
Class-Dependent Hybrid Data Augmentation for Multiclass Migraine Classification under Severe Class Imbalance

Elvin Som\'on +1
cs.LG 2026-05-22 reviewed

VAE decoder learns to respect non-commutative latent order
Commutator-Induced Uncertainty in VAEs

Tahereh Dehdarirad +3
cs.LG 2026-05-22 reviewed

k-WL cannot distinguish all simple-spectrum graphs for any k
Weisfeiler-Leman Is Incomplete on Simple Spectrum Graphs, so Canonicalize Them

Snir Hordan +2
cs.LG 2026-05-22 reviewed

Onsager-Machlup path transport beats DBVI on large DGP tasks
Onsager-Machlup Posterior Transport for Deep Gaussian Processes

Jian Xu +3
cs.IT 2026-05-22 reviewed

Shortest-path tree cuts in-network training traffic by 70%
Sparse In-Network Learning via Shortest-Path Backpropagation and Finite-Rate Gating

Mohammad Reza Deylam Salehi
cs.LG 2026-05-22 reviewed

Oblique trees gain quadratic approximation via Newton least-squares
Hinge Regression Trees and HRT-Boost: Newton-Optimized Oblique Learning for Compact Tabular Models

Hongyi Li +2
cs.LG 2026-05-22 reviewed

500K trajectories enable BBO foundation models
An Open-Source Training Dataset for Foundation Models for Black-box Optimization

Aaron Klein +3
cs.LG 2026-05-22 reviewed

Reflection symmetry speeds up state-based RL
Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

Shuai Zhen +3
cs.AI 2026-05-22 reviewed

Consistency checks raise LLM multi-agent planning success 9.75%
When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems

Zehao Wang +3
cs.LG 2026-05-22 reviewed

Sample-wise attacks fool TTA while keeping label counts normal
Sample-wise Targeted Adversarial Attacks on Test-time Adaptation

Phuc Duc Nguyen +1
cs.LG 2026-05-22 reviewed

Multi-view probes read model weights more accurately
What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

Eunwoo Heo +2
cs.LG 2026-05-22 reviewed

Quantum circuits in UNet boost wind downscaling metrics
Hybrid Quantum-Classical Corrective Diffusion Modeling for Meteorological Downscaling

Rui Wang +5
cs.LG 2026-05-22 reviewed

PPM maps parametric priors into generative forecasts
Parametric Prior Mapping Framework for Non-stationary Probabilistic Time Series Forecasting

Jinglin Li +3
cs.LG 2026-05-22 reviewed

Convex factors let energy models scale to larger reasoning tasks
Convex Compositional Reasoning Models

Meir Roketlishvili +9
cs.LG 2026-05-22 reviewed

One recursion decomposes every component into paths and token credit
Every Component is a Lookup: Token Attribution and Composition from a Single Decomposition

Po-Kai Chen +2
cs.LG 2026-05-22 reviewed

Preconditioning caps PINN kernel radius independent of coupling
Coupling-Robust Accuracy in Multiphysics Physics Informed Neural Networks via Kronecker-Preconditioned Optimization

Youngjae Park +2
math.OC 2026-05-22 reviewed

Selective dual dispatch improves ambulance response with less fleet use
Selective Ambulance Dispatch Under Contextual Travel-Time Uncertainty

Zikun Lin +2
cs.LG 2026-05-22 reviewed

VAE turns non-Euclidean tasks into measurable space for RL curricula
Curriculum reinforcement learning with measurable task representation learning

Yongyan Wen +5
cs.LG 2026-05-22 reviewed

One-step MeanFlow policy hits SOTA on locomotion tasks
Score-Based One-step MeanFlow Policy Optimization

Kyungyoon Kim +3
cs.LG 2026-05-22 reviewed

Adaptive allocation matches oracle rate for multi-judge LLM scoring
Instance-Optimal Estimation with Multiple LLM Judges on a Budget

Junghyun Lee +4
cs.LG 2026-05-22 reviewed

Prudent-Banker keeps bandit safety at constant cost amid delays
Prudent-Banker: No Extra Fees for Baseline Safety in Adversarial Bandits With and Without Delays

Ting Hu +2
cs.LG 2026-05-22 reviewed

CDM amortizes Twisted SMC for discrete diffusion with under 5% overhead
Contrastive Distribution Matching for Amortized Sequential Monte Carlo in Discrete Diffusion

Jaihoon Kim +5
physics.soc-ph 2026-05-22 reviewed

Spin field model infers traffic phases from trajectories
SpinFlow: A Physics-Informed Spin Field Framework for Traffic Phase Inference and Transition Detection

Haopeng Deng +2
physics.optics 2026-05-22 reviewed

Hybrid search boosts Max-Cut solutions on photonic Ising machines
Accelerating ground state search of spatial photonic Ising machines with genetic-simulated annealing hybrid algorithm

Ze Zheng +8
cs.LG 2026-05-22 reviewed

Reinforcement learning enforces exact graph assortativity without tuning
Reinforcement Learning for Microcanonical Graph Ensemble with Assortativity Constraints

Hoyun Choi +2
eess.IV 2026-05-22 reviewed

Neural operator deblurs varying blur in pathology slides
Discontinuous Galerkin Neural Operator for Pathology Defocus Deblurring

Shaoqing Duan +4
cs.LG 2026-05-22 reviewed

Compact coordinator expands diffusion models to larger domains
Diffusion Domain Expansion: Learning to Coordinate Pre-trained Diffusion Models

Egor Lifar +4
cs.LG 2026-05-22 reviewed

Structural priors fix bad scores for good equations in symbolic regression
When Good Equations Get Bad Scores: Improving Symbolic Regression Through Better Parameter Optimization

Boxiao Wang +7
stat.ML 2026-05-22 reviewed

Joint training avoids error inheritance from weak privileged data
Coupled Training with Privileged Information and Unlabeled Data

Jiahao Shi +2
cs.LG 2026-05-22 reviewed

Multi-gate residuals stabilize deep nets without extra comms cost
Multi-Gate Residuals

Zhizhan Zheng +6
cs.LG 2026-05-22 reviewed

Plug-in adds reconstructability routing to KV cache eviction
A Simple Plug-in for Improving Eviction-Based KV Cache Compression

Yuping Lin +5
cs.LG 2026-05-22 reviewed

Algorithms keep preemptions constant per job using predictions
Learning-Augmented Online Scheduling with Parsimonious Preemption

Mugen Blue +2
cs.LG 2026-05-22 reviewed

RefCal jointly optimizes calibration and refinement in DNNs
Enhancing Deep Neural Network Reliability with Refinement and Calibration

Ramya Hebbalaguppe +3
cs.LG 2026-05-22 reviewed

Neural network predicts DLT times 10-100x faster
Accelerating Divisible Load Processing Through Machine Learning: A Practical Framework for Large-Scale Workloads

Bharadwaj Veeravalli
cs.LG 2026-05-22 reviewed

Convex optimization aligns LLMs on one GPU without reference model
Convex Optimization for Alignment and Preference Learning on a Single GPU

Miria Feng +1
cs.LG 2026-05-22 reviewed

Multi-perspective clustering pre-trains relational database models
RelPrism: A Multi-Faceted Pre-training Framework with Self-Generated Tasks for Relational Databases

Jinyu Yang +6
cs.LG 2026-05-22 reviewed

Purifier cleans adversarial graphs for any GNN
Self-supervised Adversarial Purification for Graph Neural Networks

Woohyun Lee +1
cs.AI 2026-05-22 reviewed

Generated card games expose jagged strategic skills in top LLMs
GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models

Vartan Shadarevian +3
cs.LG 2026-05-22 reviewed

Convex optimization detects accented languages at 97-98% accuracy
Convex Low-resource Accent-Robust Language Detection in Speech Recognition

Miria Feng +2
cs.LG 2026-05-22 reviewed

Movement patterns can expose unfairness in predictive models
Assessing Predictive Models for Fairness Based on Movement Patterns

Francesco Lettich +3
cs.DS 2026-05-22 reviewed

Entropy testing needs fewer samples than closeness testing
Entropy Equivalence Testing

Cl\'ement L. Canonne +3
cs.LG 2026-05-22 reviewed

Automated search yields stronger attacks on world-model agents
WMAttack: Automated Attack Search for Adversarial Evaluation of World-Model Agents

Zhixiang Guo +6