archive

Every paper Pith has read. Search by title, abstract, or pith.

14903 papers in cs.LG · page 18

cs.IT 2026-05-19 reviewed

Adaptive rates lower energy use in humanoid robot teleoperation
Domain-Adaptive Communication-Rate Optimization for Sim-to-Real Humanoid-Robot Wireless XR Teleoperation

Caolu Xu +5
cs.LG 2026-05-19 reviewed

Decoupled recursion cuts interference in MLLM edits
Modality-Decoupled Online Recursive Editing

Siyuan Li +3
stat.ML 2026-05-19 reviewed

Factor-augmented SGD converges with streaming high-dimensional data
Factor Augmented High-Dimensional SGD

Shubo Li +2
cs.CL 2026-05-19 reviewed

LLMs learn redundant copies of concepts across languages
Language models struggle with compartmentalization

Thomas Vincent Howe +1
cs.LG 2026-05-19 reviewed

Trajectory selection beats sampling in delayed disambiguation
EviTrack: Selection over Sampling for Delayed Disambiguation

Omer Haq
cs.LG 2026-05-19 reviewed

High-pass spectral filter fixes Muon failures in VLA and RLVR
Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR

Chongyu Fan +4
q-fin.PM 2026-05-19 reviewed

Best volatility forecast model differs from best portfolio model
Do Better Volatility Forecasts Lead to Better Portfolios? Evidence from Graph Neural Networks

Rylan Wade
q-fin.PM 2026-05-19 reviewed

Three different models win at forecast error
Do Better Volatility Forecasts Lead to Better Portfolios? Evidence from Graph Neural Networks

Rylan Wade
cs.CL 2026-05-19 reviewed

Modular platform enables concurrent LLM evaluation
OpenCompass: A Universal Evaluation Platform for Large Language Models

Maosong Cao +29
cs.LG 2026-05-19 reviewed

Transformers rewrite non-attention ops as GEMM epilogues
CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Han Guo +6
cs.LG 2026-05-19 reviewed

Transformers rewritten as GEMM epilogue programs
CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Han Guo +6
cs.LG 2026-05-19 reviewed

Small abstract spaces enable RL generalization to larger tasks
Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning

Nasehatul Mustakim +1
cs.LG 2026-05-19 reviewed

GMM curriculum cuts PINN errors on PDEs by up to 98%
From Simple to Complex: Curriculum-Guided Physics-Informed Neural Networks via Gaussian Mixture Models

Jianan Yang +5
cs.LG 2026-05-19 reviewed

Backdoor attack hits near-100% success on masked diffusion LMs
Backdooring Masked Diffusion Language Models

Daniel Yiming Cao +5
cs.LG 2026-05-19 reviewed

Python framework unifies XAI methods for ECG models
ExECG: An Explainable AI Framework for ECG models

Jong-Hwan Jang +1
cs.LG 2026-05-19 reviewed

Proxy of post-target continuations boosts time series forecasts
Beyond Extrapolation: Knowledge Utilization Paradigm with Bidirectional Inspiration for Time Series Forecasting

Liu Chong +5
cs.LG 2026-05-19 reviewed

Local distance graphs recover global Euclidean embeddings
Euclidean Embedding of Data Using Local Distances

Dimitris Arabadjis
cs.CV 2026-05-19 reviewed

Post-training lifts video models' physical consistency
PhyWorld: Physics-Faithful World Model for Video Generation

Pu Zhao +12
cs.LG 2026-05-19 reviewed

Centralized critic removes action-sampling variance in self-play RL
GAE Falls Short in Imperfect-Information Self-Play Reinforcement Learning

Zhiyuan Fan +1
cs.CR 2026-05-19 reviewed

Quantum hybrid raises F1 when UAV detectors drop contextual proxies
Quantum Machine Learning for Cyber-Physical Anomaly Detection in Unmanned Aerial Vehicles: A Leakage-Free Evaluation with Proxy-Audited Feature Sets

Carlos A. Dur\'an Paredes +4
cs.LG 2026-05-19 reviewed

Regime gate improves time series forecast accuracy under shifts
DeRegiME: Deep Regime Mixtures for Probabilistic Forecasting under Distribution Shift

Kieran Wood +2
cs.CV 2026-05-19 reviewed

Method reduces age bias in medical image classification by decorrelating difficulty
Robust Mitigation of Age-Dependent Confounding Effects via Sample-Difficulty Decorrelation

Nikhil Cherian Kurian +4
cs.CL 2026-05-19 reviewed

Step-level scores flag reasoning errors in closed LLMs
Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution

Xiaoou Liu +5
cs.CL 2026-05-19 reviewed

LLM Uncertainty Scores Only Measure Output Consistency
Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering

Tiejin Chen +3
cs.LG 2026-05-19 reviewed

Regularizer cuts demographic gaps in medical image AI
Worst-Group Equalized Odds Regularization for Multi-Attribute Fair Medical Image Classification

Nikhil Cherian Kurian +8
stat.AP 2026-05-19 reviewed

RL on All of Us data prescribes steadier higher daily steps
Precision Physical Activity Prescription via Reinforcement Learning for Functional Actions

Gefei Lin +3
cs.CV 2026-05-19 reviewed

Quantized model cuts brain tumor AI size by 6x with same accuracy
Quantized Machine Learning Models for Medical Imaging in Low-Resource Healthcare Settings

Sumanth Meenan Kanneti +1
cs.LG 2026-05-19 reviewed

PneumoNet hits 86.6% accuracy with 1.4% forgetting across device shifts
On-Device Continual Learning with Dual-Stage Buffer and Dynamic Loss for Point-of-Care Pneumonia Diagnosis

Danu Kim
stat.ML 2026-05-18 reviewed

Multi-head attention error falls as subspaces decorrelate
Multi-Head Attention as Ensemble Nadaraya-Watson Estimation: Variance Reduction, Decorrelation, and Optimal Head Diversity

Ernest Fokou\'e
cs.LG 2026-05-18 reviewed

SPRT cuts LLM debate calls 3.7x on GSM8K at 97% accuracy
Sequential Consensus for Multi-Agent LLM Debates: A Wald-SPRT compute governor with calibration-based failure detection

Andrea Morandi
cs.LG 2026-05-18 reviewed

Action-gap certificate certifies greedy goal reach in sparse planning
Planner-Admissible Graph-PDE Value Extensions for Sparse Goal-Conditioned Planning

Shiheng Zhang
astro-ph.EP 2026-05-18 reviewed

Drones with machine learning aid meteorite recovery
A Cloud-Based Tool for Meteorite Recovery Using Drones and Machine Learning

Seamus L. Anderson +32
cond-mat.dis-nn 2026-05-18 reviewed

Exponential activations let RBMs capture strong higher-order terms
Activation Functions, Statistics and Learning of Higher-Order Interactions in Restricted Boltzmann Machines

Giovanni di Sarra +1
cs.LG 2026-05-18 reviewed

Retrieval memory sharpens forecasts for new delivery zones
Bridge: Retrieval-Augmented Spatiotemporal Modeling for Urban Delivery Demand

Yihong Tang +5
stat.ML 2026-05-18 reviewed

Higher-order Langevin dynamics reduce memorization in diffusion models
Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

Benjamin Sterling +2
cs.RO 2026-05-18 reviewed

Reward heuristics tune quadrotor RL policies for fast or slow settling
A Heuristic Approach for Performance Tuning in RL-based Quadrotor Control via Reward Design and Termination Conditions

Fausto Mauricio Lagos Suarez +3
cs.AI 2026-05-18 reviewed

AI agents produce 117 papers but none clear top-tier bar
How Far Are We From True Auto-Research?

Zhengxin Zhang +3
cs.LG 2026-05-18 reviewed

Wrapper gives pathwise risk control for updating LLMs
Conformal Selective Acting: Anytime-Valid Risk Control for RLVR-Trained LLMs

Hamed Khosravi +1

4 Piths
stat.ML 2026-05-18 reviewed

Total capacity of stationary physical systems predicts ML performance
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration

Rahul Uma Ramachandran +1
stat.ML 2026-05-18 reviewed

Total IPC of stationary systems bounds to readout count and predicts ML results
Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration

Rahul Uma Ramachandran +1
cs.LG 2026-05-18 reviewed

Sparse matrix bank gives SSMs dense-model expressivity
Flash PD-SSM: Memory-Optimized Structured Sparse State-Space Models

Aleksandar Terzi\'c +6
cs.LG 2026-05-18 reviewed

Low-rank bandits recover drifting subspaces from scalar rewards
Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity

Hamed Khosravi +1

4 Piths
cs.CR 2026-05-18 reviewed

Benign rewriting lifts LLM safety against poisoning by 51 percent
Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

John T. Halloran +1
cs.LG 2026-05-18 reviewed

Pareto points minimize forgetting on conflicting tasks
PMF-CL: Pareto-Minimal-Forgetting Continual Learner for Conflicting Tasks

Srijith Nair +2
cs.LG 2026-05-18 reviewed

Local attack and support calls stabilize global argument rankings
GRASP: Deterministic argument ranking in interaction graphs

Diganta Misra +3
cs.LG 2026-05-18 reviewed

One model trained on text and time series matches both specialists
Chronicle: A Multimodal Foundation Model for Joint Language and Time Series Understanding

Paul Quinlan +3
cs.RO 2026-05-18 reviewed

Smartphone teleop rivals specialized hardware for robot demos
COBALT: Crowdsourcing Robot Learning via Cloud-Based Teleoperation with Smartphones

Ayush Agarwal +8
cs.RO 2026-05-18 reviewed

Smartphones collect 7500 robot demos in five days
COBALT: Crowdsourcing Robot Learning via Cloud-Based Teleoperation with Smartphones

Ayush Agarwal +8
cs.LG 2026-05-18 reviewed

Causal latents shown identifiable in multimodal partial-sharing setups
Identifiable Multimodal Causal Representation Learning under Partial Latent Sharing

Manal Benhamza +2
cs.LG 2026-05-18 reviewed

Text-encoded context boosts ECG pathology classification
CLIC: Contextual Language-Informed Cardiac Pathology Classification

Giovani D. Lucafo +4