pith. sign in

archive

Every paper Pith has read. Search by title, abstract, or pith.

14903 papers in cs.LG · page 18

  1. cs.IT 2026-05-19 reviewed
    Adaptive rates lower energy use in humanoid robot teleoperation

    Domain-Adaptive Communication-Rate Optimization for Sim-to-Real Humanoid-Robot Wireless XR Teleoperation

    Caolu Xu +5

  2. cs.LG 2026-05-19 reviewed
    Decoupled recursion cuts interference in MLLM edits

    Modality-Decoupled Online Recursive Editing

    Siyuan Li +3

  3. stat.ML 2026-05-19 reviewed
    Factor-augmented SGD converges with streaming high-dimensional data

    Factor Augmented High-Dimensional SGD

    Shubo Li +2

  4. cs.CL 2026-05-19 reviewed
    LLMs learn redundant copies of concepts across languages

    Language models struggle with compartmentalization

    Thomas Vincent Howe +1

  5. cs.LG 2026-05-19 reviewed
    Trajectory selection beats sampling in delayed disambiguation

    EviTrack: Selection over Sampling for Delayed Disambiguation

    Omer Haq

  6. cs.LG 2026-05-19 reviewed
    High-pass spectral filter fixes Muon failures in VLA and RLVR

    Rethinking Muon Beyond Pretraining: Spectral Failures and High-Pass Remedies for VLA and RLVR

    Chongyu Fan +4

  7. q-fin.PM 2026-05-19 reviewed
    Best volatility forecast model differs from best portfolio model

    Do Better Volatility Forecasts Lead to Better Portfolios? Evidence from Graph Neural Networks

    Rylan Wade

  8. q-fin.PM 2026-05-19 reviewed
    Three different models win at forecast error

    Do Better Volatility Forecasts Lead to Better Portfolios? Evidence from Graph Neural Networks

    Rylan Wade

  9. cs.CL 2026-05-19 reviewed
    Modular platform enables concurrent LLM evaluation

    OpenCompass: A Universal Evaluation Platform for Large Language Models

    Maosong Cao +29

  10. cs.LG 2026-05-19 reviewed
    Transformers rewrite non-attention ops as GEMM epilogues

    CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

    Han Guo +6

  11. cs.LG 2026-05-19 reviewed
    Transformers rewritten as GEMM epilogue programs

    CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

    Han Guo +6

  12. cs.LG 2026-05-19 reviewed
    Small abstract spaces enable RL generalization to larger tasks

    Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning

    Nasehatul Mustakim +1

  13. cs.LG 2026-05-19 reviewed
    GMM curriculum cuts PINN errors on PDEs by up to 98%

    From Simple to Complex: Curriculum-Guided Physics-Informed Neural Networks via Gaussian Mixture Models

    Jianan Yang +5

  14. cs.LG 2026-05-19 reviewed
    Backdoor attack hits near-100% success on masked diffusion LMs

    Backdooring Masked Diffusion Language Models

    Daniel Yiming Cao +5

  15. cs.LG 2026-05-19 reviewed
    Python framework unifies XAI methods for ECG models

    ExECG: An Explainable AI Framework for ECG models

    Jong-Hwan Jang +1

  16. cs.LG 2026-05-19 reviewed
    Proxy of post-target continuations boosts time series forecasts

    Beyond Extrapolation: Knowledge Utilization Paradigm with Bidirectional Inspiration for Time Series Forecasting

    Liu Chong +5

  17. cs.LG 2026-05-19 reviewed
    Local distance graphs recover global Euclidean embeddings

    Euclidean Embedding of Data Using Local Distances

    Dimitris Arabadjis

  18. cs.CV 2026-05-19 reviewed
    Post-training lifts video models' physical consistency

    PhyWorld: Physics-Faithful World Model for Video Generation

    Pu Zhao +12

  19. cs.LG 2026-05-19 reviewed
    Centralized critic removes action-sampling variance in self-play RL

    GAE Falls Short in Imperfect-Information Self-Play Reinforcement Learning

    Zhiyuan Fan +1

  20. cs.CR 2026-05-19 reviewed
    Quantum hybrid raises F1 when UAV detectors drop contextual proxies

    Quantum Machine Learning for Cyber-Physical Anomaly Detection in Unmanned Aerial Vehicles: A Leakage-Free Evaluation with Proxy-Audited Feature Sets

    Carlos A. Dur\'an Paredes +4

  21. cs.LG 2026-05-19 reviewed
    Regime gate improves time series forecast accuracy under shifts

    DeRegiME: Deep Regime Mixtures for Probabilistic Forecasting under Distribution Shift

    Kieran Wood +2

  22. cs.CV 2026-05-19 reviewed
    Method reduces age bias in medical image classification by decorrelating difficulty

    Robust Mitigation of Age-Dependent Confounding Effects via Sample-Difficulty Decorrelation

    Nikhil Cherian Kurian +4

  23. cs.CL 2026-05-19 reviewed
    Step-level scores flag reasoning errors in closed LLMs

    Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution

    Xiaoou Liu +5

  24. cs.CL 2026-05-19 reviewed
    LLM Uncertainty Scores Only Measure Output Consistency

    Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering

    Tiejin Chen +3

  25. cs.LG 2026-05-19 reviewed
    Regularizer cuts demographic gaps in medical image AI

    Worst-Group Equalized Odds Regularization for Multi-Attribute Fair Medical Image Classification

    Nikhil Cherian Kurian +8

  26. stat.AP 2026-05-19 reviewed
    RL on All of Us data prescribes steadier higher daily steps

    Precision Physical Activity Prescription via Reinforcement Learning for Functional Actions

    Gefei Lin +3

  27. cs.CV 2026-05-19 reviewed
    Quantized model cuts brain tumor AI size by 6x with same accuracy

    Quantized Machine Learning Models for Medical Imaging in Low-Resource Healthcare Settings

    Sumanth Meenan Kanneti +1

  28. cs.LG 2026-05-19 reviewed
    PneumoNet hits 86.6% accuracy with 1.4% forgetting across device shifts

    On-Device Continual Learning with Dual-Stage Buffer and Dynamic Loss for Point-of-Care Pneumonia Diagnosis

    Danu Kim

  29. stat.ML 2026-05-18 reviewed
    Multi-head attention error falls as subspaces decorrelate

    Multi-Head Attention as Ensemble Nadaraya-Watson Estimation: Variance Reduction, Decorrelation, and Optimal Head Diversity

    Ernest Fokou\'e

  30. cs.LG 2026-05-18 reviewed
    SPRT cuts LLM debate calls 3.7x on GSM8K at 97% accuracy

    Sequential Consensus for Multi-Agent LLM Debates: A Wald-SPRT compute governor with calibration-based failure detection

    Andrea Morandi

  31. cs.LG 2026-05-18 reviewed
    Action-gap certificate certifies greedy goal reach in sparse planning

    Planner-Admissible Graph-PDE Value Extensions for Sparse Goal-Conditioned Planning

    Shiheng Zhang

  32. astro-ph.EP 2026-05-18 reviewed
    Drones with machine learning aid meteorite recovery

    A Cloud-Based Tool for Meteorite Recovery Using Drones and Machine Learning

    Seamus L. Anderson +32

  33. cond-mat.dis-nn 2026-05-18 reviewed
    Exponential activations let RBMs capture strong higher-order terms

    Activation Functions, Statistics and Learning of Higher-Order Interactions in Restricted Boltzmann Machines

    Giovanni di Sarra +1

  34. cs.LG 2026-05-18 reviewed
    Retrieval memory sharpens forecasts for new delivery zones

    Bridge: Retrieval-Augmented Spatiotemporal Modeling for Urban Delivery Demand

    Yihong Tang +5

  35. stat.ML 2026-05-18 reviewed
    Higher-order Langevin dynamics reduce memorization in diffusion models

    Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

    Benjamin Sterling +2

  36. cs.RO 2026-05-18 reviewed
    Reward heuristics tune quadrotor RL policies for fast or slow settling

    A Heuristic Approach for Performance Tuning in RL-based Quadrotor Control via Reward Design and Termination Conditions

    Fausto Mauricio Lagos Suarez +3

  37. cs.AI 2026-05-18 reviewed
    AI agents produce 117 papers but none clear top-tier bar

    How Far Are We From True Auto-Research?

    Zhengxin Zhang +3

  38. cs.LG 2026-05-18 reviewed
    Wrapper gives pathwise risk control for updating LLMs

    Conformal Selective Acting: Anytime-Valid Risk Control for RLVR-Trained LLMs

    Hamed Khosravi +1

    4 Piths
  39. stat.ML 2026-05-18 reviewed
    Total capacity of stationary physical systems predicts ML performance

    Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration

    Rahul Uma Ramachandran +1

  40. stat.ML 2026-05-18 reviewed
    Total IPC of stationary systems bounds to readout count and predicts ML results

    Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration

    Rahul Uma Ramachandran +1

  41. cs.LG 2026-05-18 reviewed
    Sparse matrix bank gives SSMs dense-model expressivity

    Flash PD-SSM: Memory-Optimized Structured Sparse State-Space Models

    Aleksandar Terzi\'c +6

  42. cs.LG 2026-05-18 reviewed
    Low-rank bandits recover drifting subspaces from scalar rewards

    Catching a Moving Subspace: Low-Rank Bandits Beyond Stationarity

    Hamed Khosravi +1

    4 Piths
  43. cs.CR 2026-05-18 reviewed
    Benign rewriting lifts LLM safety against poisoning by 51 percent

    Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

    John T. Halloran +1

  44. cs.LG 2026-05-18 reviewed
    Pareto points minimize forgetting on conflicting tasks

    PMF-CL: Pareto-Minimal-Forgetting Continual Learner for Conflicting Tasks

    Srijith Nair +2

  45. cs.LG 2026-05-18 reviewed
    Local attack and support calls stabilize global argument rankings

    GRASP: Deterministic argument ranking in interaction graphs

    Diganta Misra +3

  46. cs.LG 2026-05-18 reviewed
    One model trained on text and time series matches both specialists

    Chronicle: A Multimodal Foundation Model for Joint Language and Time Series Understanding

    Paul Quinlan +3

  47. cs.RO 2026-05-18 reviewed
    Smartphone teleop rivals specialized hardware for robot demos

    COBALT: Crowdsourcing Robot Learning via Cloud-Based Teleoperation with Smartphones

    Ayush Agarwal +8

  48. cs.RO 2026-05-18 reviewed
    Smartphones collect 7500 robot demos in five days

    COBALT: Crowdsourcing Robot Learning via Cloud-Based Teleoperation with Smartphones

    Ayush Agarwal +8

  49. cs.LG 2026-05-18 reviewed
    Causal latents shown identifiable in multimodal partial-sharing setups

    Identifiable Multimodal Causal Representation Learning under Partial Latent Sharing

    Manal Benhamza +2

  50. cs.LG 2026-05-18 reviewed
    Text-encoded context boosts ECG pathology classification

    CLIC: Contextual Language-Informed Cardiac Pathology Classification

    Giovani D. Lucafo +4