pith. sign in

archive

Every paper Pith has read. Search by title, abstract, or pith.

14903 papers in cs.LG · page 16

  1. cs.LG 2026-05-19 reviewed
    Alternating Muon and Lion steps improves loss at lower compute

    LionMuon: Alternating Spectral and Sign Descent for Efficient Training

    Arman Bolatov +6

  2. cs.LG 2026-05-19 reviewed
    Laplace diffusion generates long forecasts for irregular time series

    Latent Laplace Diffusion for Irregular Multivariate Time Series

    Zinuo You +2

  3. cs.CV 2026-05-19 reviewed
    Stitched model lifts rewards to noisy latents for faster alignment

    Stitched Value Model for Diffusion Alignment

    Hyojun Go +10

  4. cs.LG 2026-05-19 reviewed
    Prototypes on the hypersphere reach neural collapse by design

    Neural Collapse by Design: Learning Class Prototypes on the Hypersphere

    Panagiotis Koromilas +3

  5. cs.LG 2026-05-19 reviewed
    Class prototypes on the hypersphere reach neural collapse by design

    Neural Collapse by Design: Learning Class Prototypes on the Hypersphere

    Panagiotis Koromilas +3

  6. cs.AI 2026-05-19 reviewed
    LLMs optimize code via priors

    Prior Knowledge or Search? A Study of LLM Agents in Hardware-Aware Code Optimization

    Dmitry Redko (1) +9

  7. cs.AI 2026-05-19 reviewed
    Conformal methods deliver distribution-free coverage for AI agent scores

    Distribution-Free Uncertainty Quantification for Continuous AI Agent Evaluation

    Yuxuan Gao +2

  8. cs.LG 2026-05-19 reviewed
    B-cos GNNs deliver exact per-node explanations after one forward pass

    B-cos GNNs: Faithful Explanations through Dynamic Linearity

    Joschka Gro{\ss} +2

  9. cs.AI 2026-05-19 reviewed
    Variance-aware regret bound proven optimal for logistic MDPs

    Minimax Optimal Variance-Aware Regret Bounds for Multinomial Logistic MDPs

    Pierre Boudart (SIERRA) +4

  10. cs.LG 2026-05-19 reviewed
    Rank-1 queries keep ZO signals strong for high-rank LoRA

    AR1-ZO: Topology-Aware Rank-1 Zeroth-Order Queries for High-Rank LoRA Fine-Tuning

    Ziye Chen +5

  11. cs.LG 2026-05-19 reviewed
    Quadratic model handles heavy and light tailed noise

    Robust Subspace-Constrained Quadratic Models for Low-Dimensional Structure Learning

    Zheng Zhai +1

  12. cs.LG 2026-05-19 reviewed
    Models distort physical quantity distributions despite plausible paths

    Mechanisms of Misgeneralization in Physical Sequence Modeling

    Kento Nishi +4

  13. cs.LG 2026-05-19 reviewed
    Aligning spectrum and molecule models improves metabolite retrieval

    MSAlign: Aligning Molecule and Mass Spectra Foundation Models for Metabolite Identification

    Paul Krzakala +6

  14. eess.SP 2026-05-19 reviewed
    Ensemble ML classifies epilepsy in IED-free stimulation EEG

    Classification of IED-free EEG Responses for Assisted Epilepsy Diagnosis

    Giacomo Zanardini +4

  15. cs.AI 2026-05-19 reviewed
    Multi-agent LLM framework hits 97 percent task completion on engineering benchmarks

    EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design

    Gioele Molinari +3

  16. math.NA 2026-05-19 reviewed
    GNNs detect communities to aid graph signal interpolation

    Graph Neural Networks for Community Detection in Graph Signal Analysis

    Roberto Cavoretto +2

  17. cs.CR 2026-05-19 reviewed
    Hydra keeps 95% attack success across 500 concept pairs in diffusion models

    Awakening the Hydra: Stabilizing Multi-Concept Backdoor Injection in Text-to-Image Diffusion Models

    Kai Wang +4

  18. cs.CV 2026-05-19 reviewed
    CRP groups medical tasks from text for 73% Dice with 4% forgetting

    MedCRP-CL: Continual Medical Image Segmentation via Bayesian Nonparametric Semantic Modality Discovery

    Ziyuan Gao

  19. stat.ML 2026-05-19 reviewed
    Diffusion copula turns simultaneous crashes into expected events

    Probabilistic Multivariate Time Series Forecasting with Diffusion Copulas

    David Huk +2

  20. cs.LG 2026-05-19 reviewed
    AI workflow finds cryomicroneedle mix with 95 percent viability

    Agentic Discovery of Cryomicroneedle Formulations

    Hao Li +5

  21. cs.LG 2026-05-19 reviewed
    Spectral filter repairs fine-tuning damage without retraining

    Spectral Unforgetting: Post-Hoc Recovery of Damaged Capabilities Without Retraining

    Aarash Abro +1

  22. math.OC 2026-05-19 reviewed
    Consensus particles converge exponentially to bi-level optima

    Convergence of Consensus-Based Particle Methods for Nonconvex Bi-Level Optimization

    Yutong Chao +4

  23. physics.med-ph 2026-05-19 reviewed
    Dual-view net estimates cardiac output from short PPG

    Cross-View Attention Fusion Net: A Prior-Guided Dual-View Representation Learning for Cardiac Output Estimation from Short-Term PPG Signals

    Yaowen Zhang +5

  24. cs.LG 2026-05-19 reviewed
    OScaR reaches near-lossless INT2 KV cache quantization

    OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

    Zunhai Su +13

  25. cs.LG 2026-05-19 reviewed
    Static quantization speeds LLM inference on mobile NPUs

    Quant.npu: Enabling Efficient Mobile NPU Inference for on-device LLMs via Fully Static Quantization

    Jinghe Zhang +7

  26. q-bio.NC 2026-05-19 reviewed
    BCI-sift toolbox picks neural features to raise decoding accuracy

    BCI-sift: An automated feature selection toolbox for Brain Computer Interface applications

    Elena C Offenberg +5

  27. cs.CR 2026-05-19 reviewed
    Knowledge graph embeddings leak sensitive user attributes

    Inferring Sensitive Attributes from Knowledge Graph Embeddings: Attack and Defense Strategies

    Yasmine Hayder (PETSCRAFT)

  28. cs.CL 2026-05-19 reviewed
    One LLM system optimizes text to beat specialists on six tasks

    optimize_anything: A Universal API for Optimizing any Text Parameter

    Lakshya A Agrawal +13

  29. cs.LG 2026-05-19 reviewed
    Hierarchical Gaussian filters close the gap in deep predictive coding

    Closed-form predictive coding via hierarchical Gaussian filters

    Aleksandrs Baskakovs +5

  30. stat.ML 2026-05-19 reviewed
    Federated stochastic approximation gets explicit Gaussian error bounds

    Gaussian Approximation and Multiplier Bootstrap for Federated Linear Stochastic Approximation

    Ilya Levin +4

  31. cs.LG 2026-05-19 reviewed
    Reconstruction error from linear queries limits to sqrt(2d/(d+1)) delta

    Optimal Reconstruction from Linear Queries

    Yuval Filmus +2

  32. eess.IV 2026-05-19 reviewed
    Regularized graph diffusion yields stable EIT reconstructions

    Diffusion Graph Posterior Sampling for Nonlinear Inverse Problems with Application to Electrical Impedance Tomography

    Giovanni S. Alberti +4

  33. cs.LG 2026-05-19 reviewed
    MiMuon reaches O(1/N) generalization bound for matrix models

    MiMuon: Mixed Muon Optimizer with Improved Generalization for Large Models

    Feihu Huang +2

  34. cs.LG 2026-05-19 reviewed
    Divergence measures locate where tree surrogates lose fidelity

    A Family of Divergence Measures for Evaluating the Reconstruction Quality of Explainable Ensemble Trees

    Massimo Aria +2

  35. stat.ML 2026-05-19 reviewed
    Lévy B-spline posterior contracts near minimax rates in Besov spaces

    Posterior Contraction of L\'evy Adaptive B-spline Regression in Besov Spaces

    Jeunghun Oh +2

  36. cs.CV 2026-05-19 reviewed
    SVD-ordered paths yield less noisy model attributions

    Spectral Integrated Gradients for Coarse-to-Fine Feature Attribution

    Soyeon Kim +3

  37. cs.LG 2026-05-19 reviewed
    Graph surrogate cuts dental aerosol rollout time by 37x

    Physics-Informed Graph Neural Network Surrogates for Turbulent Nanoparticle Dispersion in Dental Clinical Environments

    Takshak Shende +1

  38. cs.LG 2026-05-19 reviewed
    Tree paths turn irregular EHR data into traceable evidence

    TreeText-CTS: Compact, Source-Traceable Tree-Path Evidence for Irregular Clinical Time-Series Prediction

    Kwanhyung Lee +5

  39. cs.LG 2026-05-19 reviewed
    Order-book no-trades yield square-root regret in market making

    Online Market Making and the Value of Observing the Order Book

    Davide Maran +1

  40. cs.LG 2026-05-19 reviewed
    Trajectory selection gives 10x faster training and better out-of-domain web agents

    Weasel: Out-of-Domain Generalization for Web Agents via Importance-Diversity Data Selection

    Fatemeh Pesaran zadeh +4

  41. physics.flu-dyn 2026-05-19 reviewed
    First open high-fidelity CFD dataset for high-lift aircraft released

    HiLiftAeroML: High-Fidelity Computational Fluid Dynamics Dataset for High-Lift Aircraft Aerodynamics

    Neil Ashton +13

  42. cs.RO 2026-05-19 reviewed
    Neural warm starts triple speed of UAV-UGV handover planning

    Learning-Accelerated Optimization-based Trajectory Planning for Cooperative Aerial-Ground Handover Missions

    Jingshan Chen +3

  43. cs.LG 2026-05-19 reviewed
    Rotations fix MXFP4 activation errors in LLMs

    TORQ: Two-Level Orthogonal Rotation for MXFP4 Quantization

    Zukang Xu +2

  44. eess.SP 2026-05-19 reviewed
    Decoupling network separates target and jamming in mixed HRRP

    JointHRRP-Net: A Statistically Constrained Decoupling Network for Joint Target and Jamming Recognition in Composite Jamming

    Yunfei Zhao +4

  45. stat.ML 2026-05-19 reviewed
    Density ratios enable adjustable post-hoc deferral

    Density-Ratio Losses for Post-Hoc Learning to Defer

    Alexander Soen +3

  46. cs.SE 2026-05-19 reviewed
    MILP solves fairness repair for neural networks with formal guarantees

    Provable Fairness Repair for Deep Neural Networks

    Jianan Ma +3

  47. cs.LG 2026-05-19 reviewed
    Inference backend shifts LLM benchmark scores by 16.6 points

    The Silent Hyperparameter: Quantifying the Impact of Inference Backends on LLM Reproducibility

    David Pape +2

  48. cs.LG 2026-05-19 reviewed
    Inference backends shift LLM scores by up to 16.6 points

    The Silent Hyperparameter: Quantifying the Impact of Inference Backends on LLM Reproducibility

    David Pape +2

  49. cs.CV 2026-05-19 reviewed
    Early core token attention ranks best seeds for text-to-image results

    Boosting Text-to-Image Diffusion Models via Core Token Attention-Based Seed Selection

    Yunzhe Zhang +2

  50. cs.CL 2026-05-19 reviewed
    Base models fool AI detectors into rating text as human

    Base Models Look Human To AI Detectors

    Yixuan Even Xu +4