pith. sign in

archive

Every paper Pith has read. Search by title, abstract, or pith.

14903 papers in cs.LG · page 7

  1. cs.LG 2026-05-21 reviewed
    ARC-STAR cuts PDE rollout error 36x on every cell

    ARC-STAR: Auditable Post-Hoc Correction for PDE Foundation Models

    Chengze Li +9

  2. cs.LG 2026-05-21 reviewed
    ARC-STAR cuts PDE model error 36x across all regimes

    ARC-STAR: Auditable Post-Hoc Correction for PDE Foundation Models

    Chengze Li +9

  3. cs.LG 2026-05-21 reviewed
    Attention mask forces transformer backtracking to ignore search history

    Can Transformers Learn to Verify During Backtracking Search?

    Yin Jun Phua +3

  4. cs.LG 2026-05-21 reviewed
    Strict gate stabilizes self-play RL regardless of reward

    Survive or Collapse: The Asymmetric Roles of Data Gating and Reward Grounding in Self-Play RL

    Sophia Xiao Pu +6

  5. eess.SY 2026-05-21 reviewed
    Kernel embeddings learn safe barriers during deep RL

    Kernel-Based Safe Exploration in Deep Reinforcement Learning

    Rupak Majumdar +2

  6. cs.AI 2026-05-21 reviewed
    9B model with skill modules beats 32B LLM

    Skill Weaving: Efficient LLM Improvement via Modular Skillpacks

    Zhuo Li +7

  7. cs.CV 2026-05-21 reviewed
    Video models top open suturing skill challenge

    OSS: Open Suturing Skills Vision-Based Assessment Challenge 2024-2025

    Hanna Hoffmann +56

  8. cs.LG 2026-05-21 reviewed
    RL automates adaptive graphs of operations for LLM prompting

    Reinforced Graph of Thoughts: RL-Driven Adaptive Prompting for LLMs

    Manuel Noah Riesen +1

  9. cs.LG 2026-05-21 reviewed
    Two-point feedback lets prediction error set bandit regret

    Bandit Convex Optimization with Gradient Prediction Adaptivity

    Shuche Wang +2

  10. cs.LG 2026-05-21 reviewed
    GPU batches cut optimal sparse GLM search time by 10-100 times

    From Sequential Nodes to GPU Batches: Parallel Branch and Bound for Optimal $k$-Sparse GLMs

    Jiachang Liu +1

  11. cs.CV 2026-05-21 reviewed
    Telematics and CV fusion boosts MLLM safety event detection

    Enhancing Multimodal Large Language Models for Safety-Critical Driving Video Analysis

    Tomaso Trinci +2

  12. cs.LG 2026-05-21 reviewed
    Infinite-order kernels raise neural operator accuracy

    IKNO: Infinite-order Kernel Neural Operators

    Pengyuan Zhu +2

  13. cs.LG 2026-05-21 reviewed
    4B RL policy beats GPT-5 by picking expert models

    Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

    Jinyang Wu +9

  14. cs.AI 2026-05-21 reviewed
    Metric shows VLM explainers miss text synergy

    Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability

    Jo\"el Roman Ky +2

  15. cs.LG 2026-05-21 reviewed
    Pairwise metric on logged pairs lifts latent planning success to 97 percent

    Beyond Euclidean Proximity: Repairing Latent World Models with Horizon-Matched Trajectory Reachability Metrics

    Liangyu Li +2

  16. astro-ph.IM 2026-05-21 reviewed
    Language models treat star spectra as text to estimate parameters

    Spectra as Language: Large Language Models for Scalable Stellar Parameter and Abundance Inference

    Hai-Ling Lu +6

  17. astro-ph.IM 2026-05-21 reviewed
    LLMs treat stellar spectra as language sequences to estimate parameters

    Spectra as Language: Large Language Models for Scalable Stellar Parameter and Abundance Inference

    Hai-Ling Lu +6

  18. cs.LG 2026-05-21 reviewed
    OWPO lets LLMs self-evolve without fixed references

    One-Way Policy Optimization for Self-Evolving LLMs

    Shuo Yang +8

  19. cs.LG 2026-05-21 reviewed
    Algebraic ML beats cross-validated CNNs on small images

    Algebraic Machine Learning for Small-to-Medium Datasets Is Competitive against Strong Standard Baselines

    David Mendez +2

  20. cs.LG 2026-05-21 reviewed
    Learned transfer keeps relevant facts in long-term KG memory

    Short-Term-to-Long-Term Memory Transfer for Knowledge Graphs under Partial Observability

    Taewoon Kim +2

  21. cs.AI 2026-05-21 reviewed
    30B agents rival 1T models with 25-95% fewer tokens

    Efficient Agentic Reasoning Through Self-Regulated Simulative Planning

    Mingkai Deng +6

  22. stat.ML 2026-05-21 reviewed
    Betting wealth bound yields empirical Bernstein LIL

    From Betting to Empirical Bernstein LIL

    Francesco Orabona

  23. astro-ph.HE 2026-05-21 reviewed
    ConvLSTM detects gamma-ray transients after learning from simulated sky

    Self-Supervised ConvLSTM for Fermi Large Area Telescope Transient Detection

    Alberto Garinei +13

  24. cs.LG 2026-05-21 reviewed
    Physics-informed model recovers aerodynamic loads from noisy bridge data

    Aerodynamic force reconstruction using physics-informed Gaussian processes

    Gledson Rodrigo Tondo +2

  25. cs.CV 2026-05-21 reviewed
    Text embeddings boost ImageNet accuracy by up to 2.7 points

    TextTeacher: What Can Language Teach About Images?

    Tobias Christian Nauen +5

  26. quant-ph 2026-05-21 reviewed
    Genetic search designs photonic quantum models reaching 99% accuracy

    Q-PhotoNAS: Hybrid Quantum Neural Architecture Search Framework on Photonic Devices

    Farah Elnakhal +4

  27. cs.SD 2026-05-21 reviewed
    Augmentations reduce TTS word error rate from 1.44 to 1.38

    RobustSpeechFlow: Learning Robust Text-to-Speech Trajectories via Augmentation-based Contrastive Flow Matching

    Jinhyeok Yang +5

  28. cs.RO 2026-05-21 reviewed
    Transformer infers contact states to adapt robots on hardware

    CoRMA: Contrastive RMA for Contact-Rich Meta-Adaptation

    Wentian Wang +8

  29. cs.LG 2026-05-21 reviewed
    Breath VOCs causally affect blood glucose levels

    Can Breath Biomarkers Causally Influence Blood Glucose? Investigating VOC-Mediated Modulation in Diabetes

    Varsha Sharma +2

  30. cs.LG 2026-05-21 reviewed
    Subproblem curriculum RL improves LLM math reasoning by 4.1 points

    From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

    Xitai Jiang +5

  31. cs.CV 2026-05-21 reviewed
    Spline-based warp gives accurate start for sparse 3DGS

    TWINGS: Thin Plate Splines Warp-aligned Initialization for Sparse-View Gaussian Splatting

    Hyeseong Kim +3

  32. cs.LG 2026-05-21 reviewed
    Prototype stages top time series accuracy on 80 of 128 UCR datasets

    Prototype-Guided Classification Sub-Task Decoupling Framework: Enhancing Generalization and Interpretability for Multivariate Time Series

    Xianhao Song +4

  33. cs.LG 2026-05-21 reviewed
    Causal attention lifts time-series classification to 98.6 percent

    CASE-NET: Deep Spatio-Temporal Representation Learning via Causal Attention and Channel Recalibration for Multivariate Time Series Classification

    Fan Zhang +2

  34. cs.CR 2026-05-21 reviewed
    Graph cuts and Bayesian memory defend RAG from dynamic attacks

    RADAR: Defending RAG Dynamically against Retrieval Corruption

    Ziyuan Chen +6

  35. cs.CV 2026-05-21 reviewed
    Reasoning paths in training data lift 3D point cloud models

    PointLLM-R: Enhancing 3D Point Cloud Reasoning via Chain-of-Thought

    Chaoqi Chen +3

  36. stat.ML 2026-05-21 reviewed
    Finite networks track mean-field limit uniformly in time

    Uniform-in-Time Weak Propagation-of-Chaos in Shallow Neural Networks

    Margalit Glasgow +1

  37. cs.LG 2026-05-21 reviewed
    Five lines of code expose an LLM's hidden vocabulary secrets

    Check Your LLM's Secret Dictionary! Five Lines of Code Reveal What Your LLM Learned (Including What It Shouldn't Have)

    Hisashi Miyashita

  38. cs.CL 2026-05-21 reviewed
    RoBERTa reaches 93 percent accuracy on IMDb sentiment task

    From TF-IDF to Transformers: A Comparative and Ensemble Approach to Sentiment Classification

    Dip Biswas Shanto +3

  39. cs.LG 2026-05-21 reviewed
    The paper identifies that in adversarial distillation

    Toward Understanding Adversarial Distillation: Why Robust Teachers Fail

    Hongsin Lee +1

  40. cs.LG 2026-05-21 reviewed
    Auditable encoder reveals semantic nodes are structurally disconnected

    Ex-GraphRAG: Interpretable Evidence Routing for Graph-Augmented LLMs

    Yoav Kor Sade +4

  41. cs.AI 2026-05-21 reviewed
    Coupled optimization yields verifiable evidence in rankings

    ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking

    Miaobo Hu +7

  42. cs.LG 2026-05-21 reviewed
    RL ties LLM reasoning to verifiable stock forecasts for 25.9% gains

    Reasoning through Verifiable Forecast Actions: Consistency-Grounded RL for Financial LLMs

    Jialin Chen +9

  43. cs.LG 2026-05-21 reviewed
    Sparsity allocation choice changes label-free repair accuracy

    How Sparsity Allocation Shapes Label-Free Post-Pruning Recoverability

    Qishi Zhan +2

  44. cs.LG 2026-05-21 reviewed
    IAdaPID-ADG optimizer fixes Adam convergence and stability

    An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning

    Saurabh Saini +3

  45. cs.LG 2026-05-21 reviewed
    Medical world model cuts kidney disease forecast error by 7%

    ChronoMedicalWorld: A Medical World Model for Learning Patient Trajectories from Longitudinal Care Data

    Jiangyuan Wang +5

  46. cs.LG 2026-05-21 reviewed
    Latent memory mixture lifts continual accuracy by 10 percent

    Dynamic Mixture of Latent Memories for Self-Evolving Agents

    Dianzhi Yu +9

  47. cs.LG 2026-05-21 reviewed
    Defense blocks semantic attacks on LLM rankings with perfect precision

    SCI-Defense: Defending Manipulation Attacks from Generative Engine Optimization

    Xucheng Yu +3

  48. cs.LG 2026-05-21 reviewed
    Rényi DP audits reach information-theoretic optimality up to logs

    Optimal Guarantees for Auditing R\'enyi Differentially Private Machine Learning

    Benjamin D. Kim +2

  49. cond-mat.stat-mech 2026-05-21 reviewed
    Irreversibility equates four measures and picks low-entropy paths

    Thermodynamic Irreversibility of Training Algorithms

    Liu Ziyin +3

  50. cs.LG 2026-05-21 reviewed
    CausalGuard weights candidate graphs for covered causal effect estimates

    CausalGuard: Conformal Inference under Graph Uncertainty

    Vikash Singh +14