archive

Every paper Pith has read. Search by title, abstract, or pith.

14903 papers in cs.LG · page 7

cs.LG 2026-05-21 reviewed

ARC-STAR cuts PDE rollout error 36x on every cell
ARC-STAR: Auditable Post-Hoc Correction for PDE Foundation Models

Chengze Li +9
cs.LG 2026-05-21 reviewed

Attention mask forces transformer backtracking to ignore search history
Can Transformers Learn to Verify During Backtracking Search?

Yin Jun Phua +3
cs.LG 2026-05-21 reviewed

Strict gate stabilizes self-play RL regardless of reward
Survive or Collapse: The Asymmetric Roles of Data Gating and Reward Grounding in Self-Play RL

Sophia Xiao Pu +6
eess.SY 2026-05-21 reviewed

Kernel embeddings learn safe barriers during deep RL
Kernel-Based Safe Exploration in Deep Reinforcement Learning

Rupak Majumdar +2
cs.AI 2026-05-21 reviewed

9B model with skill modules beats 32B LLM
Skill Weaving: Efficient LLM Improvement via Modular Skillpacks

Zhuo Li +7
cs.CV 2026-05-21 reviewed

Video models top open suturing skill challenge
OSS: Open Suturing Skills Vision-Based Assessment Challenge 2024-2025

Hanna Hoffmann +56
cs.LG 2026-05-21 reviewed

RL automates adaptive graphs of operations for LLM prompting
Reinforced Graph of Thoughts: RL-Driven Adaptive Prompting for LLMs

Manuel Noah Riesen +1
cs.LG 2026-05-21 reviewed

Two-point feedback lets prediction error set bandit regret
Bandit Convex Optimization with Gradient Prediction Adaptivity

Shuche Wang +2
cs.LG 2026-05-21 reviewed

GPU batches cut optimal sparse GLM search time by 10-100 times
From Sequential Nodes to GPU Batches: Parallel Branch and Bound for Optimal $k$-Sparse GLMs

Jiachang Liu +1
cs.CV 2026-05-21 reviewed

Telematics and CV fusion boosts MLLM safety event detection
Enhancing Multimodal Large Language Models for Safety-Critical Driving Video Analysis

Tomaso Trinci +2
cs.LG 2026-05-21 reviewed

Infinite-order kernels raise neural operator accuracy
IKNO: Infinite-order Kernel Neural Operators

Pengyuan Zhu +2
cs.LG 2026-05-21 reviewed

4B RL policy beats GPT-5 by picking expert models
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

Jinyang Wu +9
cs.AI 2026-05-21 reviewed

Metric shows VLM explainers miss text synergy
Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability

Joël Roman Ky +2
cs.LG 2026-05-21 reviewed

Pairwise metric on logged pairs lifts latent planning success to 97 percent
Beyond Euclidean Proximity: Repairing Latent World Models with Horizon-Matched Trajectory Reachability Metrics

Liangyu Li +2
astro-ph.IM 2026-05-21 reviewed

Language models treat star spectra as text to estimate parameters
Spectra as Language: Large Language Models for Scalable Stellar Parameter and Abundance Inference

Hai-Ling Lu +6
cs.LG 2026-05-21 reviewed

OWPO lets LLMs self-evolve without fixed references
One-Way Policy Optimization for Self-Evolving LLMs

Shuo Yang +8
cs.LG 2026-05-21 reviewed

Algebraic ML beats cross-validated CNNs on small images
Algebraic Machine Learning for Small-to-Medium Datasets Is Competitive against Strong Standard Baselines

David Mendez +2
cs.LG 2026-05-21 reviewed

Learned transfer keeps relevant facts in long-term KG memory
Short-Term-to-Long-Term Memory Transfer for Knowledge Graphs under Partial Observability

Taewoon Kim +2
cs.AI 2026-05-21 reviewed

30B agents rival 1T models with 25-95% fewer tokens
Efficient Agentic Reasoning Through Self-Regulated Simulative Planning

Mingkai Deng +6
stat.ML 2026-05-21 reviewed

Betting wealth bound yields empirical Bernstein LIL
From Betting to Empirical Bernstein LIL

Francesco Orabona
astro-ph.HE 2026-05-21 reviewed

ConvLSTM detects gamma-ray transients after learning from simulated sky
Self-Supervised ConvLSTM for Fermi Large Area Telescope Transient Detection

Alberto Garinei +13
cs.LG 2026-05-21 reviewed

Physics-informed model recovers aerodynamic loads from noisy bridge data
Aerodynamic force reconstruction using physics-informed Gaussian processes

Gledson Rodrigo Tondo +2
cs.CV 2026-05-21 reviewed

Text embeddings boost ImageNet accuracy by up to 2.7 points
TextTeacher: What Can Language Teach About Images?

Tobias Christian Nauen +5
quant-ph 2026-05-21 reviewed

Genetic search designs photonic quantum models reaching 99% accuracy
Q-PhotoNAS: Hybrid Quantum Neural Architecture Search Framework on Photonic Devices

Farah Elnakhal +4
cs.SD 2026-05-21 reviewed

Augmentations reduce TTS word error rate from 1.44 to 1.38
RobustSpeechFlow: Learning Robust Text-to-Speech Trajectories via Augmentation-based Contrastive Flow Matching

Jinhyeok Yang +5
cs.RO 2026-05-21 reviewed

Transformer infers contact states to adapt robots on hardware
CoRMA: Contrastive RMA for Contact-Rich Meta-Adaptation

Wentian Wang +8
cs.LG 2026-05-21 reviewed

Breath VOCs causally affect blood glucose levels
Can Breath Biomarkers Causally Influence Blood Glucose? Investigating VOC-Mediated Modulation in Diabetes

Varsha Sharma +2
cs.LG 2026-05-21 reviewed

Subproblem curriculum RL improves LLM math reasoning by 4.1 points
From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

Xitai Jiang +5
cs.CV 2026-05-21 reviewed

Spline-based warp gives accurate start for sparse 3DGS
TWINGS: Thin Plate Splines Warp-aligned Initialization for Sparse-View Gaussian Splatting

Hyeseong Kim +3
cs.LG 2026-05-21 reviewed

Prototype stages top time series accuracy on 80 of 128 UCR datasets
Prototype-Guided Classification Sub-Task Decoupling Framework: Enhancing Generalization and Interpretability for Multivariate Time Series

Xianhao Song +4
cs.LG 2026-05-21 reviewed

Causal attention lifts time-series classification to 98.6 percent
CASE-NET: Deep Spatio-Temporal Representation Learning via Causal Attention and Channel Recalibration for Multivariate Time Series Classification

Fan Zhang +2
cs.CR 2026-05-21 reviewed

Graph cuts and Bayesian memory defend RAG from dynamic attacks
RADAR: Defending RAG Dynamically against Retrieval Corruption

Ziyuan Chen +6
cs.CV 2026-05-21 reviewed

Reasoning paths in training data lift 3D point cloud models
PointLLM-R: Enhancing 3D Point Cloud Reasoning via Chain-of-Thought

Chaoqi Chen +3
stat.ML 2026-05-21 reviewed

Finite networks track mean-field limit uniformly in time
Uniform-in-Time Weak Propagation-of-Chaos in Shallow Neural Networks

Margalit Glasgow +1
cs.LG 2026-05-21 reviewed

Five lines of code expose an LLM's hidden vocabulary secrets
Check Your LLM's Secret Dictionary! Five Lines of Code Reveal What Your LLM Learned (Including What It Shouldn't Have)

Hisashi Miyashita
cs.CL 2026-05-21 reviewed

RoBERTa reaches 93 percent accuracy on IMDb sentiment task
From TF-IDF to Transformers: A Comparative and Ensemble Approach to Sentiment Classification

Dip Biswas Shanto +3
cs.LG 2026-05-21 reviewed

Paper examines why robust teachers fail in adversarial distillation
Toward Understanding Adversarial Distillation: Why Robust Teachers Fail

Hongsin Lee +1
cs.LG 2026-05-21 reviewed

Auditable encoder reveals semantic nodes are structurally disconnected
Ex-GraphRAG: Interpretable Evidence Routing for Graph-Augmented LLMs

Yoav Kor Sade +4
cs.AI 2026-05-21 reviewed

Coupled optimization yields verifiable evidence in rankings
ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking

Miaobo Hu +7
cs.LG 2026-05-21 reviewed

RL ties LLM reasoning to verifiable stock forecasts for 25.9% gains
Reasoning through Verifiable Forecast Actions: Consistency-Grounded RL for Financial LLMs

Jialin Chen +9
cs.LG 2026-05-21 reviewed

Sparsity allocation choice changes label-free repair accuracy
How Sparsity Allocation Shapes Label-Free Post-Pruning Recoverability

Qishi Zhan +2
cs.LG 2026-05-21 reviewed

IAdaPID-ADG optimizer fixes Adam convergence and stability
An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning

Saurabh Saini +3
cs.LG 2026-05-21 reviewed

Medical world model cuts kidney disease forecast error by 7%
ChronoMedicalWorld: A Medical World Model for Learning Patient Trajectories from Longitudinal Care Data

Jiangyuan Wang +5
cs.LG 2026-05-21 reviewed

Latent memory mixture lifts continual accuracy by 10 percent
Dynamic Mixture of Latent Memories for Self-Evolving Agents

Dianzhi Yu +9
cs.LG 2026-05-21 reviewed

Defense blocks semantic attacks on LLM rankings with perfect precision
SCI-Defense: Defending Manipulation Attacks from Generative Engine Optimization

Xucheng Yu +3
cs.LG 2026-05-21 reviewed

Rényi DP audits reach information-theoretic optimality up to logs
Optimal Guarantees for Auditing Rényi Differentially Private Machine Learning

Benjamin D. Kim +2
cond-mat.stat-mech 2026-05-21 reviewed

Irreversibility equates four measures and picks low-entropy paths
Thermodynamic Irreversibility of Training Algorithms

Liu Ziyin +3
cs.LG 2026-05-21 reviewed

CausalGuard weights candidate graphs for covered causal effect estimates
CausalGuard: Conformal Inference under Graph Uncertainty

Vikash Singh +14