archive

Every paper Pith has read.

14903 papers in cs.LG · page 8

quant-ph 2026-05-21 reviewed

Quantum amplitudes adapt to predict network link changes
A2QTGN: Adaptive Amplitude Quantum-Integrated Temporal Graph Network for Dynamic Link Prediction

Nouhaila Innan +4
cs.CR 2026-05-21 reviewed

Learning-based CCs degrade less than traditional ones under attack
CCLab: Adversarial Testing of Learning- and Non-Learning-Based Congestion Controllers

Zhi Chen +4
cs.IT 2026-05-21 reviewed

Topological index spots wireless receiver shifts early
Resilience Characterization of AI-Native Wireless Receivers via Persistent Homology

Christo Kurisummoottil Thomas +1
cs.LG 2026-05-21 reviewed

Optimal control yields tunable noise schedules for diffusion models
Noise Schedule Design for Diffusion Models: An Optimal Control Perspective

Seo Taek Kong +2
eess.SY 2026-05-21 reviewed

Physics laws inside neural nets speed up power-grid modeling
Engineering Hybrid Physics-Informed Neural Networks for Next-Generation Electricity Systems: A State-of-the-Art Review

Joseph Nyangon
cs.AI 2026-05-21 reviewed

7B model beats larger ones at Lean proof optimization
ImProver 2: Iteratively Self-Improving LMs for Neurosymbolic Proof Optimization

Riyaz Ahuja +3
cs.LG 2026-05-21 reviewed

Predicting switch timing improves game strategy advice
When to Switch, Not Just What: Transition Quality Prediction in Clash Royale

Heeyun Heo +1
q-bio.PE 2026-05-21 reviewed

Flow model in tree space cuts divergence to phylogenetic posteriors
PhylaFlow: Hybrid Flow Matching in Billera-Holmes-Vogtmann Tree Space for Phylogenetic Inference

Yasha Ektefaie +4
cs.LG 2026-05-21 reviewed

Truncating CoT exposes evasive contamination in LLMs
The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation

Yifan Lan +4
cs.LG 2026-05-21 reviewed

Accumulating oracle signals yields token-level advantages for LLMs
OPPO: Bayesian Value Recursion for Token-Level Credit Assignment in LLM Reasoning

Yu Li +3
cs.LG 2026-05-21 reviewed

Accumulating oracle signals yields token-level advantages in one pass
OPPO: Bayesian Value Recursion for Token-Level Credit Assignment in LLM Reasoning

Yu Li +3
cs.LG 2026-05-21 reviewed

Dictionary realignment keeps OOD explanations faithful
Geometry-Adaptive Explainer for Faithful Dictionary-Based Interpretability under Distribution Shift

Sungjun Lim +3
stat.ME 2026-05-21 reviewed

Equal-variance structural VARs identified only up to orthogonal transforms and scale
Causal Discovery in Structural VAR Models Under Equal Noise Variance

SeyedSina Seyedi HasanAbadi +3
cs.LG 2026-05-21 reviewed

Tensor Cache stores evicted tokens in outer-product memory
Tensor Cache: Eviction-conditioned Associative Memory for Transformers

Kabir Swain +4
cs.LG 2026-05-21 reviewed

Energy gating lifts transformer loss by 0.1 with tiny overhead
Energy-Gated Attention: Spectral Salience as an Inductive Bias for Transformer Attention

Athanasios Zeris
cs.LG 2026-05-20 reviewed

On-policy training halves LLM sycophancy without capability loss
On-Policy Consistency Training Improves LLM Safety with Minimal Capability Degradation

Andy Han +7
cs.LG 2026-05-20 reviewed

Expert comparisons guide nanoscale experiments without scalar goals
Beyond Scalar Objectives: Expert-Feedback-Driven Autonomous Experimentation for Scientific Discovery at the Nanoscale

Ralph Bulanadi +7
cs.LG 2026-05-20 reviewed

Symbolic search recovers exact discrete distribution formulas
Symbolic Density Estimation for Discrete Distributions

Ziwen Liu +1
stat.CO 2026-05-20 reviewed

Truncation makes neural likelihood work for long state sequences
Truncated Neural Likelihood Estimation for Simulation-Based Inference in State-Space Models

Kostas Tsampourakis +1
eess.IV 2026-05-20 reviewed

Embeddings support 99% accurate tomato field mapping
Mapping Tomato Cropping Systems in California Using AlphaEarth Geospatial Embeddings and Deep Learning Analysis

Mohammadreza Narimani +2
cs.LG 2026-05-20 reviewed

Optimizers create different spectral scaling laws in the same model
Same Architecture, Different Capacity: Optimizer-Induced Spectral Scaling Laws

Nandan Kumar Jha +1
cs.LG 2026-05-20 reviewed

Geometry-aware calibration closes entropy gaps for LLM optimization
Why Semantic Entropy Fails: Geometry-Aware and Calibrated Uncertainty for Policy Optimization

Zheyuan Zhang +5
cs.LG 2026-05-20 reviewed

One platform unifies the full world model research pipeline
stable-worldmodel: A Platform for Reproducible World Modeling Research and Evaluation

Lucas Maes +11
cs.AI 2026-05-20 reviewed

Agentic AI uses 4.33x more energy per successful goal than linear baselines
Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems

Deepak Panigrahy +1
cs.LG 2026-05-20 reviewed

KL divergence to GPs splits into three costs for neural processes
Three Costs of Amortizing Gaussian Process Inference with Neural Processes

Robin Young
cs.CL 2026-05-20 reviewed

DivSkill-SQL lifts Text-to-SQL accuracy by up to 11 points
Residual Skill Optimization for Text-to-SQL Ensembles

Jiongli Zhu +10
cs.LG 2026-05-20 reviewed

MMD-balls as credal sets bound worst-case risk in test-time adaptation
MMD-Balls as Credal Sets: A PAC-Bayesian Framework for Epistemic Uncertainty in Test-Time Adaptation

Ahanaf Hasan Ariq
cs.LG 2026-05-20 reviewed

Privacy profiles connect randomized smoothing to differential privacy for joint…
Provable Robustness against Backdoor Attacks via the Primal-Dual Perspective on Differential Privacy

Aman Saxena +3
cs.CR 2026-05-20 reviewed

LLMs lose accuracy on complex noisy logs for intrusion detection
HIDBench: Benchmarking Large Language Models for Host-Based Intrusion Detection

Danyu Sun +3
cs.LG 2026-05-20 reviewed

Manifold projections steer LLMs clear of reasoning mistakes
Manifold-Guided Attention Steering

ian Li +5
cs.LG 2026-05-20 reviewed

Local rerollouts fix unfair credit assignment in memory LLM agents
Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents

Sikuan Yan +6
cs.LG 2026-05-20 reviewed

Sampling-based inference reaches parity with optimization in BNNs
Position: The Time for Sampling Is Now! Charting a New Course for Bayesian Deep Learning

Emanuel Sommer +1
cs.LG 2026-05-20 reviewed

Only full-domain utilities make OCE risk measures PAC-learnable in RL
On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents

Oliver Mortensen +1
cs.LG 2026-05-20 reviewed

ML on calcium scans predicts obstructive CAD
Machine learning prediction of obstructive coronary artery disease using opportunistic coronary calcium and epicardial fat assessments from CT calcium scoring scans

Juhwan Lee +9
cs.LG 2026-05-20 reviewed

External data files fix binding failures in text-to-optimization
Models Can Model, But Can't Bind: Structured Grounding in Text-to-Optimization

Zhiqi Gao +4
cs.LG 2026-05-20 reviewed

Pairwise comparisons yield unbiased preference percentiles
PEARL: Unbiased Percentile Estimation via Contrastive Learning for Industrial-Scale Livestream Recommendation

Blake Gella +8
cs.LG 2026-05-20 reviewed

Calcium-omics features lift ischemia prediction from CT scans to 99% precision
Quantitative coronary calcification analysis for prediction of myocardial ischemia using non-contrast CT calcium scoring

Juhwan Lee +8
cs.LG 2026-05-20 reviewed

Thresholding fixes class imbalance in PFNs for tabular data
Correcting Class Imbalance in Prior-Data Fitted Networks for Tabular Classification

Samuel McDowell +2
stat.ML 2026-05-20 reviewed

Support-aware method certifies ad reserve policies from logs
Support-aware offline policy selection for advertising marketplaces

Prashant Shekhar +1
cs.LG 2026-05-20 reviewed

Audit tool uncovers hidden differences in accurate AI drug models
I-SAFE: Wasserstein Coherence Metrics for Structural Auditing of Scientific AI Models

Barbara Tarantino +2
cs.CV 2026-05-20 reviewed

Lightweight cross-encoder matches LLM judges for caption evaluation
BEiTScore: Reference-free Image Captioning Evaluation with an Efficient Cross-Encoder Model

Gonçalo Gomes +2
cs.LG 2026-05-20 reviewed

Exact doubly stochastic mixes via transportation polytopes
TBP-mHC: full expressivity for manifold-constrained hyper connections through transportation polytopes

Anton Lyubinin
cond-mat.stat-mech 2026-05-20 reviewed

Adaptive bias lets neural samplers cross discrete energy barriers
MetaDNS: Enhancing Exploration in Discrete Neural Samplers via Well-Tempered Metadynamics

Xiaochen Du +9
cs.CE 2026-05-20 reviewed

Market maker adapts to new regimes without retraining
Zero-shot adaptation to order book dynamics

Arip Asadulaev
cs.LG 2026-05-20 reviewed

Projection matrix aligns tokenizers for better distillation
X-Token: Projection-Guided Cross-Tokenizer Knowledge Distillation

Sharath Turuvekere Sreenivas +6
cs.LG 2026-05-20 reviewed

Representation Gap is governed by task intrinsic dimension
Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective

David Perera +4
cs.LG 2026-05-20 reviewed

Stochastic policy amortizes diffusion guidance for 5x faster sampling
Hierarchical Variational Policies for Reward-Guided Diffusion

Kushagra Pandey +4
cs.LG 2026-05-20 reviewed

Actor updates match value gradients under differentiable rollouts
Value-Gradient Hypothesis of RL for LLMs

Arip Asadulaev +3
cs.LG 2026-05-20 reviewed

Fine-tuned detectors amplify a pretrained typicality axis
Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction

Alexander Smirnov
cs.LG 2026-05-20 reviewed

Entmax turns KV cache truncation into exact support recovery
EntmaxKV: Support-Aware Decoding for Entmax Attention

Gonçalo Duarte +2
