archive

Every paper Pith has read. Search by title, abstract, or pith.

14513 papers in cs.AI · page 9

cs.LG 2026-05-20 reviewed

Inductive logic turns neural circuit findings into transferable theories
From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach

Nura Aljaafari +2
cs.CL 2026-05-20 reviewed

LLMs follow logical rules for conditionals but miss human implications
Tracing the ongoing emergence of human-like reasoning in Large Language Models

Paolo Morosi +4
cs.LG 2026-05-20 reviewed

Semantic route cuts mental health prediction error across datasets
TimeSRL: Generalizable Time-Series Behavioral Modeling via Semantic RL-Tuned LLMs -- A Case Study in Mental Health

Yuang Fan +10
stat.ML 2026-05-20 reviewed

Large learning rates alter transformer attractors to cycles and chaos
Large-Step Training Dynamics of a Two-Factor Linear Transformer Model

Krishnakumar Balasubramanian
cs.LG 2026-05-20 reviewed

One-step MeanFlow policies beat Gaussian baselines in RL
Stochastic MeanFlow Policies: One-Step Generative Control with Entropic Mirror Descent

Zeyuan Wang +8
cs.LG 2026-05-20 reviewed

One-step generative policies add multimodal actions to mirror descent RL
Stochastic MeanFlow Policies: One-Step Generative Control with Entropic Mirror Descent

Zeyuan Wang +8
cs.CV 2026-05-20 reviewed

105M open image-text pairs train competitive text-to-image model
MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset

Benjamin Aubin +6
cs.LG 2026-05-20 reviewed

Moderate warm-up lets offline DPO surpass online RL on math reasoning
How Much Online RL is Enough? Informative Rollouts for Offline Preference Optimization in RLVR

Richa Verma +1
cs.RO 2026-05-20 reviewed

Structural latent points raise robotic task success rates
Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation

YiCheng Jiang +10
cs.LG 2026-05-20 reviewed

Strategy-map DAG keeps self-evolving agents from repeating old routines
APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents

Yibo Li +7
cs.CV 2026-05-20 reviewed

Region-aware VAE completes full heart motion cycle from single frame
RePCM: Region-Specific and Phenotype-Adaptive Bi-Ventricular Cardiac Motion Synthesis

Xuan Yang +5
cs.LG 2026-05-20 reviewed

Octahedral triplet quantizer trims KV cache bits
OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

Mark Boss +3
cs.LG 2026-05-20 reviewed

Preference tuning cuts RL policy failures by over 60%
PREFINE: Preference-Based Implicit Reward and Cost Fine-Tuning for Safety Alignment

Richa Verma +3
physics.optics 2026-05-20 reviewed

AI Automates Every Stage of Microwave Photonics Systems
Artificial Intelligence Reshapes Microwave Photonics

Peng Li +4
cs.LG 2026-05-20 reviewed

QED cuts cross-run divergence in RL by two orders of magnitude
Behavior-Consistent Deep Reinforcement Learning

Marcel Hussing +4
cs.LG 2026-05-20 reviewed

QED makes RL policies 100 times more consistent across runs
Behavior-Consistent Deep Reinforcement Learning

Marcel Hussing +4
quant-ph 2026-05-20 reviewed

Quantum RL matches classical on chemical flowsheet design
Enhanced Reinforcement Learning-based Process Synthesis via Quantum Computing

Austin Braniff +7
cs.SI 2026-05-20 reviewed

New benchmark finds naive baselines hard to beat on social media sentiment
SURGE: An Event-Centric Social Media Sentiment Time Series Benchmark with Interaction Structure

Chen Su +3
cs.NI 2026-05-20 reviewed

Hardware load balancing keeps AI networks at 98% line rate
High-speed Networking for Giga-Scale AI Factories

Sajy Khashab +13
cs.CV 2026-05-20 reviewed

SAM3 turns rough maps into sharp bacteria explanations
SAM-Sode: Towards Faithful Explanations for Tiny Bacteria Detection

Wanying Tan +9
cs.CL 2026-05-20 reviewed

Manga109 revised to correct 29,000 dialogue annotations
Manga109-v2026: Revisiting Manga109 Annotations for Modern Manga Understanding

Jeonghun Baek +4
stat.ML 2026-05-20 reviewed

Adaptive batch scaling unlocks large-batch RL
Scalable Reinforcement Learning via Adaptive Batch Scaling

Jongchan Park
cs.AI 2026-05-20 reviewed

Boundary-band generator lifts AV collision rates 6.2 points
ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving

Qiyu Ruan +4
cs.CV 2026-05-20 reviewed

Weierstrass function supplies 2D patch encodings for vision transformers
Weierstrass Positional Encoding for Vision Transformers

Zhihang Xin +3
cs.CV 2026-05-20 reviewed

YOLOv11 detects military targets in synthetic thermal and night drone images
Comparative Analysis of Military Detection Using Drone Imagery Across Multiple Visual Spectrums

Sourov Roy Shuvo +5
cs.CL 2026-05-20 reviewed

Fine-tuned LLM reaches 0.866 F1 on Spanish psychiatric ICD coding
Automated ICD Classification of Psychiatric Diagnoses: From Classical NLP to Large Language Models

Fernando Ortega +5
cs.CR 2026-05-20 reviewed

Spectral distances flag Trojaned DNN updates after one step
Detecting Trojaned DNNs via Spectral Regression Analysis

Samuele Pasini +2
cs.LO 2026-05-20 reviewed

Complexity results proven for entailment in cumulative dependence logics
On the Complexity of Entailment for Cumulative Propositional Dependence Logics

Kai Sauerwald +2
cs.LG 2026-05-20 reviewed

Parallel Monte Carlo trains deep state space models 10x faster
Efficient Learning of Deep State Space Models via Importance Smoothing

John-Joseph Brady +2
cs.CL 2026-05-20 reviewed

Small classifier beats LLMs at pulling exact text from papers
ACL-Verbatim: hallucination-free question answering for research

Gábor Recski +4
cs.MA 2026-05-20 reviewed

Decoupled messages sustain MARL performance at low bandwidth
Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

Alexi Canesse +3
cs.AI 2026-05-20 reviewed

Distilling LLM agents yields RPA code that cuts token use 82-96%
AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions

Minghao Chen +3
cs.CL 2026-05-20 reviewed

New benchmark separates retrieval from generation errors in legal RAG
Fine-grained Claim-level RAG Benchmark for Law

Souvick Das +2
cs.CL 2026-05-20 reviewed

ClaimRAG-LAW benchmark separates retrieval and generation errors in legal RAG
Fine-grained Claim-level RAG Benchmark for Law

Souvick Das +2
cs.CL 2026-05-20 reviewed

New dataset separates retrieval from generation in legal RAG
Fine-grained Claim-level RAG Benchmark for Law

Souvick Das +2
cs.CV 2026-05-20 reviewed

0.5B driving model matches 7B models by adding future visual states
Grounding Driving VLA via Inverse Kinematics

Junsung Park +1
cs.LG 2026-05-20 reviewed

Vector quantization builds local calibration maps for multiclass models
Divide et Calibra: Multiclass Local Calibration via Vector Quantization

Cesare Barbera +4
stat.ML 2026-05-20 reviewed

Local boundary finds valid adjustment sets for causal effects
Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions

Zeyu Liu +5
cs.CV 2026-05-20 reviewed

Dynamic sinks raise dynamic degree in long video generation
DySink: Dynamic Frame Sinks for Autoregressive Long Video Generation

Bo Ye +4
cs.CL 2026-05-20 reviewed

Agent turns natural language into governed enterprise API calls
Beyond Text-to-SQL: An Agentic LLM System for Governed Enterprise Analytics APIs

Gundeep Singh +7
cs.AI 2026-05-20 reviewed

Off-the-shelf persona vectors rival targeted sycophancy steering
Playing Devil's Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy

Ishaan Kelkar +5
cs.CL 2026-05-20 reviewed

DABS cuts multi-aspect sentiment computation by up to 60%
Single-Pass, Depth-Selective Reading for Multi-Aspect Sentiment Analysis

Yan Xia +3
cs.CV 2026-05-20 reviewed

Landsat addition cuts TanDEM-X forest height RMSE by 13.5%
Hybrid Machine Learning Model for Forest Height Estimation from TanDEM-X and Landsat Data

Islam Mansour +3
cs.CL 2026-05-20 reviewed

Anchor regularization makes LLM safety consistent across prompt variations
Towards Context-Invariant Safety Alignment for Large Language Models

Yixu Wang +6
cs.LG 2026-05-20 reviewed

Flat minima enable non-vacuous bounds for transformers on sparse boolean tasks
A Sharper Picture of Generalization in Transformers

Paul Lintilhac +1
cs.DC 2026-05-20 reviewed

Routing imbalance in MoE stays fixed when expert parallelism scales
Diagnosing Overhead in Dispatch Operations: Cross-architecture Observatory

Bole Ma +3
cs.CV 2026-05-20 reviewed

VGG16 detects fake images at 91% accuracy
Comparative Evaluation of Deep Learning Models for Fake Image Detection

Akhitha Pakala +3
cs.SE 2026-05-20 reviewed

Refusal rate misranks LLMs on bio safety
RefusalBench: Why Refusal Rate Misranks Frontier LLMs on Biological Research Prompts

Lukas Weidener +4

4 Piths
cs.CV 2026-05-20 reviewed

Layer attention gaps reveal fix for LVLM hallucinations
Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

Yutong Xie +5
cs.CV 2026-05-20 reviewed

Focus-then-context method trims VLM tokens to 22% with tiny accuracy cost
Focus-then-Context: Subject-Centric Progressive Visual Token Reduction for Vision-Language Models

Yulin Zhao +4