archive

Every paper Pith has read.

14513 papers in cs.AI · page 20

cs.AI 2026-05-18 reviewed

Varying environment rules builds agents that generalize
Scalable Environments Drive Generalizable Agents

Jiayi Zhang +9
cs.CV 2026-05-18 reviewed

Agentic selector ranks second on four-day multimodal challenge
MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

Haoyu Zhang +6
cs.AI 2026-05-18 reviewed

Proxy images from EEG let AI models interpret brain signals
Visualizing the Invisible: Generative Visual Grounding Empowers Universal EEG Understanding in MLLMs

Jun-Yu Pan +5
cs.AI 2026-05-18 reviewed

One universal fix reduces hallucinations in 15 models
TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction

Tej Sanibh Ranade
cs.CV 2026-05-18 reviewed

Consistency reward lifts VLM spatial reasoning
Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency

Junming Liu +6
cs.CV 2026-05-18 reviewed

New module keeps multimodal models true to images during long answers
Vision Inference Former: Sustaining Visual Consistency in Multimodal Large Language Models

Xinpeng Dong +5
cs.AI 2026-05-18 reviewed

Black-box agents revive erased concepts in image generators
Whispers in the Noise: Surrogate-Guided Concept Awakening via a Multi-Agent Framework

Mengyu Sun +5
cs.AI 2026-05-18 reviewed

pArticleMap recovers 10.8% of future papers from literature gaps
Evidence-Grounded Frontier Mapping and Agentic Hypothesis Generation in Nanomedicine

Christiaan G.A. Viviers +8
cs.AI 2026-05-18 reviewed

GenAI lifts average performance but splits gains by interaction skill
Generative AI and the Productivity Divide: Human-AI Complementarities in Education

Lihi Idan +1
cs.CR 2026-05-18 reviewed

Indirect injections hijack chatbots to leak user data
An Empirical Study of Privacy Leakage Chains via Prompt Injection in Black-Box Chatbot Environments

Hongjang Yang +2
cs.CV 2026-05-18 reviewed

3D generators leave fingerprints that identify their source
Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

Sihan Ma +2
cs.AI 2026-05-18 reviewed

Adversarial priors raise recall in multivariate anomaly detection
POST: Prior-Observation Adversarial Learning of Spatio-Temporal Associations for Multivariate Time Series Anomaly Detection

Suofei Zhang +2
cs.AI 2026-05-18 reviewed

Grounding cuts tokens 18x while matching big models on home tasks
TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning

Zhiyuan Feng +13
math.OC 2026-05-18 reviewed

Symmetry-respecting updates beat AdamW in LLM pretraining
Symmetry-Compatible Principle for Optimizer Design: Embeddings, LM Heads, SwiGLU MLPs, and MoE Routers

Tim Tsz-Kit Lau +1
cs.AI 2026-05-18 reviewed

Modality drift collapses refusal geometry in multimodal LLMs
Safety Geometry Collapse in Multimodal LLMs and Adaptive Drift Correction

Jiahe Guo +8
cs.CV 2026-05-18 reviewed

Diffusion model generates aligned urban energy maps from roads
SENSE: Satellite-based ENergy Synthesis for Sustainable Environment

Kailai Sun +7
cs.AI 2026-05-18 reviewed

Attention plus contrastive learning solves mixed-geometry routing
Learning to Solve Compositional Geometry Routing Problems

Mingfeng Fan +5
cs.IR 2026-05-18 reviewed

SynGR boosts generative recs by limiting dominant modalities
SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation

Wei Chen +8
quant-ph 2026-05-18 reviewed

4-qubit game circuit forecasts disruptive capital trajectories
Parameterized 4-Qubit EWL Quantum Game Circuits with Dirac-Solow-Swan Hamiltonian Integration for Quadruple Helix Disruptive Innovation Recommender Systems

Agung Trisetyarso +2
cs.AI 2026-05-18 reviewed

LLM refines protocols so agents reconstruct states uniformly
LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning

Sangjun Bae +3
cs.SE 2026-05-18 reviewed

Multi-model feedback doubles AI solves on contest problems
A-ProS: Towards Reliable Autonomous Programming Through Multi-Model Feedback

Anika Tabassum +4
cs.LG 2026-05-18 reviewed

Curvature rewiring cuts over-squashing in forecasting models
Improving Spatio-Temporal Residual Error Propagation by Mitigating Over-Squashing

Seyed Mohamad Moghadas +3
cs.LG 2026-05-18 reviewed

New model predicts gene expression from tissue slides with better structure
FLAG: Foundation model representation with Latent diffusion Alignment via Graph for spatial gene expression prediction

Qi Si +7
cs.AI 2026-05-18 reviewed

GUI agents search docs for rare tasks
DocOS: Towards Proactive Document-Guided Actions in GUI Agents

Jingjing Liu +8
cs.RO 2026-05-18 reviewed

Softmax uncertainty performs like ensembles for robot gating decisions
Confidence-Gated Robot Autonomy: When Does Uncertainty Actually Help?

Johannes A. Gaus +2
cs.SE 2026-05-18 reviewed

ProcCtrlBench detects process defects in LLM coding agents that outcome scores miss
ProcCtrlBench: Evaluating Process-Level Defects and Control Preservation in LLM Coding Agents

Jiawei He +6
cs.SE 2026-05-18 reviewed

Process benchmark catches mid-task defects in LLM coding agents
ProcCtrlBench: Evaluating Process-Level Defects and Control Preservation in LLM Coding Agents

Jiawei He +6
cs.HC 2026-05-18 reviewed

Limitation disclosures calibrate case-by-case trust in XAI
Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users

Alfio Ventura +3
cs.AI 2026-05-18 reviewed

Variance reduction lifts ZO hard-thresholding direction limit
New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions

Xinzhe Yuan +7
cs.CL 2026-05-18 reviewed

Tool localizes node errors in multi-agent LLM workflows
PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows

Kazuki Kawamura +2
quant-ph 2026-05-18 reviewed

Quantum sidecars generate signals for AI optimizers
Quantum Sidecar Architectures for Hybrid AI Training and Inference: Stateful Protected Registers, Stateless Reset-and-Reprepare Circuits and Quantum Weight-State Outlook

Y. Mo +1
cs.LG 2026-05-18 reviewed

Rectification LoRA fixes hallucinations in federated self-distillation
FedSDR: Federated Self-Distillation with Rectification

Ziheng Ren +4
cs.AI 2026-05-18 reviewed

LLMs reach 90% on telecom language but only 30% on fixes
TeleCom-Bench: How Far Are Large Language Models from Industrial Telecommunication Applications?

Jieting Xiao +12
cs.LG 2026-05-18 reviewed

Framework trains agents to coordinate despite disrupted interactions
Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning

Sunwoo Lee +3
cs.LG 2026-05-18 reviewed

Frequency extraction recovers hidden generalization at 80% noise
Unveiling Memorization-Generalization Coexistence: A Case Study on Arithmetic Tasks with Label Noise

Linyu Liu +1
cs.CV 2026-05-18 reviewed

Training fixes attention so text alone locates video objects
See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Boyuan Sun +4
cs.CV 2026-05-18 reviewed

TinySAM 2 cuts SAM 2 memory tokens to 7 percent at 90 percent accuracy
TinySAM 2: Extreme Memory Compression for Efficient Track Anything Model

Zhaoyuan Ding +3
cs.CV 2026-05-18 reviewed

Semantic scoring refines distilled image datasets
SAS: Semantic-aware Sampling for Generative Dataset Distillation

Mingzhuo Li +6
cs.NE 2026-05-18 reviewed

FPGA accelerator adds on-device learning to spiking networks
Spiker-LL: An Energy-Efficient FPGA Accelerator Enabling Adaptive Local Learning in Spiking Neural Networks

Alessio Caviglia +3
cs.AI 2026-05-18 reviewed

Shared backbone PPO outperforms standard in multi-UAV coverage
Shared Backbone PPO for Multi-UAV Communication Coverage with Connection Preservation

Z. Jiang
cs.SE 2026-05-18 reviewed

Verify gate renders multi-agent completions inspectable and fail-closed
Verify-Gated Completion as Admission Control in a Governed Multi-Agent Runtime: A Bounded Architecture Case Study

Hai-Duong Nguyen +1
cs.SE 2026-05-18 reviewed

Verify gate turns agent completion into inspectable admission control
Verify-Gated Completion as Admission Control in a Governed Multi-Agent Runtime: A Bounded Architecture Case Study

Hai-Duong Nguyen +1
cs.LG 2026-05-18 reviewed

Per-module scaling lifts low-bit quantization accuracy
MARR: Module-Adaptive Residual Reconstruction for Low-Bit Post-Training Quantization

Le Su +2
cs.IR 2026-05-18 reviewed

E-commerce search lifts new-item GMV 5.3 percent via long-term value estimates
Towards Sustainable Growth: A Multi-Value-Aware Retrieval Framework for E-Commerce Search

Yifan Wang +4
cs.SD 2026-05-18 reviewed

Latent diffusion models generate minutes of audio in under 2 seconds
Stable Audio 3

Zach Evans +6
cs.CL 2026-05-18 reviewed

Predictive prefetching cuts RAG latency up to 43.5%
Predictive Prefetching for Retrieval-Augmented Generation

Wuyang Zhang +1
cs.CR 2026-05-18 reviewed

New benchmark finds 11-30% indirect prompt injection success in AI agents
LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injection

Lei Zhao +2
cs.LG 2026-05-18 reviewed

Sensitivity-aware SVD compresses physics models at high ratios
SAFE-SVD: Sensitivity-Aware Fidelity-Enforcing SVD for Physics Foundation Models

Chengjie Hong +4
cs.LG 2026-05-18 reviewed

LLM search discovers top kernels for high-dimensional BO
Automated Kernel Discovery Towards Understanding High-dimensional Bayesian Optimization

Taeyoung Yun +4
cs.AI 2026-05-18 reviewed

LLMs guide Bayesian optimization to 90% performance in 6 iterations
Unleashing LLMs in Bayesian Optimization: Preference-Guided Framework for Scientific Discovery

Xinzhe Yuan +6