pith.

archive

Every paper Pith has read. Search by title, abstract, or pith.

14513 papers in cs.AI · page 9

  1. cs.LG 2026-05-20 reviewed
    Inductive logic turns neural circuit findings into transferable theories

    From Circuit Evidence to Mechanistic Theory: An Inductive Logic Approach

    Nura Aljaafari +2

  2. cs.CL 2026-05-20 reviewed
    LLMs follow logical rules for conditionals but miss human implications

    Tracing the ongoing emergence of human-like reasoning in Large Language Models

    Paolo Morosi +4

  3. cs.LG 2026-05-20 reviewed
    Semantic route cuts mental health prediction error across datasets

    TimeSRL: Generalizable Time-Series Behavioral Modeling via Semantic RL-Tuned LLMs -- A Case Study in Mental Health

    Yuang Fan +10

  4. stat.ML 2026-05-20 reviewed
    Large learning rates alter transformer attractors to cycles and chaos

    Large-Step Training Dynamics of a Two-Factor Linear Transformer Model

    Krishnakumar Balasubramanian

  5. cs.LG 2026-05-20 reviewed
    One-step MeanFlow policies beat Gaussian baselines in RL

    Stochastic MeanFlow Policies: One-Step Generative Control with Entropic Mirror Descent

    Zeyuan Wang +8

  6. cs.LG 2026-05-20 reviewed
    One-step generative policies add multimodal actions to mirror descent RL

    Stochastic MeanFlow Policies: One-Step Generative Control with Entropic Mirror Descent

    Zeyuan Wang +8

  7. cs.CV 2026-05-20 reviewed
    105M open image-text pairs train competitive text-to-image model

    MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset

    Benjamin Aubin +6

  8. cs.LG 2026-05-20 reviewed
    Moderate warm-up lets offline DPO surpass online RL on math reasoning

    How Much Online RL is Enough? Informative Rollouts for Offline Preference Optimization in RLVR

    Richa Verma +1

  9. cs.RO 2026-05-20 reviewed
    Structural latent points raise robotic task success rates

    Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation

    YiCheng Jiang +10

  10. cs.LG 2026-05-20 reviewed
    Strategy-map DAG keeps self-evolving agents from repeating old routines

    APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents

    Yibo Li +7

  11. cs.CV 2026-05-20 reviewed
    Region-aware VAE completes full heart motion cycle from single frame

    RePCM: Region-Specific and Phenotype-Adaptive Bi-Ventricular Cardiac Motion Synthesis

    Xuan Yang +5

  12. cs.LG 2026-05-20 reviewed
    Octahedral triplet quantizer trims KV cache bits

    OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

    Mark Boss +3

  13. cs.LG 2026-05-20 reviewed
    Preference tuning cuts RL policy failures by over 60%

    PREFINE: Preference-Based Implicit Reward and Cost Fine-Tuning for Safety Alignment

    Richa Verma +3

  14. physics.optics 2026-05-20 reviewed
    AI automates every stage of microwave photonics systems

    Artificial Intelligence Reshapes Microwave Photonics

    Peng Li +4

  15. cs.LG 2026-05-20 reviewed
    QED cuts cross-run divergence in RL by two orders of magnitude

    Behavior-Consistent Deep Reinforcement Learning

    Marcel Hussing +4

  16. cs.LG 2026-05-20 reviewed
    QED makes RL policies 100 times more consistent across runs

    Behavior-Consistent Deep Reinforcement Learning

    Marcel Hussing +4

  17. quant-ph 2026-05-20 reviewed
    Quantum RL matches classical on chemical flowsheet design

    Enhanced Reinforcement Learning-based Process Synthesis via Quantum Computing

    Austin Braniff +7

  18. cs.SI 2026-05-20 reviewed
    New benchmark finds naive baselines hard to beat on social media sentiment

    SURGE: An Event-Centric Social Media Sentiment Time Series Benchmark with Interaction Structure

    Chen Su +3

  19. cs.NI 2026-05-20 reviewed
    Hardware load balancing keeps AI networks at 98% line rate

    High-speed Networking for Giga-Scale AI Factories

    Sajy Khashab +13

  20. cs.CV 2026-05-20 reviewed
    SAM3 turns rough maps into sharp bacteria explanations

    SAM-Sode: Towards Faithful Explanations for Tiny Bacteria Detection

    Wanying Tan +9

  21. cs.CL 2026-05-20 reviewed
    Manga109 revised to correct 29,000 dialogue annotations

    Manga109-v2026: Revisiting Manga109 Annotations for Modern Manga Understanding

    Jeonghun Baek +4

  22. stat.ML 2026-05-20 reviewed
    Adaptive batch scaling unlocks large-batch RL

    Scalable Reinforcement Learning via Adaptive Batch Scaling

    Jongchan Park

  23. cs.AI 2026-05-20 reviewed
    Boundary-band generator lifts AV collision rates 6.2 points

    ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving

    Qiyu Ruan +4

  24. cs.CV 2026-05-20 reviewed
    Weierstrass function supplies 2D patch encodings for vision transformers

    Weierstrass Positional Encoding for Vision Transformers

    Zhihang Xin +3

  25. cs.CV 2026-05-20 reviewed
    YOLOv11 detects military targets in synthetic thermal and night drone images

    Comparative Analysis of Military Detection Using Drone Imagery Across Multiple Visual Spectrums

    Sourov Roy Shuvo +5

  26. cs.CL 2026-05-20 reviewed
    Fine-tuned LLM reaches 0.866 F1 on Spanish psychiatric ICD coding

    Automated ICD Classification of Psychiatric Diagnoses: From Classical NLP to Large Language Models

    Fernando Ortega +5

  27. cs.CR 2026-05-20 reviewed
    Spectral distances flag Trojaned DNN updates after one step

    Detecting Trojaned DNNs via Spectral Regression Analysis

    Samuele Pasini +2

  28. cs.LO 2026-05-20 reviewed
    Complexity results proven for entailment in cumulative dependence logics

    On the Complexity of Entailment for Cumulative Propositional Dependence Logics

    Kai Sauerwald +2

  29. cs.LG 2026-05-20 reviewed
    Parallel Monte Carlo trains deep state space models 10x faster

    Efficient Learning of Deep State Space Models via Importance Smoothing

    John-Joseph Brady +2

  30. cs.CL 2026-05-20 reviewed
    Small classifier beats LLMs at pulling exact text from papers

    ACL-Verbatim: hallucination-free question answering for research

    Gábor Recski +4

  31. cs.MA 2026-05-20 reviewed
    Decoupled messages sustain MARL performance at low bandwidth

    Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

    Alexi Canesse +3

  32. cs.AI 2026-05-20 reviewed
    Distilling LLM agents yields RPA code that cuts token use 82-96%

    AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions

    Minghao Chen +3

  33. cs.CL 2026-05-20 reviewed
    New benchmark separates retrieval from generation errors in legal RAG

    Fine-grained Claim-level RAG Benchmark for Law

    Souvick Das +2

  34. cs.CL 2026-05-20 reviewed
    ClaimRAG-LAW benchmark separates retrieval and generation errors in legal RAG

    Fine-grained Claim-level RAG Benchmark for Law

    Souvick Das +2

  35. cs.CL 2026-05-20 reviewed
    New dataset separates retrieval from generation in legal RAG

    Fine-grained Claim-level RAG Benchmark for Law

    Souvick Das +2

  36. cs.CV 2026-05-20 reviewed
    0.5B driving model matches 7B models by adding future visual states

    Grounding Driving VLA via Inverse Kinematics

    Junsung Park +1

  37. cs.LG 2026-05-20 reviewed
    Vector quantization builds local calibration maps for multiclass models

    Divide et Calibra: Multiclass Local Calibration via Vector Quantization

    Cesare Barbera +4

  38. stat.ML 2026-05-20 reviewed
    Local boundary finds valid adjustment sets for causal effects

    Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions

    Zeyu Liu +5

  39. cs.CV 2026-05-20 reviewed
    Dynamic sinks raise dynamic degree in long video generation

    DySink: Dynamic Frame Sinks for Autoregressive Long Video Generation

    Bo Ye +4

  40. cs.CL 2026-05-20 reviewed
    Agent turns natural language into governed enterprise API calls

    Beyond Text-to-SQL: An Agentic LLM System for Governed Enterprise Analytics APIs

    Gundeep Singh +7

  41. cs.AI 2026-05-20 reviewed
    Off-the-shelf persona vectors rival targeted sycophancy steering

    Playing Devil's Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy

    Ishaan Kelkar +5

  42. cs.CL 2026-05-20 reviewed
    DABS cuts multi-aspect sentiment computation by up to 60%

    Single-Pass, Depth-Selective Reading for Multi-Aspect Sentiment Analysis

    Yan Xia +3

  43. cs.CV 2026-05-20 reviewed
    Landsat addition cuts TanDEM-X forest height RMSE by 13.5%

    Hybrid Machine Learning Model for Forest Height Estimation from TanDEM-X and Landsat Data

    Islam Mansour +3

  44. cs.CL 2026-05-20 reviewed
    Anchor regularization makes LLM safety consistent across prompt variations

    Towards Context-Invariant Safety Alignment for Large Language Models

    Yixu Wang +6

  45. cs.LG 2026-05-20 reviewed
    Flat minima enable non-vacuous bounds for transformers on sparse boolean tasks

    A Sharper Picture of Generalization in Transformers

    Paul Lintilhac +1

  46. cs.DC 2026-05-20 reviewed
    Routing imbalance in MoE stays fixed when expert parallelism scales

    Diagnosing Overhead in Dispatch Operations: Cross-architecture Observatory

    Bole Ma +3

  47. cs.CV 2026-05-20 reviewed
    VGG16 detects fake images at 91% accuracy

    Comparative Evaluation of Deep Learning Models for Fake Image Detection

    Akhitha Pakala +3

  48. cs.SE 2026-05-20 reviewed
    Refusal rate misranks LLMs on bio safety

    RefusalBench: Why Refusal Rate Misranks Frontier LLMs on Biological Research Prompts

    Lukas Weidener +4

  49. cs.CV 2026-05-20 reviewed
    Layer attention gaps reveal fix for LVLM hallucinations

    Finding the Correct Visual Evidence Without Forgetting: Mitigating Hallucination in LVLMs via Inter-Layer Visual Attention Discrepancy

    Yutong Xie +5

  50. cs.CV 2026-05-20 reviewed
    Focus-then-context method trims VLM tokens to 22% with tiny accuracy cost

    Focus-then-Context: Subject-Centric Progressive Visual Token Reduction for Vision-Language Models

    Yulin Zhao +4