archive

Every paper Pith has read.

1286 papers in cs.IR · page 10

cs.IR 2026-04-22 reviewed

Normalization plus graph alignment removes LLM recommender training barrier
Break the Optimization Barrier of LLM-Enhanced Recommenders: A Theoretical Analysis and Practical Framework

Zhangchi Zhu +1
cs.SE 2026-04-22 reviewed

Hybrid detector finds 893k eliminable duplicate BDD steps
Reducing Maintenance Burden in Behaviour-Driven Development: A Paraphrase-Robust Duplicate-Step Detector with a 1.1M-Step Open Benchmark

Ali Hassaan Mughal +2
cs.IR 2026-04-22 reviewed

HaS cuts RAG retrieval latency by up to 37 percent
HaS: Accelerating RAG through Homology-Aware Speculative Retrieval

Peng Peng +4
cs.IR 2026-04-22 reviewed

Discrete tokens turn user multimodal interactions into personalized text and images
Discrete Preference Learning for Personalized Multimodal Generation

Yuting Zhang +8
cs.IR 2026-04-22 reviewed

Semantic recall fixes vector search evaluation by ignoring irrelevant neighbors
Semantic Recall for Vector Search

Leonardo Kuffo +5
cs.IR 2026-04-22 reviewed

Unified library gathers over twenty metrics for SPARQL query evaluation
T2S-Metrics: Unified Library for Evaluating SPARQL Queries Generated From Natural Language

Yousouf Taghzouti +9
cs.IR 2026-04-22 reviewed

R³AG picks retrievers by both relevance and answer-generation utility
R³AG: Retriever Routing for Retrieval-Augmented Generation

Tong Zhao +3
cs.IR 2026-04-22 reviewed

Model learns to judge when to retrieve external knowledge for entity recognition
SAKE: Self-aware Knowledge Exploitation-Exploration for Grounded Multimodal Named Entity Recognition

Jielong Tang +8
cs.CL 2026-04-22 reviewed

MLLM attributes sharpen product retrieval in e-commerce
AFMRL: Attribute-Enhanced Fine-Grained Multi-Modal Representation Learning in E-commerce

Biao Zhang +5
cs.IR 2026-04-22 reviewed

Knowledge graphs learn their own forgetting rates from data
Not All Memories Age the Same: Autodiscovery of Adaptive Decay in Knowledge Graphs

Mandar Karhade
cs.IR 2026-04-22 reviewed

LLM agents could turn hidden profiles into governable personalization
From Hidden Profiles to Governable Personalization: Recommender Systems in the Age of LLM Agents

Jiahao Liu +8
cs.IR 2026-04-21 reviewed

Five agents plus skill hub automate recsys config tuning
AgenticRecTune: Multi-Agent with Self-Evolving Skillhub for Recommendation System Optimization

Xidong Wu +9
cs.IR 2026-04-21 reviewed

MetaRAG reproduces with relative gains but lower absolute scores
A Reproducibility Study of Metacognitive Retrieval-Augmented Generation

Gabriel Iturra-Bocaz +1
cs.LG 2026-04-21 reviewed

4B agent beats larger models on research tasks with 10K open data
DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Venus Team +12
cs.IR 2026-04-21 reviewed

ECLASS boosts dense retrieval to 94% hit rate on component queries
ECLASS-Augmented Semantic Product Search for Electronic Components

Nico Baumgart +2
cs.IR 2026-04-21 reviewed

Benchmark shows CE trade-offs for recommenders vary by method and format
From Top-1 to Top-K: A Reproducibility Study and Benchmarking of Counterfactual Explanations for Recommender Systems

Quang-Huy Nguyen +8
cs.CL 2026-04-21 reviewed

AI conference reviews lengthened and standardized after LLMs
Impact of large language models on peer review opinions from a fine-grained perspective: Evidence from top conference proceedings in AI

Wenqing Wu +3
cs.IR 2026-04-21 reviewed

ColBERT embeddings aligned to clinical concepts for easier debugging
Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

François Remy
cs.IR 2026-04-21 reviewed

CTR models train with loops but infer in one pass
LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

Jiakai Tang +9
cs.IR 2026-04-21 reviewed

Structure matching in RAG improves factual answers by up to 50 points
Structure Guided Retrieval-Augmented Generation for Factual Queries

Miao Xie +3
cs.IR 2026-04-21 reviewed

Highlights and abstracts together improve keyword extraction
Enhancing Unsupervised Keyword Extraction in Academic Papers through Integrating Highlights with Abstract

Yi Xiang +1
cs.IR 2026-04-21 reviewed

Masking and retriever ensembles correct facts 14% better
Mask-to-Correct⁺: Leveraging Retriever Diversity for Masking-guided Faithful Fact Correction

Payel Santra +3
cs.IR 2026-04-21 reviewed

Semantic code transitions improve next-item predictions
CAST: Modeling Semantic-Level Transitions for Complementary-Aware Sequential Recommendation

Qian Zhang +3
cs.CL 2026-04-21 reviewed

IndiaFinBench gives first public test of LLMs on Indian financial rules
IndiaFinBench: An Evaluation Benchmark for Large Language Model Performance on Indian Financial Regulatory Text

Rajveer Singh Pall
cs.IR 2026-04-21 reviewed

CS3 lifts two-tower ad revenue 8.36% at ms latency
CS3: Efficient Online Capability Synergy for Two-Tower Recommendation

Lixiang Wang +4
cs.IR 2026-04-21 reviewed

Query-aware diffusion recovers relevant graph subgraphs with guarantees
Query-Aware Flow Diffusion for Graph-Based RAG with Retrieval Guarantees

Zhuoping Zhou +9
cs.IR 2026-04-21 reviewed

GraphRAG-IRL fuses graph IRL with LLM re-ranking for recommendations
GraphRAG-IRL: Personalized Recommendation with Graph-Grounded Inverse Reinforcement Learning and LLM Re-ranking

Siqi Liang +3
cs.IR 2026-04-21 reviewed

Feature optimization raises AI citation rates without quality loss
Think Before Writing: Feature-Level Multi-Objective Optimization for Generative Citation Visibility

Zikang Liu +1
cs.CL 2026-04-21 reviewed

Retriever performance drops sharply on redundant corpora
RARE: Redundancy-Aware Retrieval Evaluation Framework for High-Similarity Corpora

Hanjun Cho +1
cs.IR 2026-04-21 reviewed

Adapter with three MoE modules improves temporal graph event forecasts
STK-Adapter: Incorporating Evolving Graph and Event Chain for Temporal Knowledge Graph Extrapolation

Shuyuan Zhao +8
cs.AI 2026-04-21 reviewed

Most users rank LLMs differently from aggregate leaderboards
Personalized Benchmarking: Evaluating LLMs by Individual Preferences

Cristina Garbacea +2
cs.IR 2026-04-20 reviewed

Dual-view training raises instruction-following retrieval 45%
Dual-View Training for Instruction-Following Information Retrieval

Qingcheng Zeng +4
cs.AI 2026-04-20 reviewed

New dataset challenges AI on global math contests
MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

Shaden Alshammari +7
cs.IR 2026-04-20 reviewed

Text embeddings beat page images for science paper search
Document-as-Image Representations Fall Short for Scientific Retrieval

Ghazal Khalighinejad +3
cs.IR 2026-04-20 reviewed

Gaussian limits bound TF-IDF errors from lost tokens
Context-Aware Search and Retrieval Under Token Erasure

Sara Ghasvarianjahromi +3
cs.CL 2026-04-20 reviewed

ArbGraph arbitration cuts hallucinations in long-form RAG
ArbGraph: Conflict-Aware Evidence Arbitration for Reliable Long-Form Retrieval-Augmented Generation

Qingying Niu +4
cs.IR 2026-04-20 reviewed

Balanced clustering cuts recommender embeddings by 75%
Balanced Co-Clustering of Users and Items for Embedding Table Compression in Recommender Systems

Runhao Jiang +2
cs.IR 2026-04-20 reviewed

Trie guidance lets T5 and BART beat larger models on document queries
DocQAC: Adaptive Trie-Guided Decoding for Effective In-Document Query Auto-Completion

Rahul Mehta +5
cs.IR 2026-04-20 reviewed

Full-context LLM judging outperforms isolated checks for multi-hop RAG
Evaluating Multi-Hop Reasoning in RAG Systems: A Comparison of LLM-Based Retriever Evaluation Strategies

Lorenz Brehme +2
cs.IR 2026-04-20 reviewed

Multi-LLM filtering improves sequential recommendations without text
Multi-LLM Token Filtering and Routing for Sequential Recommendation

Wuhan Chen +5
cs.IR 2026-04-20 reviewed

Modular control fixes LLM final-layer weakness for recommendations
Modular Representation Compression: Adapting LLMs for Efficient and Effective Recommendations

Yunjia Xi +6
cs.HC 2026-04-20 reviewed

Human-AI collaboration gaps trace to mismatched grounding capacity
The Collaboration Gap in Human-AI Work

Varad Vishwarupe +3
cs.IR 2026-04-20 reviewed

Memory wins for precise finance math
Architecture Matters More Than Scale: A Comparative Study of Retrieval and Memory Augmentation for Financial QA Under SME Compute Constraints

Jianan Liu +6
cs.IR 2026-04-20 reviewed

Gaussian process beats passive LLM reranking for passage retrieval
Bayesian Active Learning with Gaussian Processes Guided by LLM Relevance Scoring for Dense Passage Retrieval

JunYoung Kim +7
cs.IR 2026-04-20 reviewed

RankUp lifts ad GMV up to 4.81% by preserving higher-rank representations
RankUp: Towards High-rank Representations for Large Scale Advertising Recommender Systems

Jin Chen +18
cs.IR 2026-04-20 reviewed

Semantic bridge enables private cross-domain recs without overlaps
FedCRF: A Federated Cross-domain Recommendation Method with Semantic-driven Deep Knowledge Fusion

Lei Guo +5
cs.IR 2026-04-20 reviewed

150k AI papers show must-cite retrieval still unsolved
MasterSet: A Large-Scale Benchmark for Must-Cite Citation Recommendation in the AI/ML Literature

Md Toyaha Rahman Ratul +4