Imperfect Response

Guorui Zhou, Honghui Bao, Jiaming Huang, Jiaxin Deng, Jinghao Zhang, Junda She, Kuo Cai, Lejian Ren, Lu Ren, Qiang Luo, Qianqian Wang, Qigen Hu, Rongzhou Zhang, Ruiming Tang, Shiyao Wang, Wuchao Li, Xiangyu Wu, Xinchen Luo, Xingmei Wang, Yi · 2025 · arXiv 2512.24762

5 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

RecRM-Bench: Benchmarking Multidimensional Reward Modeling for Agentic Recommender Systems

cs.IR · 2026-05-12 · unverdicted · novelty 7.0

RecRM-Bench is a new large-scale benchmark dataset and framework for multi-dimensional reward modeling in agentic recommender systems, spanning instruction following, factual consistency, query-item relevance, and user behavior prediction.

TubiFM: Unified Item, Carousel, and Search Ranking for Streaming Discovery

cs.IR · 2026-05-22 · unverdicted · novelty 6.0

A Llama-based model trained on serialized user stories unifies item, carousel, and search ranking and outperforms specialist baselines offline while improving some online metrics and reducing latency.

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

Uni-OPD unifies on-policy distillation across LLMs and MLLMs with dual-perspective strategies that promote student exploration and enforce order-consistent teacher supervision based on outcome rewards.

TriAlignGR: Triangular Multitask Alignment with Multimodal Deep Interest Mining for Generative Recommendation

cs.IR · 2026-05-05 · unverdicted · novelty 6.0 · 2 refs

TriAlignGR proposes a triangular multitask alignment framework with cross-modal semantic alignment, deep interest mining via chain-of-thought, and joint training on eight tasks to address content degradation and semantic opacity in Semantic ID-based generative recommendation.

Echoes in Filter Bubble: Diagnosing and Curing Popularity Bias in Generative Recommenders

cs.IR · 2026-05-16

citing papers explorer

Showing 5 of 5 citing papers.

RecRM-Bench: Benchmarking Multidimensional Reward Modeling for Agentic Recommender Systems

cs.IR · 2026-05-12 · unverdicted · none · ref 55

TubiFM: Unified Item, Carousel, and Search Ranking for Streaming Discovery

cs.IR · 2026-05-22 · unverdicted · none · ref 27

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

cs.LG · 2026-05-05 · unverdicted · none · ref 52

TriAlignGR: Triangular Multitask Alignment with Multimodal Deep Interest Mining for Generative Recommendation

cs.IR · 2026-05-05 · unverdicted · none · ref 35 · 2 links

Echoes in Filter Bubble: Diagnosing and Curing Popularity Bias in Generative Recommenders

cs.IR · 2026-05-16 · unreviewed · ref 47
