ANCHOR: Agentic Noise Creation Framework for Human Simulation and Denoising Recommendation
Pith reviewed 2026-06-27 23:52 UTC · model grok-4.3
The pith
Recommendation denoising shifts from heuristics to supervised learning by creating labeled noise examples with an agent simulator.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By adopting a recommender-in-the-loop agentic architecture to synthesize diverse out-of-preference noise through five extensible simulation mechanisms and informative boundary-adjacent noise through adversarial boundary refinement, the framework generates labeled noisy interactions that train a reusable parametric recognizer integrating collaborative and semantic signals, thereby transforming recommendation denoising from heuristic filtering into supervised learning.
What carries the argument
The Creation-Recognition paradigm, in which an agentic simulator proactively generates labeled noise for training a dedicated recognizer.
If this is right
- Denoising becomes a supervised learning problem with explicit labels rather than unsupervised inference.
- The generated labels cover both diverse out-of-preference noise and challenging boundary-adjacent cases.
- The resulting recognizer can be reused on real interaction data to clean user preference signals.
- True user preferences are better preserved by reducing misidentification of noise.
Where Pith is reading between the lines
- The same creation step could supply training data for noise detection in other implicit-feedback domains such as search logs or social media.
- If the recognizer proves reusable across datasets, it could lower dependence on dataset-specific heuristics in production systems.
- Integrating the agent with more advanced user-behavior models might further close the gap between simulated and real noise distributions.
Load-bearing premise
The noise distributions produced by the agent's simulation mechanisms and boundary refinement are close enough to real-world noisy implicit feedback for the recognizer to generalize.
What would settle it
A benchmark experiment where the trained recognizer is applied to a dataset containing known real noise annotations and fails to outperform strong heuristic baselines on noise identification accuracy or downstream recommendation metrics.
Figures
read the original abstract
Distilling accurate user preferences from noisy implicit feedback remains a fundamental bottleneck in recommendation systems, highlighting the need for recommendation denoising. However, real-world data lack explicit noise annotations, forcing existing methods to rely on unsupervised side information or handcrafted heuristics. These approaches often incur high external costs, generalize poorly, or depend on unreliable priors, causing noise misidentification and corrupting true user preference representations. To address these limitations, we propose a paradigm-level reformulation of recommendation denoising. Instead of indirectly inferring noisy interactions through heuristics, our Creation-Recognition paradigm proactively creates labeled noisy interactions and trains a dedicated recognizer to identify them, transforming denoising from heuristic filtering into supervised learning. Based on this paradigm, we present ANCHOR, an agent-based framework inspired by recent LLM-as-User research. ANCHOR simulates user behaviors to generate realistic noise labels and enables supervised denoising through two stages: noise creation and noise recognition. In the noise creation stage, ANCHOR adopts a recommender-in-the-loop agentic architecture to synthesize both diverse out-of-preference noise and informative boundary-adjacent noise. For out-of-preference noise, it implements five extensible simulation mechanisms to approximate major sources of noisy implicit feedback. For boundary-adjacent noise, an adversarial boundary refinement mechanism generates ambiguous interactions that challenge the recognizer and target the decision boundary. In the noise recognition stage, ANCHOR leverages the generated labels to train a reusable parametric recognizer that integrates collaborative signals and semantic representations to detect noise patterns in real interaction data.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes the Creation-Recognition paradigm for recommendation denoising, arguing that real-world implicit feedback lacks noise labels and forces reliance on heuristics. It introduces ANCHOR, an LLM-as-User agentic framework with a recommender-in-the-loop architecture. In the noise creation stage, five extensible simulation mechanisms generate out-of-preference noise while an adversarial boundary refinement produces boundary-adjacent noise; the resulting labeled synthetic interactions are then used in the noise recognition stage to train a parametric recognizer that combines collaborative signals and semantic representations for application to real data.
Significance. If the synthetic noise distributions prove representative of real-world sources, the shift from heuristic filtering to supervised recognition could reduce dependence on unreliable priors and improve denoising fidelity. The manuscript offers a conceptually coherent framework but contains no experiments, ablation studies, downstream recommendation metrics, or validation of simulation fidelity, so its practical significance remains unassessable from the presented material.
major comments (2)
- [Abstract] Abstract and noise creation stage: the central claim that labels produced by the five simulation mechanisms plus adversarial refinement are sufficiently representative for the recognizer to generalize rests on an unverified distributional match; the manuscript provides no experiments, distributional comparisons, or downstream performance results to test this assumption.
- [Noise recognition stage] Noise recognition stage: the description of the reusable parametric recognizer that integrates collaborative signals and semantic representations lacks any architectural details, training procedure, or loss formulation, rendering the supervised-learning transformation impossible to evaluate or reproduce.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback highlighting the need for empirical validation and technical specificity. We address the major comments point by point below.
read point-by-point responses
-
Referee: [Abstract] Abstract and noise creation stage: the central claim that labels produced by the five simulation mechanisms plus adversarial refinement are sufficiently representative for the recognizer to generalize rests on an unverified distributional match; the manuscript provides no experiments, distributional comparisons, or downstream performance results to test this assumption.
Authors: We agree that the representativeness of the synthetic noise labels is a central assumption requiring verification. The submitted manuscript focuses on defining the Creation-Recognition paradigm and the agentic architecture of ANCHOR. In the revision we will add (1) distributional comparisons between the generated noise and real-world implicit feedback patterns (using available proxy metrics such as interaction entropy or category deviation) and (2) downstream recommendation experiments that measure denoising impact on standard ranking metrics. revision: yes
-
Referee: [Noise recognition stage] Noise recognition stage: the description of the reusable parametric recognizer that integrates collaborative signals and semantic representations lacks any architectural details, training procedure, or loss formulation, rendering the supervised-learning transformation impossible to evaluate or reproduce.
Authors: We acknowledge that the current description of the noise recognition stage remains at a high level. The revised manuscript will specify the recognizer architecture (e.g., a hybrid model combining graph-based collaborative embeddings with text-derived semantic features), the supervised training procedure on the synthetic labeled set, and the concrete loss function (binary cross-entropy with optional contrastive regularization) used to train the noise detector. revision: yes
Circularity Check
No circularity: framework is a novel construction without self-referential derivations or fitted predictions.
full rationale
The paper proposes a new Creation-Recognition paradigm and ANCHOR framework for generating synthetic noise labels via agentic simulation and training a recognizer. No equations, parameters fitted to data subsets, or self-citations are presented as load-bearing for a derivation. The central claim is the construction itself, not a reduction of a result to prior fitted inputs or self-cited uniqueness theorems. This matches the default expectation of a self-contained proposal against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Lisa P Argyle, Ethan C Busby, Nancy Fulda, Joshua R Gubler, Christopher Rytting, and David Wingate. 2023. Out of one, many: Using language models to simulate human samples.Political Analysis31, 3 (2023), 337–351
2023
-
[2]
Georg Buscher, Ludger Van Elst, and Andreas Dengel. 2009. Segment-level display time as implicit feedback: a comparison to eye tracking. InProceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. 67–74
2009
-
[3]
Yixin Cao, Xiang Wang, Xiangnan He, Zikun Hu, and Tat-Seng Chua. 2019. Unifying knowledge graph learning and recommendation: Towards a better understanding of user preferences. InThe world wide web conference. 151–161
2019
-
[4]
Nick Craswell, Onno Zoeter, Michael Taylor, and Bill Ramsey. 2008. An ex- perimental comparison of click position-bias models. InProceedings of the 2008 international conference on web search and data mining. 87–94
2008
-
[5]
Jingtao Ding, Guanghui Yu, Xiangnan He, Fuli Feng, Yong Li, and Depeng Jin
-
[6]
IEEE transactions on knowledge and data engineering33, 2 (2019), 667–681
Sampler design for bayesian personalized ranking by leveraging view data. IEEE transactions on knowledge and data engineering33, 2 (2019), 667–681
2019
-
[7]
Xin Dong, Lei Yu, Zhonghuo Wu, Yuxia Sun, Lingfeng Yuan, and Fangxi Zhang
-
[8]
InProceedings of the AAAI Conference on artificial intelligence, Vol
A hybrid collaborative filtering model with deep structure for recommender systems. InProceedings of the AAAI Conference on artificial intelligence, Vol. 31
-
[9]
Xin Fu. 2010. Towards a model of implicit feedback for web search.Journal of the American Society for Information Science and Technology61, 1 (2010), 30–49
2010
-
[10]
Yunjun Gao, Yuntao Du, Yujia Hu, Lu Chen, Xinjun Zhu, Ziquan Fang, and Baihua Zheng. 2022. Self-guided learning to denoise for robust recommendation. InProceedings of the 45th international ACM SIGIR conference on research and development in information retrieval. 1412–1422
2022
-
[11]
Yongqiang Han, Hao Wang, Kefan Wang, Likang Wu, Zhi Li, Wei Guo, Yong Liu, Defu Lian, and Enhong Chen. 2024. Efficient noise-decoupling for multi-behavior sequential recommendation. InProceedings of the ACM Web Conference 2024. 3297–3306
2024
-
[12]
Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. InProceedings of the 40th International ACM SIGIR conference on Research and Development in Information Retrieval. 355–364
2017
-
[13]
Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, and Meng Wang. 2020. Lightgcn: Simplifying and powering graph convolution network for recommendation. InProceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval. 639–648
2020
-
[14]
Zhuangzhuang He, Yifan Wang, Yonghui Yang, Peijie Sun, Le Wu, Haoyue Bai, Jinqi Gong, Richang Hong, and Min Zhang. 2024. Double correction frame- work for denoising recommendation. InProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1062–1072
2024
-
[15]
Katja Hofmann, Fritz Behr, and Filip Radlinski. 2012. On caption bias in inter- leaving experiments. InProceedings of the 21st ACM international conference on Information and knowledge management. 115–124
2012
-
[16]
Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, and Aldo Lipani. 2023. Unlocking the potential of user feedback: Leveraging large language model as user simulators to enhance dialogue system. InProceedings of the 32nd ACM International Conference on Information and Knowledge Management. 3953–3957
2023
-
[17]
Zheng Hu, Zhe Li, Ziyun Jiao, Satoshi Nakagawa, Jiawen Deng, Shimin Cai, Tao Zhou, and Fuji Ren. 2025. Bridging the user-side knowledge gap in knowledge- aware recommendations with large language models. InProceedings of the AAAI Conference on Artificial Intelligence. 11799–11807
2025
-
[18]
Anastasiia Klimashevskaia, Dietmar Jannach, Mehdi Elahi, and Christoph Trat- tner. 2024. A survey on popularity bias in recommender systems.User Modeling and User-Adapted Interaction34, 5 (2024), 1777–1834
2024
-
[19]
Tommaso Di Noia, Vito Claudio Ostuni, Paolo Tomeo, and Eugenio Di Sciascio
-
[20]
Sprank: Semantic path-based ranking for top-n recommendations using linked open data.ACM Transactions on Intelligent Systems and Technology (TIST) 8, 1 (2016), 1–34
2016
-
[21]
Yuhan Quan, Jingtao Ding, Chen Gao, Lingling Yi, Depeng Jin, and Yong Li. 2023. Robust preference-guided denoising for graph based social recommendation. In Proceedings of the ACM web conference 2023. 1097–1108
2023
- [22]
-
[23]
Xubin Ren, Lianghao Xia, Jiashu Zhao, Dawei Yin, and Chao Huang. 2023. Disen- tangled contrastive collaborative filtering. InProceedings of the 46th international ACM SIGIR conference on research and development in information retrieval. 1137– 1146
2023
-
[24]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme
-
[25]
BPR: Bayesian personalized ranking from implicit feedback.arXiv preprint arXiv:1205.2618(2012)
work page internal anchor Pith review Pith/arXiv arXiv 2012
-
[26]
Usha Ruby, Vamsidhar Yendapalli, et al. 2020. Binary cross entropy with deep learning technique for image classification.Int. J. Adv. Trends Comput. Sci. Eng9, 10 (2020)
2020
-
[27]
Youchen Sun, Zhu Sun, Yingpeng Du, Jie Zhang, and Yew Soon Ong. 2025. Model- agnostic social network refinement with diffusion models for robust social rec- ommendation. InProceedings of the ACM on Web Conference 2025. 370–378
2025
-
[28]
Yatong Sun, Bin Wang, Zhu Sun, and Xiaochun Yang. 2021. Does Every Data In- stance Matter? Enhancing Sequential Recommendation by Eliminating Unreliable Data.. InIJCAI. 1579–1585
2021
-
[29]
Gu Tang, Xiaoying Gan, Jinghe Wang, Bin Lu, Lyuwen Wu, Luoyi Fu, and Chenghu Zhou. 2024. EditKG: Editing knowledge graph for recommendation. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 112–122
2024
-
[30]
Lei Wang, Jingsen Zhang, Hao Yang, Zhi-Yuan Chen, Jiakai Tang, Zeyu Zhang, Xu Chen, Yankai Lin, Hao Sun, Ruihua Song, et al. 2025. User behavior simulation with large language model-based agents.ACM Transactions on Information Systems43, 2 (2025), 1–37
2025
-
[31]
Shuyao Wang, Zhi Zheng, Yongduo Sui, and Hui Xiong. 2025. Unleashing the Power of Large Language Model for Denoising Recommendation. InProceedings of the ACM on Web Conference 2025. 252–263
2025
-
[32]
Wenjie Wang, Fuli Feng, Xiangnan He, Liqiang Nie, and Tat-Seng Chua. 2021. Denoising implicit feedback for recommendation. InProceedings of the 14th ACM international conference on web search and data mining. 373–381
2021
-
[33]
Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, and Ming Zhou
-
[34]
Minilm: Deep self-attention distillation for task-agnostic compression of pre-trained transformers.Advances in neural information processing systems33 (2020), 5776–5788
2020
-
[35]
Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Yanbin Lu, Xiaojiang Huang, and Yingzhen Yang. 2024. Recmind: Large language model powered agent for recommendation. InFindings of the Association for Computational Linguistics: NAACL 2024. 4351–4364
2024
-
[36]
Yu Wang, Xin Xin, Zaiqiao Meng, Joemon M Jose, Fuli Feng, and Xiangnan He. 2022. Learning robust recommenders through cross-model agreement. In Proceedings of the ACM web conference 2022. 2015–2025
2022
-
[37]
Zongwei Wang, Min Gao, Wentao Li, Junliang Yu, Linxin Guo, and Hongzhi Yin
-
[38]
InProceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining
Efficient bi-level optimization for recommendation denoising. InProceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining. 2502–2511
- [39]
-
[40]
Zhefan Wang, Yuanqing Yu, Wendi Zheng, Weizhi Ma, and Min Zhang. 2024. Macrec: A multi-agent collaboration framework for recommendation. InProceed- ings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2760–2764
2024
-
[41]
Xin Xin, Xiangyuan Liu, Hanbing Wang, Pengjie Ren, Zhumin Chen, Jiahuan Lei, Xinlei Shi, Hengliang Luo, Joemon M Jose, Maarten de Rijke, et al . 2023. Improving implicit feedback-based recommendation through multi-behavior alignment. InProceedings of the 46th international ACM SIGIR conference on research and development in information retrieval. 932–941
2023
-
[42]
An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, and Tat-Seng Chua. 2024. On generative agents in recommendation. InProceedings of the 47th international ANCHOR: Agentic Noise Creation Framework for Human Simulation and Denoising Recommendation Conference’17, July 2017, Washington, DC, USA ACM SIGIR conference on research and development in Information Ret...
2024
-
[43]
Shuai Zhang, Hua Chu, Jianan Li, Yangtao Zhou, Shirong Wang, and Qiaofei Sun. 2025. DeMBR: Denoising Model with Memory Pruning and Semantic Guidance for Multi-Behavior Recommendation. InProceedings of the Eighteenth ACM International Conference on Web Search and Data Mining. 521–529
2025
-
[44]
Yu Zhang, Shutong Qiao, Jiaqi Zhang, Tzu-Heng Lin, Chen Gao, and Yong Li
- [45]
-
[46]
Qian Zhao, Shuo Chang, F Maxwell Harper, and Joseph A Konstan. 2016. Gaze prediction for recommender systems. InProceedings of the 10th ACM Conference on Recommender Systems. 131–138
2016
-
[47]
near-miss
Xinjun Zhu, Yuntao Du, Yuren Mao, Lu Chen, Yujia Hu, and Yunjun Gao. 2023. Knowledge-refined denoising network for robust recommendation. InProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 362–371. A Appendix A.1 User Profiling Prompt An example prompt used to initialize the agent with a deep...
2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.