Sharpness-Aware Poisoning: Enhancing Transferability of Injective Attacks on Recommender Systems
Pith reviewed 2026-05-08 12:29 UTC · model grok-4.3
The pith
Sharpness-aware minimization finds worst-case models to improve poisoning attack transferability across recommender systems.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By integrating the sharpness-aware minimization principle into the attack process, SharpAP seeks an approximately worst-case victim model and optimizes poisoned data specifically against it. The attack is cast as a min-max-min tri-level optimization problem. Integrating this step into iterative attack generation yields poisoned data less sensitive to model structure shifts and mitigates overfitting to the surrogate.
What carries the argument
Sharpness-Aware Poisoning (SharpAP), which applies sharpness-aware minimization to locate and optimize against an approximately worst-case victim model inside the poisoning loop.
If this is right
- Poisoned data becomes less sensitive to structural differences between the surrogate and actual victim models.
- Attack success improves on real victim models never seen during poison generation.
- The tri-level optimization integrates directly into existing iterative attack procedures.
- Experiments confirm stronger transfer on three real-world datasets compared with prior surrogate-only methods.
Where Pith is reading between the lines
- Similar sharpness-based optimization could be applied to adversarial attacks in other domains where the target model is unknown in advance.
- Recommender system defenders might adopt sharpness-aware training to reduce vulnerability to poisons optimized this way.
- The approach underscores that accounting for model variability during attack design can narrow the gap between black-box and white-box effectiveness.
Load-bearing premise
Sharpness-aware minimization can reliably locate an approximately worst-case victim model whose optimization produces poisoned data that transfers better to real victim models.
What would settle it
If SharpAP-generated poisoned data shows no higher attack success rate than standard surrogate-based poisons when evaluated on victim models with architectures clearly different from the surrogate, such as switching from matrix factorization to a deep neural recommender.
Figures
read the original abstract
Recommender Systems~(RS) have been shown to be vulnerable to injective attacks, where attackers inject limited fake user profiles to promote the exposure of target items to real users for unethical gains (e.g., economic or political advantages). Since attackers typically lack knowledge of the victim model deployed in the target RS, existing methods resort to using a fixed surrogate model to mimic the potential victim model. Despite considerable progress, we argue that the assumption that \textit{poisoned data generated for the surrogate model can be used to attack other victim models} is wishful. When there are significant structural discrepancies between the surrogate and victim models, the attack transferability inevitably suffers. Intuitively, if we can identify the worst-case victim model and iteratively optimize the poisoning effect specifically against it, then the generated poisoned data would be better transferred to other victim models. However, exactly identifying the worst-case victim model during the attack process is challenging due to the large space of victim models. To this end, in this work, we propose a novel attack method called Sharpness-Aware Poisoning (\textit{SharpAP}). Specifically, it employs the sharpness-aware minimization principle to seek the approximately worst-case victim model and optimizes the poisoned data specifically for this worst-case model. The poisoning attack with SharpAP is formulated as a min-max-min tri-level optimization problem. By integrating SharpAP into the iterative process for attacks, our method can generate more robust poisoned data which is less sensitive to the shift of model structure, mitigating the overfitting to the surrogate model. Comprehensive experimental comparisons on three real-world datasets demonstrate that \name~can significantly enhance the attack transferability.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Sharpness-Aware Poisoning (SharpAP) to improve the transferability of injective poisoning attacks on recommender systems. It formulates the attack generation as a min-max-min tri-level optimization that uses sharpness-aware minimization (SAM) to approximate a worst-case victim model around the surrogate, then optimizes the poisoned data specifically against that approximation. The central claim is that this produces poisoned profiles less sensitive to model-structure shifts (e.g., MF to NeuMF or GCN) than standard surrogate-based attacks, with experiments on three real-world datasets reported to show significant gains in attack success on unseen victims.
Significance. If the empirical gains hold and the mechanism is shown to operate as described, the work would offer a practical way to reduce surrogate overfitting in poisoning attacks, a known weakness when victim architectures differ from the attacker's surrogate. The tri-level formulation and application of SAM to discrete model-family shifts represent a novel technical direction that could influence both attack and defense research in RS security.
major comments (2)
- [§3.2] §3.2 (Method), Eq. (3)–(5): the inner maximization is performed via SAM-style perturbations in the surrogate parameter space, yet the paper provides no analysis showing that high-curvature directions around the surrogate parameters correspond to structural dissimilarity with unseen victim families. Because model architecture is discrete and non-differentiable, it is unclear why the local sharpness approximation yields a representative worst-case victim rather than simply regularizing the poison against the surrogate; this directly affects whether the claimed transferability improvement follows from the stated mechanism.
- [§4] §4 (Experiments): while the abstract states that SharpAP yields significant improvement on three datasets, the reported results must include (i) concrete metrics (HR@K, NDCG@K, attack success rate) with standard deviations across multiple runs, (ii) explicit comparison against both surrogate-only baselines and other transferability-enhancing attacks, and (iii) an ablation isolating the contribution of the SAM inner loop versus plain min-max. Without these, the magnitude and robustness of the transferability gain cannot be assessed.
minor comments (2)
- [§3] Notation: the tri-level objective is introduced without a clear statement of which variables are optimized in each level and which are held fixed; adding an explicit variable legend would improve readability.
- [§2] Related work: the discussion of prior injective attacks should cite the specific transferability limitations they exhibit (e.g., performance drop when victim is GCN vs. MF) rather than generic statements.
Simulated Author's Rebuttal
Thank you for your detailed review and constructive feedback on our manuscript. We have carefully considered each comment and provide point-by-point responses below. We will revise the paper to address the concerns raised, particularly by enhancing the experimental section and adding further discussion on the methodological justification.
read point-by-point responses
-
Referee: [§3.2] §3.2 (Method), Eq. (3)–(5): the inner maximization is performed via SAM-style perturbations in the surrogate parameter space, yet the paper provides no analysis showing that high-curvature directions around the surrogate parameters correspond to structural dissimilarity with unseen victim families. Because model architecture is discrete and non-differentiable, it is unclear why the local sharpness approximation yields a representative worst-case victim rather than simply regularizing the poison against the surrogate; this directly affects whether the claimed transferability improvement follows from the stated mechanism.
Authors: We thank the referee for highlighting this important aspect. Our motivation for using SAM is to find a worst-case perturbation in the surrogate's parameter space that maximizes the attack loss, based on the idea that such directions may correspond to models that are more dissimilar in behavior. Although architectures are discrete, the continuous parameter perturbations can simulate variations in model capacity or fitting that mimic structural differences, as evidenced by our cross-architecture experiments. We agree that a deeper theoretical analysis would be beneficial. In the revised manuscript, we will include additional discussion in Section 3 explaining this intuition with references to related work on sharpness and generalization, and acknowledge the limitations of the approximation. revision: partial
-
Referee: [§4] §4 (Experiments): while the abstract states that SharpAP yields significant improvement on three datasets, the reported results must include (i) concrete metrics (HR@K, NDCG@K, attack success rate) with standard deviations across multiple runs, (ii) explicit comparison against both surrogate-only baselines and other transferability-enhancing attacks, and (iii) an ablation isolating the contribution of the SAM inner loop versus plain min-max. Without these, the magnitude and robustness of the transferability gain cannot be assessed.
Authors: We appreciate this suggestion for improving the clarity and rigor of our experimental evaluation. The current manuscript reports HR@K and NDCG@K as attack success metrics on three datasets, but we will add standard deviations computed over multiple independent runs. We will also ensure explicit comparisons to surrogate-only attacks and include relevant transferability methods from the literature. Furthermore, we will add an ablation study comparing SharpAP to a variant without the SAM inner maximization (i.e., plain min-max optimization). These enhancements will be incorporated into the revised version of the paper. revision: yes
Circularity Check
No circularity: derivation relies on external SAM principle and independent experiments
full rationale
The paper formulates SharpAP as a min-max-min optimization that applies the established sharpness-aware minimization principle (from prior literature) to approximate a worst-case victim model during poisoning. No equation reduces by construction to a fitted parameter from the same data, no self-citation bears the central claim, and the transferability improvement is supported by experiments on three real-world datasets rather than by redefinition. The derivation chain is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Sharpness-aware minimization can be used to seek the approximately worst-case victim model during the attack process
Reference graph
Works this paper leans on
- [1]
-
[2]
Z. Chen, J. Zhang, Y . Kou, X. Chen, C.-J. Hsieh, and Q. Gu. Why does sharpness-aware minimization generalize better than sgd? InNeurIPS, pages 72325–72376, 2023
work page 2023
- [3]
-
[4]
E. Cho, S. A. Myers, and J. Leskovec. Friendship and mobility: user movement in location-based social networks. InKDD, pages 1082–1090, 2011
work page 2011
-
[5]
W. Fan, T. Derr, X. Zhao, Y . Ma, H. Liu, J. Wang, J. Tang, and Q. Li. Attacking black-box recommendations via copying cross-domain user profiles. InICDE, pages 1583–1594. IEEE, 2021
work page 2021
-
[6]
M. Fang, N. Z. Gong, and J. Liu. Influence function based data poisoning attacks to top-n recommender systems. InWWW, pages 3019–3025, 2020
work page 2020
- [7]
-
[8]
S. Guo, T. Bai, and W. Deng. Targeted shilling attacks on gnn-based recommender systems. InCIKM, pages 649–658, 2023
work page 2023
-
[9]
F. M. Harper and J. A. Konstan. The movielens datasets: History and context.TIIS, 5(4):1–19, 2015
work page 2015
-
[10]
P. He, H. Xu, J. Ren, Y . Cui, S. Zeng, H. Liu, C. Aggarwal, and J. Tang. Sharpness-aware data poisoning attack. InICLR, 2024
work page 2024
-
[11]
X. He, K. Deng, X. Wang, Y . Li, Y . Zhang, and M. Wang. Lightgcn: Simplifying and powering graph convolution network for recommenda- tion. InSIGIR, pages 639–648, 2020
work page 2020
-
[12]
R. Jin, J. Y . Chai, and L. Si. An automatic weighting scheme for collaborative filtering. InSIGIR, pages 337–344, 2004. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 14
work page 2004
-
[13]
N. S. Keskar, D. Mudigere, J. Nocedal, M. Smelyanskiy, and P. T. P. Tang. On large-batch training for deep learning: Generalization gap and sharp minima. InICLR, 2017
work page 2017
- [14]
-
[15]
J. Kwon, J. Kim, H. Park, and I. K. Choi. Asam: Adaptive sharpness- aware minimization for scale-invariant learning of deep neural networks. InICML, pages 5905–5914. PMLR, 2021
work page 2021
-
[16]
S. K. Lam and J. Riedl. Shilling recommender systems for fun and profit. InProceedings of the 13th international conference on World Wide Web, pages 393–402, 2004
work page 2004
-
[17]
B. Li, Y . Wang, A. Singh, and Y . V orobeychik. Data poisoning attacks on factorization-based collaborative filtering.NeurIPS, 29, 2016
work page 2016
-
[18]
H. Li, S. Di, and L. Chen. Revisiting injective attacks on recommender systems.NeurIPS, 35:29989–30002, 2022
work page 2022
-
[19]
H. Li, Z. Xu, G. Taylor, C. Studer, and T. Goldstein. Visualizing the loss landscape of neural nets.Advances in neural information processing systems, 31, 2018
work page 2018
-
[20]
T. Li, P. Zhou, Z. He, X. Cheng, and X. Huang. Friendly sharpness- aware minimization. InCVPR, pages 5631–5640, 2024
work page 2024
-
[21]
C. Lin, S. Chen, H. Li, Y . Xiao, L. Li, and Q. Yang. Attacking recommender systems with augmented user profiles. InCIKM, pages 855–864, 2020
work page 2020
-
[22]
C. Lin, S. Chen, M. Zeng, S. Zhang, M. Gao, and H. Li. Shilling black- box recommender systems by learning to generate fake user profiles. IEEE TNNLS, 35(1):1305–1319, 2022
work page 2022
-
[23]
J. McAuley, C. Targett, Q. Shi, and A. Van Den Hengel. Image-based recommendations on styles and substitutes. InSIGIR, pages 43–52, 2015
work page 2015
-
[24]
B. Mehta and W. Nejdl. Unsupervised strategies for shilling detection and robust collaborative filtering.User Modeling and User-Adapted Interaction, 19:65–97, 2009
work page 2009
-
[25]
B. Mobasher, R. Burke, R. Bhaumik, and C. Williams. Toward trustworthy recommender systems: An analysis of attack models and algorithm robustness.ACM Transactions on Internet Technology (TOIT), 7(4):23–es, 2007
work page 2007
- [26]
- [27]
-
[28]
J. Tang and K. Wang. Personalized top-n sequential recommendation via convolutional sequence embedding. InWSDM, pages 565–573, 2018
work page 2018
-
[29]
J. Tang, H. Wen, and K. Wang. Revisiting adversarially learned injection attacks against recommender systems. InRecSys, pages 318–327, 2020
work page 2020
-
[30]
C. Wang, H. Zhu, C. Zhu, C. Qin, and H. Xiong. Setrank: A setwise bayesian approach for collaborative ranking from implicit feedback. In AAAI, volume 34, pages 6127–6136, 2020
work page 2020
-
[31]
W. Wang, C. Wang, F. Feng, W. Shi, D. Ding, and T.-S. Chua. Uplift modeling for target user attacks on recommender systems. InWWW, pages 3343–3354, 2024
work page 2024
-
[32]
Z. Wang, M. Gao, J. Li, J. Zhang, and J. Zhong. Gray-box shilling attack: An adversarial learning approach.TIST, 13(5):1–21, 2022
work page 2022
-
[33]
Z. Wang, M. Gao, J. Yu, X. Gao, Q. V . H. Nguyen, S. Sadiq, and H. Yin. Id-free not risk-free: Llm-powered agents unveil risks in id- free recommender systems. InProceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1902–1911, 2025
work page 1902
-
[34]
Z. Wang, M. Gao, J. Yu, S. Sadiq, H. Yin, and L. Liu. When graph contrastive learning backfires: Spectral vulnerability and defense in recommendation.ACM Transactions on Information Systems, 2025
work page 2025
-
[35]
Z. Wang, J. Yu, M. Gao, H. Yin, B. Cui, and S. Sadiq. Unveiling vulnerabilities of contrastive recommender systems to poisoning attacks. InKDD, pages 3311–3322, 2024
work page 2024
- [36]
-
[37]
K. Wen, T. Ma, and Z. Li. How sharpness-aware minimization minimizes sharpness? InICLR, 2023
work page 2023
-
[38]
C. Wu, D. Lian, Y . Ge, Z. Zhu, and E. Chen. Triple adversarial learning for influence based poisoning attack in recommender systems. InKDD, pages 1830–1840, 2021
work page 2021
-
[39]
C. Wu, D. Lian, Y . Ge, Z. Zhu, E. Chen, and S. Yuan. Fight fire with fire: towards robust recommender systems via adversarial poisoning training. InSIGIR, pages 1074–1083, 2021
work page 2021
- [40]
-
[41]
J. Wu, X. Wang, F. Feng, X. He, L. Chen, J. Lian, and X. Xie. Self- supervised graph learning for recommendation. InSIGIR, pages 726– 735, 2021
work page 2021
-
[42]
L. Wu, X. He, X. Wang, K. Zhang, and M. Wang. A survey on accuracy-oriented neural recommendation: From collaborative filtering to information-rich recommendation.IEEE TKDE, 35(5):4425–4445, 2022
work page 2022
-
[43]
L. Wu, P. Sun, R. Hong, Y . Ge, and M. Wang. Collaborative neural social recommendation.IEEE transactions on systems, man, and cybernetics: systems, 51(1):464–476, 2018
work page 2018
-
[44]
G. Yang, N. Z. Gong, and Y . Cai. Fake co-visitation injection attacks to recommender systems. InNDSS, 2017
work page 2017
-
[45]
Y . Yang, Z. Wu, L. Wu, K. Zhang, R. Hong, Z. Zhang, J. Zhou, and M. Wang. Generative-contrastive graph learning for recommendation. InSIGIR, pages 1117–1126, 2023
work page 2023
-
[46]
J. Yu, H. Yin, X. Xia, T. Chen, L. Cui, and Q. V . H. Nguyen. Are graph augmentations necessary? simple graph contrastive learning for recommendation. InSIGIR, pages 1294–1303, 2022
work page 2022
- [47]
- [48]
- [49]
- [50]
-
[51]
Y . Zhao, T. Chen, J. Yu, K. Zheng, L. Cui, and H. Yin. Diversity- aware dual-promotion poisoning attack on sequential recommendation. InSIGIR, pages 1634–1644, 2025
work page 2025
- [52]
-
[53]
G. Zhou, X. Zhu, C. Song, Y . Fan, H. Zhu, X. Ma, Y . Yan, J. Jin, H. Li, and K. Gai. Deep interest network for click-through rate prediction. In KDD, pages 1059–1068, 2018
work page 2018
-
[54]
Y . Zhou, Y . Qu, X. Xu, and H. Shen. Imbsam: A closer look at sharpness-aware minimization in class-imbalanced recognition. In CVPR, pages 11345–11355, 2023. Junsong Xieis currently pursuing a Ph.D. degree at Hefei University of Technology (HFUT), China. He received the master’s degree from the University of Science and Technology of China (USTC). He has...
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.