Value-Aware Product Recommendation by Customer Segmentation using a suitable High-Dimensional Similarity Measure
Pith reviewed 2026-05-07 15:01 UTC · model grok-4.3
The pith
Encoding revenue into user-item data enables customer segmentation by purchase value for profitability-aligned recommendations
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By encoding revenue contributions directly into the user-item matrix and applying a high-dimensional similarity measure, the approach segments customers according to the revenue similarity of their purchase baskets. This segmentation supports recommendation strategies based on revenue share, product popularity within segments, and expected profit generation, offering an alternative to conventional similarity metrics that ignore value differences.
What carries the argument
The revenue-augmented user-item matrix with a tailored high-dimensional similarity measure, which computes customer likeness based on shared revenue contributions from products rather than purchase counts alone.
Load-bearing premise
That including revenue amounts in the similarity calculation will produce segments and recommendations that actually increase profitability more than standard methods do.
What would settle it
A controlled test on the UCI dataset where profit from the new recommendations is not higher than from traditional collaborative filtering baselines.
Figures
read the original abstract
This paper presents a novel value-aware approach to product recommendation that simultaneously addresses the high dimensionality and sparsity of user-item data while explicitly incorporating the contribution of each product and user to overall sales revenue. The proposed framework encodes revenue contributions in the user-item matrix and computes customer similarity directly on this basis using suitable distance measures. This enables the segmentation of users according to the revenue-based similarity of their purchase baskets and supports recommendations aligned with profitability objectives. We compare conventional similarity metrics with a novel alternative tailored to high-dimensional contexts and propose three recommendation strategies based on revenue share, product popularity, and expected profit generation. The effectiveness of the proposed method is validated through simulation experiments and a real-world application using the UCI Online Retail dataset.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a value-aware product recommendation framework that encodes revenue contributions directly into the user-item matrix, applies a high-dimensional similarity measure for customer segmentation based on revenue-weighted purchase baskets, and introduces three recommendation strategies (revenue share, product popularity, and expected profit). It claims this approach addresses sparsity and high dimensionality while aligning recommendations with profitability objectives, and validates the method via simulation experiments plus the UCI Online Retail dataset, comparing against conventional similarity metrics.
Significance. If the central claim holds and the revenue-encoded similarity demonstrably produces recommendations with higher realized profit than standard baselines, the work would offer a practical advance in business-oriented recommender systems by shifting evaluation from proxy accuracy metrics to direct value alignment. The idea of revenue-weighted similarity is conceptually simple and extensible, but its significance depends on closing the evaluation gap noted below.
major comments (1)
- Validation section (simulation and UCI Online Retail experiments): the paper reports clustering quality and recommendation performance using standard metrics but does not include a direct profitability comparison (e.g., total revenue or profit generated by the top-k recommended items under the three strategies versus conventional cosine or Jaccard on binary matrices). This is load-bearing for the claim that the method 'supports recommendations aligned with profitability objectives,' as the causal step from revenue encoding to improved business outcomes remains untested.
minor comments (2)
- Abstract: provides only a high-level description with no equations, performance numbers, error bars, or baseline results, which hinders immediate assessment of the 'suitable high-dimensional similarity measure' and the three strategies.
- The manuscript does not specify the exact form of the novel high-dimensional similarity measure (e.g., no equation showing how revenue is incorporated into the distance computation), making it difficult to reproduce or compare against existing weighted metrics.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback. We agree that strengthening the direct link between our revenue-encoded approach and realized profitability outcomes will improve the manuscript, and we outline specific revisions below.
read point-by-point responses
-
Referee: Validation section (simulation and UCI Online Retail experiments): the paper reports clustering quality and recommendation performance using standard metrics but does not include a direct profitability comparison (e.g., total revenue or profit generated by the top-k recommended items under the three strategies versus conventional cosine or Jaccard on binary matrices). This is load-bearing for the claim that the method 'supports recommendations aligned with profitability objectives,' as the causal step from revenue encoding to improved business outcomes remains untested.
Authors: We acknowledge that the current validation focuses on clustering quality (e.g., silhouette scores) and standard recommendation metrics (precision, recall) when comparing the revenue-weighted similarity measure against conventional cosine and Jaccard on binary matrices. While the simulation experiments illustrate how revenue encoding affects segmentation and the UCI Online Retail results demonstrate practical applicability, we agree these do not directly quantify the profit or revenue generated by the top-k recommendations under the three proposed strategies. In the revised manuscript we will add a dedicated profitability evaluation subsection. This will compute and report the total revenue (or profit) realized from the top-k items recommended by each of our three strategies versus the same strategies applied with standard cosine/Jaccard on binary data, using both the simulated datasets and the UCI Online Retail transactions. These new results will be presented alongside the existing metrics to close the evaluation gap. revision: yes
Circularity Check
No circularity: framework is a methodological proposal with empirical validation, not a self-referential derivation
full rationale
The abstract and summary describe encoding revenue into the user-item matrix, applying high-dimensional similarity measures for segmentation, and proposing three recommendation strategies (revenue share, popularity, expected profit). These are presented as novel but straightforward extensions of existing techniques, validated on simulation and the UCI Online Retail dataset. No equations, derivations, fitted parameters renamed as predictions, or self-citation chains appear that would reduce any claimed result to its inputs by construction. The profitability-alignment claim is an empirical hypothesis tested via experiments rather than a definitional or fitted tautology. This is a standard non-circular applied paper.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Abdollahpouri, H., Adomavicius, G., Burke, R., Guy, I., Jannach, D., Kamishima, T., Krasnodebski, J., and Pizzato, L. (2020). Multistakeholder recommendation: Survey and research directions. User Modeling and User-Adapted Interaction , 30(1):127--158
work page 2020
-
[2]
Aggarwal, C. C. et al. (2016). Recommender systems , volume 1. Springer
work page 2016
-
[3]
Alamdari, P. M., Navimipour, N. J., Hosseinzadeh, M., Safaei, A. A., and Darwesh, A. (2020). A systematic study on the recommender systems in the e-commerce. Ieee Access , 8:115694--115716
work page 2020
-
[4]
Alves Gomes, M. and Meisen, T. (2023). A review on customer segmentation methods for personalized customer targeting in e-commerce use cases. Information Systems and e-Business Management , 21(3):527--570
work page 2023
-
[5]
Bae, H. J., Shin, H., Kim, W., Long, S., Blumestein, G., Chang, M., Lewin-Eytan, Y., Huang, L., and Yom-Tov, E. (2025). Ranking items by the current-preferences and profits: A list-wise learning-to-rank approach to profit maximization. Proceedings of the ACM on Web Conference 2025 , pages 5010--5021
work page 2025
-
[6]
Bag, S., Kumar, S. K., and Tiwari, M. K. (2019). An efficient recommendation generation using relevant jaccard similarity. Information Sciences , 483:53--64
work page 2019
-
[7]
Bansal, M. A., Sharma, D. R., and Kathuria, D. M. (2022). A systematic review on data scarcity problem in deep learning: solution and applications. ACM Computing Surveys (Csur) , 54(10s):1--29
work page 2022
-
[8]
Beregovskaya, I. and Koroteev, M. (2021). Review of clustering-based recommender systems. arXiv preprint arXiv:2109.12839
-
[9]
Cai, Y. and Zhu, D. (2019). Trustworthy and profit: A new value-based neighbor selection method in recommender systems under shilling attacks. Decision Support Systems , 124:113112
work page 2019
-
[10]
Chen, D. (2012). Online Retail II . UCI Machine Learning Repository. DOI : https://doi.org/10.24432/C5CG6D
-
[11]
Chen, L.-S., Hsu, F.-H., Chen, M.-C., and Hsu, Y.-C. (2008). Developing recommender systems with the consideration of product profitability for sellers. Information Sciences , 178(4):1032--1048
work page 2008
-
[12]
Chen, Z., Gan, W., Wu, J., Hu, K., and Lin, H. (2025). Data scarcity in recommendation systems: A survey. ACM Transactions on Recommender Systems , 3(3):1--31
work page 2025
-
[13]
J., Umamakeswari, A., Priyatharsini, L., and Neyaa, A
Christy, A. J., Umamakeswari, A., Priyatharsini, L., and Neyaa, A. (2021). Rfm ranking – an effective approach to customer segmentation. Journal of King Saud University - Computer and Information Sciences , 33(10):1251--1257
work page 2021
-
[14]
Concha-Carrasco, J. A., Vega-Rodr \' guez, M. A., and P \'e rez, C. J. (2023). A multi-objective artificial bee colony approach for profit-aware recommender systems. Information Sciences , 625:476--488
work page 2023
-
[15]
De Biasio, A., Jannach, D., and Navarin, N. (2024a). Model-based approaches to profit-aware recommendation. Expert Systems with Applications , 249:123642
-
[16]
De Biasio, A., Montagna, A., Aiolli, F., and Navarin, N. (2023). A systematic review of value-aware recommender systems. Expert Systems with Applications , 226:120131
work page 2023
-
[17]
De Biasio, A., Navarin, N., and Jannach, D. (2024b). Economic recommender systems--a systematic review. Electronic Commerce Research and Applications , 63:101352
-
[18]
Fayyaz, Z., Ebrahimian, M., Nawara, D., Ibrahim, A., and Kashef, R. (2020). Recommendation systems: Algorithms, challenges, metrics, and business opportunities. applied sciences , 10(21):7748
work page 2020
-
[19]
Felfernig, A., Wundara, M., Tran, T. N. T., Polat-Erdeniz, S., Lubos, S., El Mansi, M., Garber, D., and Le, V.-M. (2023). Recommender systems for sustainability: overview and research issues. Frontiers in big Data , 6:1284511
work page 2023
-
[20]
Fkih, F. (2022). Similarity measures for collaborative filtering-based recommender systems: Review and experimental comparison. Journal of King Saud University-Computer and Information Sciences , 34(9):7645--7669
work page 2022
-
[21]
Garcin, F., Faltings, B., Donatsch, O., Alazzawi, A., Bruttin, C., and Huber, A. (2014). Offline and online evaluation of news recommender systems at swissinfo. ch. In Proceedings of the 8th ACM Conference on Recommender systems , pages 169--176
work page 2014
-
[22]
Hassanieh, L. A., Jaoudeh, C. A., Abdo, J. B., and Demerjian, J. (2018). Similarity measures for collaborative filtering recommender systems. In 2018 IEEE Middle East and North Africa Communications Conference (MENACOMM) , pages 1--5
work page 2018
-
[23]
Hossain, A. S. (2017). Customer segmentation using centroid based and density based clustering algorithms. In 2017 3rd International Conference on Electrical Information and Communication Technology (EICT) , pages 1--6. IEEE
work page 2017
-
[24]
Jaccard, P. (1901). Étude comparative de la distribution florale dans une portion des alpes et du jura. Bulletin de la Société Vaudoise des Sciences Naturelles , 37:547--579
work page 1901
-
[25]
Jannach, D. and Adomavicius, G. (2017). Price and profit awareness in recommender systems. In Proceedings of the Workshop on Value-Aware and Multistakeholder Recommendation (VAMS)
work page 2017
-
[26]
Johnson, R. A. and Wichern, D. W. (2007). Applied Multivariate Statistical Analysis . Pearson Prentice Hall, 6th edition
work page 2007
-
[27]
Järvelin, K. and Kekäläinen, J. (2002). Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems , 20(4):422--446
work page 2002
-
[28]
Kaufman, L. and Rousseeuw, P. J. (1987). Clustering by means of medoids. In Dodge, Y., editor, Statistical Data Analysis Based on the L1 Norm and Related Methods , pages 405--416
work page 1987
-
[29]
Kaufman, L. and Rousseeuw, P. J. (1990). Finding Groups in Data: An Introduction to Cluster Analysis . John Wiley & Sons
work page 1990
-
[30]
Ko, H., Lee, S., Park, Y., and Choi, A. (2022). A survey of recommendation systems: recommendation models, techniques, and application fields. Electronics , 11(1):141
work page 2022
-
[31]
Kompan, M., Gaspar, P., Macina, J., Cimerman, M., and Bielikova, M. (2021). Exploring customer price preference and product profit role in recommender systems. IEEE Intelligent Systems , 37(1):89--98
work page 2021
- [32]
-
[33]
D., Raghavan, P., and Sch \"u tze, H
Manning, C. D., Raghavan, P., and Sch \"u tze, H. (2008a). Introduction to Information Retrieval . Cambridge University Press, Cambridge, UK
-
[34]
D., Raghavan, P., and Schütze, H
Manning, C. D., Raghavan, P., and Schütze, H. (2008b). Introduction to Information Retrieval . Cambridge University Press
-
[35]
Modarres, R. (2022). A high dimensional dissimilarity measure. Computational Statistics & Data Analysis , 175:107560
work page 2022
-
[36]
Nemati, Y. and Khademolhosseini, H. (2020). Devising a profit-aware recommender system using multi-objective ga. Journal of Advances in Computer Research , 4(3):109
work page 2020
-
[37]
N., Ning, X., Desrosiers, C., and Karypis, G
Nikolakopoulos, A. N., Ning, X., Desrosiers, C., and Karypis, G. (2021). Trust your neighbors: A comprehensive survey of neighborhood-based methods for recommender systems. Recommender systems handbook , pages 39--89
work page 2021
-
[38]
Panniello, U., Hill, S., and Gorgoglione, M. (2016). The impact of profit incentives on the relevance of online recommendations. Electronic Commerce Research and Applications , 20:87--104
work page 2016
- [39]
-
[40]
Sarkar, S. and Ghosh, A. K. (2020). On perfect clustering of high dimension, low sample size data. IEEE Transactions on Pattern Analysis and Machine Intelligence , 42(11):2643--2656
work page 2020
-
[41]
Shao, B., Li, X., and Bian, G. (2021). A survey of research hotspots and frontier trends of recommendation systems from the perspective of knowledge graph. Expert Systems with Applications , 165:113764
work page 2021
-
[42]
Su, X. and Khoshgoftaar, T. M. (2009). A survey of collaborative filtering techniques. Advances in artificial intelligence , 2009(1):421425
work page 2009
-
[43]
Tan, P.-N., Steinbach, M., and Kumar, V. (2018). Introduction to Data Mining . Pearson, 2nd edition
work page 2018
-
[44]
Verma, V. and Aggarwal, R. K. (2020). A comparative analysis of similarity measures akin to the jaccard index in collaborative recommendations: empirical and theoretical perspective. Social Network Analysis and Mining , 10(1):43
work page 2020
-
[45]
Xia, Z., Sun, A., Xu, J., Peng, Y., Ma, R., and Cheng, M. (2024). Contemporary recommendation systems on big data and their applications: A survey. IEEE Access
work page 2024
-
[46]
Xiaojun, L. (2017). An improved clustering-based collaborative filtering recommendation algorithm. Cluster computing , 20(2):1281--1288
work page 2017
-
[47]
Y ld z, E., G \"u ng \"o r S en, C., and I s k, E. E. (2023). A hyper-personalized product recommendation system focused on customer segmentation: An application in the fashion retail industry. Journal of Theoretical and Applied Electronic Commerce Research , 18(1):571--596
work page 2023
-
[48]
Yu, J., Yin, H., Xia, X., Chen, T., Li, J., and Huang, Z. (2023). Self-supervised learning for recommender systems: A survey. IEEE Transactions on Knowledge and Data Engineering , 36(1):335--355
work page 2023
-
[49]
Zhao, L., Pan, S., Xiang, E., Zhong, E., Lu, Z., and Yang, Q. (2013). Active transfer learning for cross-system recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence , volume 27, pages 1205--1211
work page 2013
-
[50]
Zhu, J., Zhang, J., He, L., Wu, Q., Zhou, B., Zhang, C., and Yu, P. S. (2017). Broad learning based multi-source collaborative recommendation. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management , pages 1409--1418
work page 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.