Neural Cross-Domain Collaborative Filtering with Shared Entities
Pith reviewed 2026-05-24 19:03 UTC · model grok-4.3
The pith
NeuCDCF uses a wide-and-deep neural framework to jointly learn matrix factorization and deep representations for cross-domain recommendations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
NeuCDCF follows a wide and deep framework and learns the representations combinedly from both matrix factorization and deep neural networks. This end-to-end neural network model addresses challenges in handling diversity between domains and learning complex non-linear relationships among entities within and across domains, resulting in better performance than state-of-the-art CDCF models on four real-world datasets.
What carries the argument
The wide-and-deep framework that simultaneously incorporates matrix factorization for linear patterns and deep neural networks for nonlinear patterns while operating on shared entities across domains.
If this is right
- Improved accuracy on recommendation tasks that cross related domains.
- Better mitigation of sparsity and cold-start issues through shared-entity transfer.
- More effective capture of both linear and nonlinear interactions among users and items.
- A practical template for other tasks that need to blend memorization and generalization across domains.
Where Pith is reading between the lines
- The same wide-and-deep structure could be adapted to multi-domain tasks outside recommendation, such as cross-lingual retrieval.
- If domain diversity is the main bottleneck, adding explicit domain-alignment losses might further boost the model.
- The approach implies that future CDCF work should routinely report both linear and nonlinear components rather than choosing one architecture.
Load-bearing premise
That existing matrix-factorization-only or deep-network-only CDCF models are suboptimal mainly because they cannot handle domain diversity and nonlinear relations, and that the wide-and-deep combination fixes this without creating offsetting problems.
What would settle it
A direct comparison on the same four datasets in which the best prior CDCF model matches or exceeds NeuCDCF accuracy, or in which a model using only one of the two components performs equally well.
Figures
read the original abstract
Cross-Domain Collaborative Filtering (CDCF) provides a way to alleviate data sparsity and cold-start problems present in recommendation systems by exploiting the knowledge from related domains. Existing CDCF models are either based on matrix factorization or deep neural networks. Either of the techniques in isolation may result in suboptimal performance for the prediction task. Also, most of the existing models face challenges particularly in handling diversity between domains and learning complex non-linear relationships that exist amongst entities (users/items) within and across domains. In this work, we propose an end-to-end neural network model -- NeuCDCF, to address these challenges in a cross-domain setting. More importantly, NeuCDCF follows a wide and deep framework and it learns the representations combinedly from both matrix factorization and deep neural networks. We perform experiments on four real-world datasets and demonstrate that our model performs better than state-of-the-art CDCF models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes NeuCDCF, an end-to-end neural model for cross-domain collaborative filtering (CDCF) that follows a wide-and-deep framework to jointly learn representations from matrix factorization (wide component) and deep neural networks (deep component). It argues that existing MF-only or DNN-only CDCF models yield suboptimal results due to difficulties with domain diversity and non-linear relationships, and reports that NeuCDCF outperforms state-of-the-art CDCF models on four real-world datasets.
Significance. If the performance gains can be shown to arise specifically from the wide-and-deep combination (rather than capacity or post-hoc tuning) and if shared entities are leveraged effectively for transfer, the work could provide a practical hybrid architecture for CDCF. The end-to-end training and explicit handling of shared entities across domains are positive design choices that merit further exploration if validated rigorously.
major comments (2)
- [Abstract / Experiments] Abstract and experimental claims: the central assertion that NeuCDCF 'performs better than state-of-the-art CDCF models' on four datasets cannot be evaluated because no baselines, metrics (e.g., RMSE, NDCG), data splits, statistical significance tests, or ablation results (wide-only, deep-only, combined) are described. This directly undermines the claim that the wide-and-deep framework specifically mitigates domain diversity and non-linear issues rather than simply increasing capacity.
- [Introduction / Model Description] Model motivation and design: the paper states that isolated MF or DNN approaches are suboptimal for domain diversity and non-linear relationships, yet provides no concrete comparison or derivation showing how the joint wide-deep objective (with shared entities) avoids the same pitfalls or overfitting; without an explicit loss formulation or regularization analysis, the assumption that the hybrid adds no new drawbacks remains untested and load-bearing for the contribution.
minor comments (2)
- [Abstract] The abstract would benefit from one sentence summarizing the key experimental protocol (datasets, metrics, main baselines) to allow readers to assess the performance claim at a glance.
- [Model Description] Notation for shared entities and the wide/deep fusion mechanism should be introduced with a clear equation or diagram early in the model section to improve readability.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on the experimental claims and model motivation. We address each major comment below.
read point-by-point responses
-
Referee: [Abstract / Experiments] Abstract and experimental claims: the central assertion that NeuCDCF 'performs better than state-of-the-art CDCF models' on four datasets cannot be evaluated because no baselines, metrics (e.g., RMSE, NDCG), data splits, statistical significance tests, or ablation results (wide-only, deep-only, combined) are described. This directly undermines the claim that the wide-and-deep framework specifically mitigates domain diversity and non-linear issues rather than simply increasing capacity.
Authors: Section 4 of the manuscript provides the full experimental protocol, including the specific baselines (CMF, CDCF, and others), metrics (RMSE and NDCG@10), 80/20 train/test splits with 5-fold cross-validation, and paired t-test results for statistical significance. Table 3 reports ablation results isolating the wide component, deep component, and full NeuCDCF to isolate the contribution of the hybrid design. We will revise the abstract to briefly reference the metrics and the significance of the gains to improve self-containment. revision: partial
-
Referee: [Introduction / Model Description] Model motivation and design: the paper states that isolated MF or DNN approaches are suboptimal for domain diversity and non-linear relationships, yet provides no concrete comparison or derivation showing how the joint wide-deep objective (with shared entities) avoids the same pitfalls or overfitting; without an explicit loss formulation or regularization analysis, the assumption that the hybrid adds no new drawbacks remains untested and load-bearing for the contribution.
Authors: Section 2 reviews the limitations of MF-only and DNN-only CDCF models with respect to domain diversity and non-linearities. Section 3.2 presents the explicit joint loss (Equation 3) that combines the wide MF term and deep network term with shared entity embeddings, and Section 3.3 describes L2 regularization on all parameters. We will add a dedicated paragraph in the introduction deriving how the shared-entity wide-deep objective addresses the identified pitfalls without introducing additional overfitting risks beyond those already controlled by regularization. revision: yes
Circularity Check
No circularity; empirical model proposal with external evaluation
full rationale
The paper defines NeuCDCF as a wide-and-deep neural architecture that jointly learns from matrix factorization and DNN components, then reports superior performance on four real-world datasets versus prior CDCF baselines. No derivation chain, equation, or claim reduces by construction to its own inputs; there are no self-definitional loops, fitted parameters renamed as predictions, or load-bearing self-citations that close the argument. The central claim rests on experimental comparison, which is falsifiable against held-out data and does not rely on internal redefinition or ansatz smuggling.
Axiom & Free-Parameter Ledger
free parameters (2)
- neural network weights and MF factors
- domain-specific scaling or regularization terms
axioms (2)
- domain assumption Matrix factorization and deep neural networks can be effectively combined via a wide-and-deep framework to improve cross-domain performance
- domain assumption Shared entities across domains provide transferable knowledge that alleviates sparsity
Reference graph
Works this paper leans on
-
[1]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural ma- chine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[2]
Iván Cantador, Ignacio Fernández-Tobías, Shlomo Berkovsky, and Paolo Cre- monesi. 2015. Cross-domain recommender systems. In Recommender Systems Handbook. Springer, 919–959
work page 2015
-
[3]
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, et al
-
[4]
In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems
Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems . ACM, 7–10
-
[5]
Kyunghyun Cho, Bart Van Merriënboer, Dzmitry Bahdanau, and Yoshua Ben- gio. 2014. On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[6]
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[7]
Ali Mamdouh Elkahky, Yang Song, and Xiaodong He. 2015. A multi-view deep learning approach for cross domain user modeling in recommendation systems. In Proceedings of the 24th International Conference on World Wide Web . International World Wide Web Conferences Steering Committee, 278–288
work page 2015
-
[8]
Sheng Gao, Hao Luo, Da Chen, Shantao Li, Patrick Gallinari, and Jun Guo. 2013. Cross-domain recommendation via cluster-level latent factor model. In Joint European conference on machine learning and knowledge discovery in databases . Springer, 161–176
work page 2013
-
[9]
Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. 2016. Deep learning. Vol. 1. MIT press Cambridge
work page 2016
-
[10]
Guibing Guo, Jie Zhang, and Neil Yorke-Smith. 2015. TrustSVD: Collaborative Filtering with Both the Explicit and Implicit Influence of User Trust and of Item Ratings. In AAAI, Vol. 15. 123–125
work page 2015
-
[11]
Ming He, Jiuling Zhang, Peng Yang, and Kaisheng Yao. 2018. Robust Transfer Learning for Cross-domain Collaborative Filtering Using Multiple Rating Patterns Approximation. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. ACM, 225–233
work page 2018
-
[12]
Ruining He and Julian McAuley. 2016. Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering. In proceedings of the 25th international conference on world wide web . International World Wide Web Conferences Steering Committee, 507–517
work page 2016
-
[13]
Ruining He and Julian McAuley. 2016. VBPR: Visual Bayesian Personalized Ranking from Implicit Feedback. In AAAI. 144–150
work page 2016
-
[14]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web . International World Wide Web Conferences Steering Committee, 173–182
work page 2017
-
[15]
Heishiro Kanagawa, Hayato Kobayashi, Nobuyuki Shimizu, Yukihiro Tagami, and Taiji Suzuki. 2018. Cross-domain Recommendation via Deep Domain Adaptation. arXiv preprint arXiv:1803.03018 (2018)
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[16]
Muhammad Murad Khan, Roliana Ibrahim, and Imran Ghani. 2017. Cross domain recommender systems: a systematic literature review. ACM Computing Surveys (CSUR) 50, 3 (2017), 36
work page 2017
-
[17]
Donghyun Kim, Chanyoung Park, Jinoh Oh, Sungyoung Lee, and Hwanjo Yu
-
[18]
In Proceedings of the 10th ACM Conference on Recommender Systems
Convolutional matrix factorization for document context-aware recom- mendation. In Proceedings of the 10th ACM Conference on Recommender Systems . ACM, 233–240
-
[19]
Young Jun Ko, Lucas Maystre, and Matthias Grossglauser. 2016. Collaborative recurrent neural networks for dynamic recommender systems. In Journal of Machine Learning Research: Workshop and Conference Proceedings , Vol. 63
work page 2016
-
[20]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization tech- niques for recommender systems. Computer 8 (2009), 30–37
work page 2009
-
[21]
Bin Li, Qiang Yang, and Xiangyang Xue. 2009. Can Movies and Books Collaborate? Cross-Domain Collaborative Filtering for Sparsity Reduction. In IJCAI, Vol. 9. 2052–2057
work page 2009
-
[22]
Bin Li, Qiang Yang, and Xiangyang Xue. 2009. Transfer learning for collaborative filtering via a rating-matrix generative model. In Proceedings of the 26th annual international conference on machine learning . ACM, 617–624
work page 2009
-
[23]
Chung-Yi Li and Shou-De Lin. 2014. Matching users and items across domains to improve the recommendation quality. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining . ACM, 801–810
work page 2014
-
[24]
Xiaopeng Li and James She. 2017. Collaborative variational autoencoder for recommender systems. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . ACM, 305–314
work page 2017
-
[25]
Jianxun Lian, Fuzheng Zhang, Xing Xie, and Guangzhong Sun. 2017. CCCFNet: A Content-Boosted Collaborative Filtering Neural Network for Cross Domain Recommender Systems. In Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 817–818
work page 2017
-
[26]
Yan-Fu Liu, Cheng-Yu Hsu, and Shan-Hung Wu. 2015. Non-linear cross-domain collaborative filtering via hyper-structure transfer. In International Conference on Machine Learning. 1190–1198
work page 2015
-
[27]
Weizhi Ma, Min Zhang, Chenyang Wang, Cheng Luo, Yiqun Liu, and Shaoping Ma. 2018. Your Tweets Reveal What You Like: Introducing Cross-media Content Information into Multi-domain Recommendation. In IJCAI. 3484–3490
work page 2018
-
[28]
Tong Man, Huawei Shen, Xiaolong Jin, and Xueqi Cheng. 2017. Cross-domain recommendation: an embedding and mapping approach. InProceedings of the 26th International Joint Conference on Artificial Intelligence . AAAI Press, 2464–2470
work page 2017
-
[29]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems . 3111–3119
work page 2013
-
[30]
Andriy Mnih and Ruslan R Salakhutdinov. 2008. Probabilistic matrix factorization. In Advances in neural information processing systems . 1257–1264
work page 2008
-
[31]
Steffen Rendle. 2010. Factorization machines. In Data Mining (ICDM), 2010 IEEE 10th International Conference on . IEEE, 995–1000
work page 2010
-
[32]
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme
-
[33]
In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence
BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence . AUAI Press, 452–461
-
[34]
Shaghayegh Sahebi and Peter Brusilovsky. 2015. It takes two to tango: An explo- ration of domain pairs for cross-domain collaborative filtering. In Proceedings of the 9th ACM Conference on Recommender Systems . ACM, 131–138
work page 2015
-
[35]
Ruslan Salakhutdinov, Andriy Mnih, and Geoffrey Hinton. 2007. Restricted Boltz- mann machines for collaborative filtering. In Proceedings of the 24th international conference on Machine learning . ACM, 791–798
work page 2007
-
[36]
Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. 2015. Autorec: Autoencoders meet collaborative filtering. In Proceedings of the 24th International Conference on World Wide Web. ACM, 111–112
work page 2015
-
[37]
Ajit P Singh and Geoffrey J Gordon. 2008. Relational learning via collective matrix factorization. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining . ACM, 650–658
work page 2008
-
[38]
Florian Strub and Jeremie Mary. 2015. Collaborative filtering with stacked de- noising autoencoders and sparse inputs. In NIPS workshop on machine learning for eCommerce
work page 2015
-
[39]
Xiaoyuan Su and Taghi M Khoshgoftaar. 2009. A survey of collaborative filtering techniques. Advances in artificial intelligence 2009 (2009)
work page 2009
-
[40]
Hao Wang, Naiyan Wang, and Dit-Yan Yeung. 2015. Collaborative deep learning for recommender systems. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . ACM, 1235–1244
work page 2015
-
[41]
Chao-Yuan Wu, Amr Ahmed, Alex Beutel, Alexander J Smola, and How Jing. 2017. Recurrent recommender networks. In Proceedings of the tenth ACM international conference on web search and data mining . ACM, 495–503
work page 2017
-
[42]
Yao Wu, Christopher DuBois, Alice X Zheng, and Martin Ester. 2016. Collabora- tive denoising auto-encoders for top-n recommender systems. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining . ACM, 153–162
work page 2016
-
[43]
Xin Xin, Zhirun Liu, Chin-Yew Lin, Heyan Huang, Xiaochi Wei, and Ping Guo
- [44]
-
[45]
Hong-Jian Xue, Xinyu Dai, Jianbing Zhang, Shujian Huang, and Jiajun Chen
- [46]
-
[47]
Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, and Wei-Ying Ma
-
[48]
Collaborative knowledge base embedding for recommender systems. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining . ACM, 353–362
-
[49]
Shuai Zhang, Lina Yao, and Aixin Sun. 2017. Deep learning based recommender system: A survey and new perspectives. arXiv preprint arXiv:1707.07435 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[50]
Lei Zheng, Vahid Noroozi, and Philip S Yu. 2017. Joint deep modeling of users and items using reviews for recommendation. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining . ACM, 425–434
work page 2017
-
[51]
Erheng Zhong, Wei Fan, and Qiang Yang. 2014. User behavior learning and transfer in composite social networks. ACM Transactions on Knowledge Discovery from Data (TKDD) 8, 1 (2014), 6
work page 2014
-
[52]
Feng Zhu, Yan Wang, Chaochao Chen, Guanfeng Liu, Mehmet A Orgun, and Jia Wu. 2018. A Deep Framework for Cross-Domain and Cross-System Recommen- dations. In IJCAI. 3711–3717
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.