Block based Singular Value Decomposition approach to matrix factorization for recommender systems

Prasad Bhavana; Vikas Kumar; Vineet Padmanabhan

arxiv: 1907.07410 · v1 · pith:BXUTHSWKnew · submitted 2019-07-17 · 💻 cs.LG · cs.AI· cs.IR· stat.ML

Block based Singular Value Decomposition approach to matrix factorization for recommender systems

Prasad Bhavana , Vikas Kumar , Vineet Padmanabhan This is my paper

Pith reviewed 2026-05-24 20:29 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.IRstat.ML

keywords recommender systemsmatrix factorizationsingular value decompositionblock based processingGPUCUDAscalabilityperformance

0 comments

The pith

Dividing rating matrices into blocks for SVD enables parallel GPU processing for recommender systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a block-based approach to singular value decomposition for matrix factorization in recommender systems. By splitting the user-item matrix into blocks and applying SVD to each, the method aims to combine the predictive power of SVD with the scalability of block processing. This setup supports multi-threading, GPU acceleration through CUDA, and distributed computation. The goal is to achieve better performance and lower memory usage compared to standard matrix factorization while maintaining recommendation quality.

Core claim

Block based matrix factorization paired with SVD allows taking advantage of SVD over basic matrix factorization while gaining parallelism and scalability through BMF, with CUDA and GPU demonstrating advantages in performance and memory for recommender systems.

What carries the argument

Block based matrix factorization (BMF) paired with SVD, which divides the matrix into blocks for independent SVD computation to enable parallel processing.

If this is right

Scalable recommendations become feasible for larger datasets using multi-threading and multiple computational units.
GPU hardware can be leveraged via CUDA to reduce computation time and memory requirements.
Distributed computation setups can apply the same block-based technique for even larger scale problems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The block approach might extend to other matrix decomposition methods in collaborative filtering.
Affected blocks could be reprocessed independently when new user ratings arrive, supporting incremental updates.

Load-bearing premise

Dividing the matrix into blocks and computing SVD separately on each block will not reduce the accuracy of the resulting recommendations compared to applying SVD to the full matrix.

What would settle it

Measuring the root mean square error or other accuracy metrics on a standard recommender dataset using both the block-based SVD and the full SVD to check if the block method produces comparable or better predictions.

Figures

Figures reproduced from arXiv: 1907.07410 by Prasad Bhavana, Vikas Kumar, Vineet Padmanabhan.

**Figure 2.** Figure 2: An example scenario of parallel BMF is proposed for the approximate Alternative Least Square (ALS) algorithm. The authors propose to use SGD for the optimization. GPU accelerated Non-Negative Matrix Factorization (NNMF) for Compute Unified Device Architecture (CUDA) capable hardware has been proposed in (7). Similarly, in (11), NNMF with GPU acceleration is used for text mining purposes. From the literatu… view at source ↗

**Figure 1.** Figure 1: Example of Block Matrix Factorization a simple example wherein X is divided into 4 blocks and each of these blocks are factorized individually so as to combine together to form U and V that exactly explain X. BMF considers each element exactly once per iteration, with the difference in change of sequence of processing of elements. As MF does not constrain the sequence in which the data elements are proces… view at source ↗

**Figure 3.** Figure 3: PMF vs SVD for MovieLens 1M data set 5.1. Experimental results [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: SVD vs Block based SVD for MovieLens 1M data set [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 5.** Figure 5: BGSVD run on MovieLens 1M data set factorize data sets of any scale. The block based approach can be adapted to run block level factorization on multiple GPUs as well as on distributed systems. SVD is also useful for incremental data where all the data is not available initially and the new data may arrive after the model building phase. In such a scenario, the models can be incrementally computed with the… view at source ↗

read the original abstract

With the abundance of data in recent years, interesting challenges are posed in the area of recommender systems. Producing high quality recommendations with scalability and performance is the need of the hour. Singular Value Decomposition(SVD) based recommendation algorithms have been leveraged to produce better results. In this paper, we extend the SVD technique further for scalability and performance in the context of 1) multi-threading 2) multiple computational units (with the use of Graphical Processing Units) and 3) distributed computation. We propose block based matrix factorization (BMF) paired with SVD. This enabled us to take advantage of SVD over basic matrix factorization(MF) while taking advantage of parallelism and scalability through BMF. We used Compute Unified Device Architecture (CUDA) platform and related hardware for leveraging Graphical Processing Unit (GPU) along with block based SVD to demonstrate the advantages in terms of performance and memory.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper sketches a block-SVD plus CUDA setup for scaling recommender MF but supplies no numbers, no accuracy checks, and no error analysis, so the practical claim stays untested.

read the letter

The paper sketches a block-SVD plus CUDA setup for scaling recommender MF but supplies no numbers, no accuracy checks, and no error analysis, so the practical claim stays untested. The authors start from the observation that SVD-based factorization often beats plain MF on quality yet struggles with large matrices on single machines. Their response is to split the rating matrix into blocks, run SVD on each block separately, and use CUDA to exploit GPU parallelism for multi-threaded and distributed cases. That direction addresses a real engineering constraint in production recommenders where memory and wall-clock time matter. The write-up is clear about the intended targets: multi-threading, GPU hardware, and distributed settings. It also correctly notes that the block route keeps the SVD accuracy advantage while adding parallelism that basic MF already enjoys. Those points are straightforward and worth stating. The soft spot is the missing link between the blocked decomposition and the final recommendation quality. Independent SVDs on separate blocks ignore cross-block user-item interactions, yet the abstract offers no perturbation bound, no ablation on block size, and no comparison of RMSE or ranking metrics against a full-matrix SVD baseline. The claim that advantages were demonstrated therefore rests on an unshown assumption that the approximation error stays small enough. Without tables, timing data, or even a description of how block results are combined, it is impossible to judge whether the method improves on existing GPU libraries or simply reproduces known trade-offs. This work would interest practitioners who already maintain CUDA code for recommenders and want a quick sketch of a blocking strategy. For the broader literature it adds little until the empirical gap is closed. I would not send it to peer review in this form; the authors would need to supply the missing experiments and at least a basic error argument before a referee could evaluate the central claim.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes block-based matrix factorization (BMF) combined with SVD for recommender systems. It claims this retains SVD's accuracy advantages over basic MF while adding scalability via blocking for multi-threading, CUDA/GPU acceleration, and distributed computation, with demonstrated gains in performance and memory.

Significance. If the blocked SVD approximation is shown to preserve recommendation quality, the work could enable practical deployment of SVD-based methods on large datasets by exploiting parallel hardware, a relevant contribution to scalable recommender systems.

major comments (2)

[Abstract] Abstract: the assertion that advantages 'were demonstrated' in performance and memory is unsupported by any quantitative results, baselines, error metrics (e.g., RMSE), ablation studies, or experimental setup; the central claim therefore rests on an unevidenced statement.
[None (central claim)] No section provides an error analysis: performing independent SVD on blocks ignores inter-block user-item interactions, yet no perturbation bound (e.g., via Weyl or Davis-Kahan theorems) or empirical study of block-size effects on approximation quality or ranking metrics is given; this directly undermines the claim that BMF+SVD retains SVD accuracy.

minor comments (1)

[Abstract] Abstract: the phrasing 'multiple computational units (with the use of Graphical Processing Units)' is unclear on whether distributed computation beyond single-GPU CUDA is actually implemented or evaluated.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback. We address each major comment below and indicate the revisions we will incorporate to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract: the assertion that advantages 'were demonstrated' in performance and memory is unsupported by any quantitative results, baselines, error metrics (e.g., RMSE), ablation studies, or experimental setup; the central claim therefore rests on an unevidenced statement.

Authors: We agree the abstract overstates the demonstration without supporting numbers. The manuscript text describes the CUDA/GPU implementation but does not embed the actual experimental results, baselines, or metrics in the provided sections. We will revise the abstract to include key quantitative outcomes (e.g., runtime and memory reductions versus standard SVD/MF, with RMSE values) and will ensure the experimental setup is summarized there. This change will be made in the next version. revision: yes
Referee: [None (central claim)] No section provides an error analysis: performing independent SVD on blocks ignores inter-block user-item interactions, yet no perturbation bound (e.g., via Weyl or Davis-Kahan theorems) or empirical study of block-size effects on approximation quality or ranking metrics is given; this directly undermines the claim that BMF+SVD retains SVD accuracy.

Authors: The referee correctly identifies a gap: the manuscript contains no formal error analysis or empirical evaluation of blocking effects on accuracy. We will add a new subsection with an empirical study varying block sizes and reporting impacts on RMSE and ranking metrics (e.g., NDCG). A brief discussion of approximation error relative to full SVD will also be included, referencing relevant perturbation results where applicable or explicitly noting limitations if tight bounds are not derivable. These additions will directly support or qualify the accuracy-retention claim. revision: yes

Circularity Check

0 steps flagged

No circularity: proposal paper states method without derivations or fitted predictions

full rationale

The manuscript is a methods proposal for block-based SVD matrix factorization on GPU. No equations, parameter fits, or derivation chain appear in the provided text. Claims of accuracy retention and performance gains are asserted without reduction to self-referential inputs, self-citations, or renamed empirical patterns. The central premise (BMF preserves SVD advantages while adding parallelism) is not shown to be equivalent to its inputs by construction; it remains an unproven engineering claim open to external validation. This is the expected non-finding for a purely descriptive systems paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only abstract available; no free parameters, axioms, or invented entities are described.

pith-pipeline@v0.9.0 · 5691 in / 1053 out tokens · 20802 ms · 2026-05-24T20:29:21.206562+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages · 1 internal anchor

[1]

M. W. Berry, S.T. Dumais, and G.W. O’Brien. Using linear algebra for intelligent information retrieval. SIAM REVIEW, 37:573–595, 1995

work page 1995
[2]

W. Chen, Y . Li, B. Pan, and C. Xu. Block kernel non-negative matrix factorization and its application to face recognition. In IJCNN, pages 3446–3452, 2016

work page 2016
[3]

A learning-rate schedule for stochastic gradient methods to matrix factoriza- tion

Wei-Sheng Chin, Yong Zhuang, Yu-Chin Juan, and Chih-Jen Lin. A learning-rate schedule for stochastic gradient methods to matrix factoriza- tion. In KDD, pages 442–455, 2015

work page 2015
[4]

Dumais, George W

Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Lan- dauer, and Richard Harshman. Indexing by latent semantic analysis. Jour- nal of the American Society for Information Science, 41(6):391–407, 1990

work page 1990
[5]

Dc-nmf: Non- negative matrix factorization based on divide-and-conquer for fast cluster- ing and topic modeling

Rundong Du, Da Kuang, Barry Drake, and Haesun Park. Dc-nmf: Non- negative matrix factorization based on divide-and-conquer for fast cluster- ing and topic modeling. Journal of Global Optimization, 68(4):777–798, Aug 2017

work page 2017
[6]

Haas, and Yannis Sismanis

Rainer Gemulla, Erik Nijkamp, Peter J. Haas, and Yannis Sismanis. Large- scale matrix factorization with distributed stochastic gradient descent. In SIGKDD, pages 69–77, 2011

work page 2011
[7]

nmfgpu4r: Gpu-accelerated computation of the non-negative matrix factorization using cuda capable hardware

Sven Koitka and Christoph M Friedrich. nmfgpu4r: Gpu-accelerated computation of the non-negative matrix factorization using cuda capable hardware. The R Journal, 8(2):382–392, 2016

work page 2016
[8]

1 the bellkor solution to the netﬂix grand prize, 2009

Yehuda Koren. 1 the bellkor solution to the netﬂix grand prize, 2009

work page 2009
[9]

Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan

Vikas Kumar, Arun K. Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan. Collaborative ﬁltering using multiple binary maximum margin matrix factorizations. Information Sciences, 380:1–11, 2017

work page 2017
[10]

Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan

Vikas Kumar, Arun K. Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan. Proximal maximum margin matrix factorization for collaborative ﬁltering.PRL, 86:62–67, 2017

work page 2017
[11]

Gpu-accelerated non-negative matrix factorization for text mining

V olodymyr Kysenko, Karl Rupp, Oleksandr Marchenko, Siegfried Sel- berherr, and Anatoly Anisimov. Gpu-accelerated non-negative matrix factorization for text mining. In NIPS, pages 158–163, 2012

work page 2012
[12]

Dimensionality reduction in higher-order signal processing and rank-(r1,r2,,rn) reduction in multilinear algebra

Lieven De Lathauwer and Joos Vandewalle. Dimensionality reduction in higher-order signal processing and rank-(r1,r2,,rn) reduction in multilinear algebra. Linear Algebra and its Applications, 391:31 – 55, 2004. Special Issue on Linear Algebra in Signal and Image Processing

work page 2004
[13]

Sparkler: supporting large-scale matrix factorization

Boduo Li, Sandeep Tata, and Yannis Sismanis. Sparkler: supporting large-scale matrix factorization. In Joint EDBT/ICDT Conferences, pages 625–636, 2013

work page 2013
[14]

Mackey, Michael I

Lester W. Mackey, Michael I. Jordan, and Ameet Talwalkar. Divide-and- conquer matrix factorization. In NIPS, pages 1134–1142. 2011

work page 2011
[15]

Rathnayake, and Geoﬀrey J

Vladimir Nikulin, Tian-Hsiang Huang, Shu-Kay Ng, Suren I. Rathnayake, and Geoﬀrey J. McLachlan. A very fast algorithm for matrix factoriza- tion. Statistics & Probability Letters, 81(7):773–782, 2011. Statistics in Biological and Medical Sciences

work page 2011
[16]

Fast and robust parallel SGD matrix factorization

Jinoh Oh, Wook-Shin Han, Hwanjo Yu, and Xiaoqian Jiang. Fast and robust parallel SGD matrix factorization. In KDD, pages 865–874, 2015

work page 2015
[17]

Wright, and Feng Niu

Benjamin Recht, Christopher R´e, Stephen J. Wright, and Feng Niu. Hog- wild: A lock-free approach to parallelizing stochastic gradient descent. In NIPS, pages 693–701, 2011

work page 2011
[18]

Ross, Jongwoo Lim, Ruei-Sung Lin, and Ming-Hsuan Yang

David A. Ross, Jongwoo Lim, Ruei-Sung Lin, and Ming-Hsuan Yang. Incremental learning for robust visual tracking. International Journal of Computer Vision, 77(1):125–141, May 2008

work page 2008
[19]

Probabilistic matrix factorization

Ruslan Salakhutdinov and Andriy Mnih. Probabilistic matrix factorization. In NIPS, pages 1257–1264, 2007

work page 2007
[20]

In- cremental singular value decomposition algorithms for highly scalable recommender systems

Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. In- cremental singular value decomposition algorithms for highly scalable recommender systems. In Fifth International Conference on Computer and Information Science, pages 27–28, 2002

work page 2002
[21]

Factorbird - a Parameter Server Approach to Distributed Matrix Factorization

Sebastian Schelter, Venu Satuluri, and Reza Zadeh. Factorbird - a parameter server approach to distributed matrix factorization. CoRR, abs/1411.0602, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[22]

A survey of collaborative ﬁltering techniques

Xiaoyuan Su and Taghi M Khoshgoftaar. A survey of collaborative ﬁltering techniques. Advances in artiﬁcial intelligence, 2009, 2009

work page 2009
[23]

Fong, Cheng Li, Zijun Wang, and Liang Cao

Wei Tan, Shiyu Chang, Liana L. Fong, Cheng Li, Zijun Wang, and Liang Cao. Matrix factorization on gpus with memory optimization and approxi- mate computing. In ICPP, 2018

work page 2018
[24]

Schindler, and D

Tat-Jun Chin, K. Schindler, and D. Suter. Incremental kernel svd for face recognition with image sets. In 7th International Conference on Automatic Face and Gesture Recognition (FGR06), pages 461–466, April 2006

work page 2006
[25]

Split-and-combine singular value decomposition for large- scale matrix

Jengnan Tzeng. Split-and-combine singular value decomposition for large- scale matrix. Journal of Applied Mathematics, 2013, 03 2013

work page 2013
[26]

Hsiang-Fu Yu, Cho-Jui Hsieh, Si Si, and Inderjit S. Dhillon. Scalable coor- dinate descent approaches to parallel matrix factorization for recommender systems. In ICDM, pages 765–774, 2012

work page 2012
[27]

Hyokun Yun, Hsiang-Fu Yu, Cho-Jui Hsieh, S. V . N. Vishwanathan, and Inderjit Dhillon. Nomad: Non-locking, stochastic multi-machine algo- rithm for asynchronous and decentralized matrix completion. Proc. VLDB Endow., 7(11):975–986

work page
[28]

Localized matrix factorization for recommendation based on matrix block diagonal forms

Yongfeng Zhang, Min Zhang, Yiqun Liu, Shaoping Ma, and Shi Feng. Localized matrix factorization for recommendation based on matrix block diagonal forms. In WWW, pages 1511–1520. ACM, 2013

work page 2013
[29]

A fast parallel stochastic gradient descent for matrix factorization in shared memory systems

Yong Zhuang, Wei-Sheng Chin, Yu-Chin Juan, and Chih-Jen Lin. A fast parallel stochastic gradient descent for matrix factorization in shared memory systems. In RecSys, pages 249–256, 2013

work page 2013

[1] [1]

M. W. Berry, S.T. Dumais, and G.W. O’Brien. Using linear algebra for intelligent information retrieval. SIAM REVIEW, 37:573–595, 1995

work page 1995

[2] [2]

W. Chen, Y . Li, B. Pan, and C. Xu. Block kernel non-negative matrix factorization and its application to face recognition. In IJCNN, pages 3446–3452, 2016

work page 2016

[3] [3]

A learning-rate schedule for stochastic gradient methods to matrix factoriza- tion

Wei-Sheng Chin, Yong Zhuang, Yu-Chin Juan, and Chih-Jen Lin. A learning-rate schedule for stochastic gradient methods to matrix factoriza- tion. In KDD, pages 442–455, 2015

work page 2015

[4] [4]

Dumais, George W

Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Lan- dauer, and Richard Harshman. Indexing by latent semantic analysis. Jour- nal of the American Society for Information Science, 41(6):391–407, 1990

work page 1990

[5] [5]

Dc-nmf: Non- negative matrix factorization based on divide-and-conquer for fast cluster- ing and topic modeling

Rundong Du, Da Kuang, Barry Drake, and Haesun Park. Dc-nmf: Non- negative matrix factorization based on divide-and-conquer for fast cluster- ing and topic modeling. Journal of Global Optimization, 68(4):777–798, Aug 2017

work page 2017

[6] [6]

Haas, and Yannis Sismanis

Rainer Gemulla, Erik Nijkamp, Peter J. Haas, and Yannis Sismanis. Large- scale matrix factorization with distributed stochastic gradient descent. In SIGKDD, pages 69–77, 2011

work page 2011

[7] [7]

nmfgpu4r: Gpu-accelerated computation of the non-negative matrix factorization using cuda capable hardware

Sven Koitka and Christoph M Friedrich. nmfgpu4r: Gpu-accelerated computation of the non-negative matrix factorization using cuda capable hardware. The R Journal, 8(2):382–392, 2016

work page 2016

[8] [8]

1 the bellkor solution to the netﬂix grand prize, 2009

Yehuda Koren. 1 the bellkor solution to the netﬂix grand prize, 2009

work page 2009

[9] [9]

Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan

Vikas Kumar, Arun K. Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan. Collaborative ﬁltering using multiple binary maximum margin matrix factorizations. Information Sciences, 380:1–11, 2017

work page 2017

[10] [10]

Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan

Vikas Kumar, Arun K. Pujari, Sandeep Kumar Sahu, Venkateswara Rao Kagita, and Vineet Padmanabhan. Proximal maximum margin matrix factorization for collaborative ﬁltering.PRL, 86:62–67, 2017

work page 2017

[11] [11]

Gpu-accelerated non-negative matrix factorization for text mining

V olodymyr Kysenko, Karl Rupp, Oleksandr Marchenko, Siegfried Sel- berherr, and Anatoly Anisimov. Gpu-accelerated non-negative matrix factorization for text mining. In NIPS, pages 158–163, 2012

work page 2012

[12] [12]

Dimensionality reduction in higher-order signal processing and rank-(r1,r2,,rn) reduction in multilinear algebra

Lieven De Lathauwer and Joos Vandewalle. Dimensionality reduction in higher-order signal processing and rank-(r1,r2,,rn) reduction in multilinear algebra. Linear Algebra and its Applications, 391:31 – 55, 2004. Special Issue on Linear Algebra in Signal and Image Processing

work page 2004

[13] [13]

Sparkler: supporting large-scale matrix factorization

Boduo Li, Sandeep Tata, and Yannis Sismanis. Sparkler: supporting large-scale matrix factorization. In Joint EDBT/ICDT Conferences, pages 625–636, 2013

work page 2013

[14] [14]

Mackey, Michael I

Lester W. Mackey, Michael I. Jordan, and Ameet Talwalkar. Divide-and- conquer matrix factorization. In NIPS, pages 1134–1142. 2011

work page 2011

[15] [15]

Rathnayake, and Geoﬀrey J

Vladimir Nikulin, Tian-Hsiang Huang, Shu-Kay Ng, Suren I. Rathnayake, and Geoﬀrey J. McLachlan. A very fast algorithm for matrix factoriza- tion. Statistics & Probability Letters, 81(7):773–782, 2011. Statistics in Biological and Medical Sciences

work page 2011

[16] [16]

Fast and robust parallel SGD matrix factorization

Jinoh Oh, Wook-Shin Han, Hwanjo Yu, and Xiaoqian Jiang. Fast and robust parallel SGD matrix factorization. In KDD, pages 865–874, 2015

work page 2015

[17] [17]

Wright, and Feng Niu

Benjamin Recht, Christopher R´e, Stephen J. Wright, and Feng Niu. Hog- wild: A lock-free approach to parallelizing stochastic gradient descent. In NIPS, pages 693–701, 2011

work page 2011

[18] [18]

Ross, Jongwoo Lim, Ruei-Sung Lin, and Ming-Hsuan Yang

David A. Ross, Jongwoo Lim, Ruei-Sung Lin, and Ming-Hsuan Yang. Incremental learning for robust visual tracking. International Journal of Computer Vision, 77(1):125–141, May 2008

work page 2008

[19] [19]

Probabilistic matrix factorization

Ruslan Salakhutdinov and Andriy Mnih. Probabilistic matrix factorization. In NIPS, pages 1257–1264, 2007

work page 2007

[20] [20]

In- cremental singular value decomposition algorithms for highly scalable recommender systems

Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. In- cremental singular value decomposition algorithms for highly scalable recommender systems. In Fifth International Conference on Computer and Information Science, pages 27–28, 2002

work page 2002

[21] [21]

Factorbird - a Parameter Server Approach to Distributed Matrix Factorization

Sebastian Schelter, Venu Satuluri, and Reza Zadeh. Factorbird - a parameter server approach to distributed matrix factorization. CoRR, abs/1411.0602, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[22] [22]

A survey of collaborative ﬁltering techniques

Xiaoyuan Su and Taghi M Khoshgoftaar. A survey of collaborative ﬁltering techniques. Advances in artiﬁcial intelligence, 2009, 2009

work page 2009

[23] [23]

Fong, Cheng Li, Zijun Wang, and Liang Cao

Wei Tan, Shiyu Chang, Liana L. Fong, Cheng Li, Zijun Wang, and Liang Cao. Matrix factorization on gpus with memory optimization and approxi- mate computing. In ICPP, 2018

work page 2018

[24] [24]

Schindler, and D

Tat-Jun Chin, K. Schindler, and D. Suter. Incremental kernel svd for face recognition with image sets. In 7th International Conference on Automatic Face and Gesture Recognition (FGR06), pages 461–466, April 2006

work page 2006

[25] [25]

Split-and-combine singular value decomposition for large- scale matrix

Jengnan Tzeng. Split-and-combine singular value decomposition for large- scale matrix. Journal of Applied Mathematics, 2013, 03 2013

work page 2013

[26] [26]

Hsiang-Fu Yu, Cho-Jui Hsieh, Si Si, and Inderjit S. Dhillon. Scalable coor- dinate descent approaches to parallel matrix factorization for recommender systems. In ICDM, pages 765–774, 2012

work page 2012

[27] [27]

Hyokun Yun, Hsiang-Fu Yu, Cho-Jui Hsieh, S. V . N. Vishwanathan, and Inderjit Dhillon. Nomad: Non-locking, stochastic multi-machine algo- rithm for asynchronous and decentralized matrix completion. Proc. VLDB Endow., 7(11):975–986

work page

[28] [28]

Localized matrix factorization for recommendation based on matrix block diagonal forms

Yongfeng Zhang, Min Zhang, Yiqun Liu, Shaoping Ma, and Shi Feng. Localized matrix factorization for recommendation based on matrix block diagonal forms. In WWW, pages 1511–1520. ACM, 2013

work page 2013

[29] [29]

A fast parallel stochastic gradient descent for matrix factorization in shared memory systems

Yong Zhuang, Wei-Sheng Chin, Yu-Chin Juan, and Chih-Jen Lin. A fast parallel stochastic gradient descent for matrix factorization in shared memory systems. In RecSys, pages 249–256, 2013

work page 2013