pith. sign in

arxiv: 1803.02661 · v2 · pith:X5MJTIOHnew · submitted 2018-03-07 · 🧮 math.NA · cs.DS· cs.LG· cs.NA

Sketching for Principal Component Regression

classification 🧮 math.NA cs.DScs.LGcs.NA
keywords approximateregressionwhenalgorithmscomponentcomputinghandhigh
0
0 comments X
read the original abstract

Principal component regression (PCR) is a useful method for regularizing linear regression. Although conceptually simple, straightforward implementations of PCR have high computational costs and so are inappropriate when learning with large scale data. In this paper, we propose efficient algorithms for computing approximate PCR solutions that are, on one hand, high quality approximations to the true PCR solutions (when viewed as minimizer of a constrained optimization problem), and on the other hand entertain rigorous risk bounds (when viewed as statistical estimators). In particular, we propose an input sparsity time algorithms for approximate PCR. We also consider computing an approximate PCR in the streaming model, and kernel PCR. Empirical results demonstrate the excellent performance of our proposed methods.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. State-of-art minibatches via novel DPP kernels: discretization, wavelets, and rough objectives

    stat.ML 2026-05 unverdicted novelty 7.0

    Wavelet DPP kernels deliver improved continuous variance reduction and a discretization procedure that preserves decay rates for discrete ML subsampling tasks including rough objectives.