pith. sign in

arxiv: 1501.06478 · v2 · pith:G4B577PXnew · submitted 2015-01-26 · 💻 cs.LG

Compressed Support Vector Machines

classification 💻 cs.LG
keywords supportdatasetsalgorithmduringmachinesreducingseveral
0
0 comments X p. Extension
pith:G4B577PX Add to your LaTeX paper What is a Pith Number?
\usepackage{pith}
\pithnumber{G4B577PX}

Prints a linked pith:G4B577PX badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Support vector machines (SVM) can classify data sets along highly non-linear decision boundaries because of the kernel-trick. This expressiveness comes at a price: During test-time, the SVM classifier needs to compute the kernel inner-product between a test sample and all support vectors. With large training data sets, the time required for this computation can be substantial. In this paper, we introduce a post-processing algorithm, which compresses the learned SVM model by reducing and optimizing support vectors. We evaluate our algorithm on several medium-scaled real-world data sets, demonstrating that it maintains high test accuracy while reducing the test-time evaluation cost by several orders of magnitude---in some cases from hours to seconds. It is fair to say that most of the work in this paper was previously been invented by Burges and Sch\"olkopf almost 20 years ago. For most of the time during which we conducted this research, we were unaware of this prior work. However, in the past two decades, computing power has increased drastically, and we can therefore provide empirical insights that were not possible in their original paper.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.