Deep Kernel Learning

Andrew Gordon Wilson; Eric P. Xing; Ruslan Salakhutdinov; Zhiting Hu

arxiv: 1511.02222 · v1 · pith:U32GDZC7new · submitted 2015-11-06 · 💻 cs.LG · cs.AI· stat.ME· stat.ML

Deep Kernel Learning

Andrew Gordon Wilson , Zhiting Hu , Ruslan Salakhutdinov , Eric P. Xing This is my paper

classification 💻 cs.LG cs.AIstat.MEstat.ML

keywords kerneldeepkernelslearningscalablearchitecturescostgaussian

0 comments

read the original abstract

We introduce scalable deep kernels, which combine the structural properties of deep learning architectures with the non-parametric flexibility of kernel methods. Specifically, we transform the inputs of a spectral mixture base kernel with a deep architecture, using local kernel interpolation, inducing points, and structure exploiting (Kronecker and Toeplitz) algebra for a scalable kernel representation. These closed-form kernels can be used as drop-in replacements for standard kernels, with benefits in expressive power and scalability. We jointly learn the properties of these kernels through the marginal likelihood of a Gaussian process. Inference and learning cost $O(n)$ for $n$ training points, and predictions cost $O(1)$ per test point. On a large and diverse collection of applications, including a dataset with 2 million examples, we show improved performance over scalable Gaussian processes with flexible kernel learning models, and stand-alone deep architectures.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Flow Matching Algorithm for Many-Shot Adaptation to Unseen Distributions
cs.LG 2026-05 unverdicted novelty 7.0

FP-FM adapts flow matching models to unseen distributions via least-squares projection onto basis functions spanning training velocity fields, yielding improved precision and recall without inference-time training.