Exploiting gradients and Hessians in Bayesian optimization and Bayesian quadrature

Anqi Wu; Jonathan W. Pillow; Mikio C. Aoi

arxiv: 1704.00060 · v2 · pith:Z2JOKULRnew · submitted 2017-03-31 · 📊 stat.ML



classification 📊 stat.ML

keywords: bayesian, methods, function, optimization, quadrature, derivative, derivatives, evaluate


An exciting branch of machine learning research focuses on methods for learning, optimizing, and integrating unknown functions that are difficult or costly to evaluate. A popular Bayesian approach to this problem uses a Gaussian process (GP) to construct a posterior distribution over the function of interest given a set of observed measurements, and selects new points to evaluate using the statistics of this posterior. Here we extend these methods to exploit derivative information from the unknown function. We describe methods for Bayesian optimization (BO) and Bayesian quadrature (BQ) in settings where first and second derivatives may be evaluated along with the function itself. We perform sampling-based inference in order to incorporate uncertainty over hyperparameters, and show that both hyperparameter and function uncertainty decrease much more rapidly when using derivative information. Moreover, we introduce techniques for overcoming ill-conditioning issues that have plagued earlier methods for gradient-enhanced Gaussian processes and kriging. We illustrate the efficacy of these methods using applications to real and simulated Bayesian optimization and quadrature problems, and show that exploiting derivatives can provide substantial gains over standard methods.
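
To make the idea of conditioning a GP on derivative observations concrete, here is a minimal one-dimensional sketch. It is not the authors' code and makes several assumptions of its own: a squared-exponential kernel, fixed hyperparameters (`ell`, `sigma2`) rather than the sampling-based inference described in the abstract, gradients only (no Hessians), and a small generic `jitter` term on the diagonal, which is a standard numerical device and not the paper's technique for ill-conditioning. The function names `rbf_blocks` and `posterior_mean` and the test function `sin` are hypothetical choices for illustration.

```python
import numpy as np

def rbf_blocks(xa, xb, ell=1.0, sigma2=1.0):
    """Covariance blocks of a squared-exponential GP and its derivative (1-D)."""
    d = xa[:, None] - xb[None, :]
    k = sigma2 * np.exp(-0.5 * d**2 / ell**2)   # cov(f(xa), f(xb))
    k_fd = k * d / ell**2                       # cov(f(xa), f'(xb))
    k_dd = k * (1.0 / ell**2 - d**2 / ell**4)   # cov(f'(xa), f'(xb))
    return k, k_fd, k_dd

def posterior_mean(x_train, y, x_test, dy=None, ell=1.0, jitter=1e-8):
    """GP posterior mean of f at x_test, optionally conditioning on gradients dy."""
    k_ff, k_fd, k_dd = rbf_blocks(x_train, x_train, ell)
    ks_ff, ks_fd, _ = rbf_blocks(x_test, x_train, ell)
    if dy is None:
        K, Ks, obs = k_ff, ks_ff, y
    else:
        # Joint covariance of the stacked observations [f(X); f'(X)].
        K = np.block([[k_ff, k_fd], [k_fd.T, k_dd]])
        Ks = np.hstack([ks_ff, ks_fd])
        obs = np.concatenate([y, dy])
    # Assumed generic jitter for numerical stability (not the paper's method).
    K = K + jitter * np.eye(K.shape[0])
    return Ks @ np.linalg.solve(K, obs)

# Illustrative target: f(x) = sin(x), observed at 5 points with its derivative.
x_train = np.linspace(0.0, 2.0 * np.pi, 5)
x_test = np.linspace(0.0, 2.0 * np.pi, 200)
y, dy = np.sin(x_train), np.cos(x_train)

err_plain = np.max(np.abs(posterior_mean(x_train, y, x_test) - np.sin(x_test)))
err_grad = np.max(np.abs(posterior_mean(x_train, y, x_test, dy=dy) - np.sin(x_test)))
print(f"max error, values only:        {err_plain:.4f}")
print(f"max error, values + gradients: {err_grad:.4f}")
```

In this toy setting the gradient-enhanced posterior mean tracks the target more closely with the same number of evaluation points, which is the qualitative effect the abstract describes. The stacked covariance matrix doubles in size per input dimension added, so conditioning and cost grow quickly; that is the ill-conditioning concern the abstract refers to.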


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Efficient Gradient-Enhanced Bayesian Optimizer with Comparisons to Conjugate-Gradient and Quasi-Newton Optimizers for Unconstrained Local Optimization
math.OC · 2025-04 · unverdicted · novelty 5.0

A gradient-enhanced local Bayesian optimization framework that converges optimality as deeply as standard optimizers but with significantly fewer function evaluations on 2-40 dimensional unimodal problems, outperformi...