Posterior Variance Analysis of Gaussian Processes with Application to Average Learning Curves

· 2019 · cs.LG · arXiv 1906.01404

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

The posterior variance of Gaussian processes is a valuable measure of the learning error which is exploited in various applications such as safe reinforcement learning and control design. However, suitable analysis of the posterior variance which captures its behavior for finite and infinite number of training data is missing. This paper derives a novel bound for the posterior variance function which requires only local information because it depends only on the number of training samples in the proximity of a considered test point. Furthermore, we prove sufficient conditions which ensure the convergence of the posterior variance to zero. Finally, we demonstrate that the extension of our bound to an average learning bound outperforms existing approaches.

representative citing papers

PolicyGuard: Towards Test-time and Step-level Adversary (Backdoor) Defense for Reinforcement Learning Agent

cs.LG · 2026-06-11 · unverdicted · novelty 7.0

PolicyGuard provides a test-time step-level defense against backdoor attacks in RL using GP posterior variance, showing high detection AUROC on seven games.

Estimating Mixture Distributions via Stochastic Mirror Descent

stat.ML · 2026-05-24 · unverdicted · novelty 6.0

Proposes stochastic mirror descent estimators for mixture models that scale to many components, avoid strict support bounds for discrete cases, and achieve near-optimal KL and l2 rates under mild conditions.

citing papers explorer

Showing 2 of 2 citing papers after filters.

PolicyGuard: Towards Test-time and Step-level Adversary (Backdoor) Defense for Reinforcement Learning Agent cs.LG · 2026-06-11 · unverdicted · none · ref 4 · internal anchor
PolicyGuard provides a test-time step-level defense against backdoor attacks in RL using GP posterior variance, showing high detection AUROC on seven games.
Estimating Mixture Distributions via Stochastic Mirror Descent stat.ML · 2026-05-24 · unverdicted · none · ref 14 · internal anchor
Proposes stochastic mirror descent estimators for mixture models that scale to many components, avoid strict support bounds for discrete cases, and achieve near-optimal KL and l2 rates under mild conditions.

Posterior Variance Analysis of Gaussian Processes with Application to Average Learning Curves

fields

years

verdicts

representative citing papers

citing papers explorer