pith. sign in

arxiv: 1903.11907 · v2 · pith:FRBXTQ4Knew · submitted 2019-03-28 · 📊 stat.ML · cs.LG

Meta-Learning surrogate models for sequential decision making

classification 📊 stat.ML cs.LG
keywords decisionframeworklearningmakingprobabilisticproblemsapproachdata
0
0 comments X
read the original abstract

We introduce a unified probabilistic framework for solving sequential decision making problems ranging from Bayesian optimisation to contextual bandits and reinforcement learning. This is accomplished by a probabilistic model-based approach that explains observed data while capturing predictive uncertainty during the decision making process. Crucially, this probabilistic model is chosen to be a Meta-Learning system that allows learning from a distribution of related problems, allowing data efficient adaptation to a target task. As a suitable instantiation of this framework, we explore the use of Neural processes due to statistical and computational desiderata. We apply our framework to a broad range of problem domains, such as control problems, recommender systems and adversarial attacks on RL agents, demonstrating an efficient and general black-box learning approach.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.