Learning Awareness Models

· 2018 · cs.AI · arXiv 1804.06318

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

We consider the setting of an agent with a fixed body interacting with an unknown and uncertain external world. We show that models trained to predict proprioceptive information about the agent's body come to represent objects in the external world. In spite of being trained with only internally available signals, these dynamic body models come to represent external objects through the necessity of predicting their effects on the agent's own body. That is, the model learns holistic persistent representations of objects in the world, even though the only training signals are body signals. Our dynamics model is able to successfully predict distributions over 132 sensor readings over 100 steps into the future and we demonstrate that even when the body is no longer in contact with an object, the latent variables of the dynamics model continue to represent its shape. We show that active data collection by maximizing the entropy of predictions about the body---touch sensors, proprioception and vestibular information---leads to learning of dynamic models that show superior performance when used for control. We also collect data from a real robotic hand and show that the same models can be used to answer questions about properties of objects in the real world. Videos with qualitative results of our models are available at https://goo.gl/mZuqAV.

representative citing papers

Bayesian updates from coalgebraic determinisation

cs.LO · 2026-06-24 · unverdicted · novelty 7.0

Unifilarisation of stochastic Mealy machines is an instance of coalgebraic determinisation over monads with support structure, producing causal stochastic behaviours rather than Moore-style output distributions.

Shaping Belief States with Generative Environment Models for RL

cs.LG · 2019-06-21 · unverdicted · novelty 5.0

Multi-step predictive generative models form stable belief states capturing environment layout and agent pose, yielding higher data efficiency on RL tasks than model-free agents.

citing papers explorer

Showing 2 of 2 citing papers.

Bayesian updates from coalgebraic determinisation cs.LO · 2026-06-24 · unverdicted · none · ref 78 · internal anchor
Unifilarisation of stochastic Mealy machines is an instance of coalgebraic determinisation over monads with support structure, producing causal stochastic behaviours rather than Moore-style output distributions.
Shaping Belief States with Generative Environment Models for RL cs.LG · 2019-06-21 · unverdicted · none · ref 34 · internal anchor
Multi-step predictive generative models form stable belief states capturing environment layout and agent pose, yielding higher data efficiency on RL tasks than model-free agents.

Learning Awareness Models

fields

years

verdicts

representative citing papers

citing papers explorer