pith. machine review for the scientific record. sign in

arxiv: 1905.07861 · v1 · submitted 2019-05-20 · 💻 cs.LG · cs.AI· stat.ML

Recognition: unknown

Perceptual Values from Observation

Authors on Pith no claims yet
classification 💻 cs.LG cs.AIstat.ML
keywords learningvaluesapproachobservationrewardsactionaction-valuesactions
0
0 comments X
read the original abstract

Imitation by observation is an approach for learning from expert demonstrations that lack action information, such as videos. Recent approaches to this problem can be placed into two broad categories: training dynamics models that aim to predict the actions taken between states, and learning rewards or features for computing them for Reinforcement Learning (RL). In this paper, we introduce a novel approach that learns values, rather than rewards, directly from observations. We show that by using values, we can significantly speed up RL by removing the need to bootstrap action-values, as compared to sparse-reward specifications.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

    cs.RO 2024-01 conditional novelty 6.0

    A low-cost whole-body teleoperation system enables effective imitation learning for complex bimanual mobile manipulation by co-training on mobile and static demonstration datasets.

  2. R3M: A Universal Visual Representation for Robot Manipulation

    cs.RO 2022-03 unverdicted novelty 6.0

    A visual encoder pre-trained on diverse human videos with contrastive and language objectives improves simulated robot manipulation success by over 20% versus training from scratch and enables real Franka arm tasks fr...