pith. machine review for the scientific record. sign in

arxiv: 1702.08212 · v1 · submitted 2017-02-27 · 💻 cs.RO · cs.CV· cs.HC

Recognition: unknown

Anticipating many futures: Online human motion prediction and synthesis for human-robot collaboration

Authors on Pith no claims yet
classification 💻 cs.RO cs.CVcs.HC
keywords motionhumanapproachfutureonlinepredictioncuesdata
0
0 comments X
read the original abstract

Fluent and safe interactions of humans and robots require both partners to anticipate the others' actions. A common approach to human intention inference is to model specific trajectories towards known goals with supervised classifiers. However, these approaches do not take possible future movements into account nor do they make use of kinematic cues, such as legible and predictable motion. The bottleneck of these methods is the lack of an accurate model of general human motion. In this work, we present a conditional variational autoencoder that is trained to predict a window of future human motion given a window of past frames. Using skeletal data obtained from RGB depth images, we show how this unsupervised approach can be used for online motion prediction for up to 1660 ms. Additionally, we demonstrate online target prediction within the first 300-500 ms after motion onset without the use of target specific training data. The advantage of our probabilistic approach is the possibility to draw samples of possible future motions. Finally, we investigate how movements and kinematic cues are represented on the learned low dimensional manifold.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. SASI: Leveraging Sub-Action Semantics for Robust Early Action Recognition in Human-Robot Interaction

    cs.RO 2026-04 unverdicted novelty 5.0

    SASI combines skeleton-based graph convolutions with sub-action semantics for improved early action recognition on the BABEL dataset.