Learning 3D Human Pose from Structure and Motion

Abhishek Sharma; Anurag Mundhada; Arjun Jain; Rishabh Dabral; Safeer Afaque; Uday Kusupati

arxiv: 1711.09250 · v2 · pith:SJN7GCVSnew · submitted 2017-11-25 · 💻 cs.CV

Learning 3D Human Pose from Structure and Motion

Rishabh Dabral , Anurag Mundhada , Uday Kusupati , Safeer Afaque , Abhishek Sharma , Arjun Jain This is my paper

classification 💻 cs.CV

keywords posedatahumanin-the-wildlearninglosstemporalanalysis

0 comments

read the original abstract

3D human pose estimation from a single image is a challenging problem, especially for in-the-wild settings due to the lack of 3D annotated data. We propose two anatomically inspired loss functions and use them with a weakly-supervised learning framework to jointly learn from large-scale in-the-wild 2D and indoor/synthetic 3D data. We also present a simple temporal network that exploits temporal and structural cues present in predicted pose sequences to temporally harmonize the pose estimations. We carefully analyze the proposed contributions through loss surface visualizations and sensitivity analysis to facilitate deeper understanding of their working mechanism. Our complete pipeline improves the state-of-the-art by 11.8% and 12% on Human3.6M and MPI-INF-3DHP, respectively, and runs at 30 FPS on a commodity graphics card.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera
cs.CV 2019-07 unverdicted novelty 7.0

A dual-branch decoder network trained on the new xR-EgoPose synthetic dataset achieves state-of-the-art egocentric 3D pose estimation from HMD fish-eye cameras and generalizes to real footage.