Hand Pose Estimation via Latent 2.5D Heatmap Regression

· 2018 · cs.CV · arXiv 1804.09534

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Estimating the 3D pose of a hand is an essential part of human-computer interaction. Estimating 3D pose using depth or multi-view sensors has become easier with recent advances in computer vision, however, regressing pose from a single RGB image is much less straightforward. The main difficulty arises from the fact that 3D pose requires some form of depth estimates, which are ambiguous given only an RGB image. In this paper we propose a new method for 3D hand pose estimation from a monocular image through a novel 2.5D pose representation. Our new representation estimates pose up to a scaling factor, which can be estimated additionally if a prior of the hand size is given. We implicitly learn depth maps and heatmap distributions with a novel CNN architecture. Our system achieves the state-of-the-art estimation of 2D and 3D hand pose on several challenging datasets in presence of severe occlusions.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

TouchAnything: A Dataset and Framework for Bimanual Tactile Estimation from Egocentric Video

cs.RO · 2026-05-13 · unverdicted · novelty 7.0

EgoTouch is a new multi-view egocentric dataset with dense bimanual tactile supervision, and TouchAnything is a baseline framework showing that wrist views improve vision-based tactile prediction over egocentric input alone.

citing papers explorer

Showing 1 of 1 citing paper.

TouchAnything: A Dataset and Framework for Bimanual Tactile Estimation from Egocentric Video cs.RO · 2026-05-13 · unverdicted · none · ref 15 · internal anchor
EgoTouch is a new multi-view egocentric dataset with dense bimanual tactile supervision, and TouchAnything is a baseline framework showing that wrist views improve vision-based tactile prediction over egocentric input alone.

Hand Pose Estimation via Latent 2.5D Heatmap Regression

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer