Hands Deep in Deep Learning for Hand Pose Estimation

Markus Oberweger; Paul Wohlhart; Vincent Lepetit

arxiv: 1502.06807 · v2 · pith:YFRGK6ZPnew · submitted 2015-02-24 · 💻 cs.CV

Hands Deep in Deep Learning for Hand Pose Estimation

Markus Oberweger , Paul Wohlhart , Vincent Lepetit This is my paper

classification 💻 cs.CV

keywords accuracydeephandposeseveralsignificantlyallowambiguities

0 comments

read the original abstract

We introduce and evaluate several architectures for Convolutional Neural Networks to predict the 3D joint locations of a hand given a depth map. We first show that a prior on the 3D pose can be easily introduced and significantly improves the accuracy and reliability of the predictions. We also show how to use context efficiently to deal with ambiguities between fingers. These two contributions allow us to significantly outperform the state-of-the-art on several challenging benchmarks, both in terms of accuracy and computation times.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction
cs.CV 2026-05 unverdicted novelty 6.0

GeoHand adapts priors from a general-scene geometry estimator via a GeoAdapter, gated fusion, and keypoint-queried refiner to reach SOTA monocular 3D hand reconstruction on FreiHAND, DexYCB, and HO3Dv3 under heavy occlusion.
Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention
cs.CV 2019-07 unverdicted novelty 6.0

DG-STA builds dynamic graphs from hand skeletons, applies spatial-temporal self-attention to learn features, and uses a mask to cut cost by 99%, outperforming prior methods on DHG-14/28 and SHREC'17.
Touchless Intraoperative Image Access System Based on Vision-Based Hand Tracking
cs.CV 2026-04 unverdicted novelty 3.0

A vision-based system maps real-time hand gestures from a single camera to image translation, rotation, and zoom commands for touchless intraoperative navigation.