Mo2Cap2: Real-time Mobile 3D Motion Capture with a Cap-mounted Fisheye Camera

· 2018 · cs.CV · arXiv 1803.05959

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We propose the first real-time approach for the egocentric estimation of 3D human body pose in a wide range of unconstrained everyday activities. This setting has a unique set of challenges, such as mobility of the hardware setup, and robustness to long capture sessions with fast recovery from tracking failures. We tackle these challenges based on a novel lightweight setup that converts a standard baseball cap to a device for high-quality pose estimation based on a single cap-mounted fisheye camera. From the captured egocentric live stream, our CNN based 3D pose estimation approach runs at 60Hz on a consumer-level GPU. In addition to the novel hardware setup, our other main contributions are: 1) a large ground truth training corpus of top-down fisheye images and 2) a novel disentangled 3D pose estimation approach that takes the unique properties of the egocentric viewpoint into account. As shown by our evaluation, we achieve lower 3D joint error as well as better 2D overlay than the existing baselines.

representative citing papers

Ultra Diffusion Poser: Diffusion-Based Human Motion Tracking From Sparse Inertial Sensors and Ranging-Based Between-Sensor Distances

cs.CV · 2026-06-01 · unverdicted · novelty 6.0

Ultra Diffusion Poser improves sparse inertial human pose estimation by reconstructing 3D sensor layouts from UWB ranging measurements and using UWB-Diffusion Guidance in a diffusion model, claiming up to 22% lower joint position error than prior work.

citing papers explorer

Showing 1 of 1 citing paper.

Ultra Diffusion Poser: Diffusion-Based Human Motion Tracking From Sparse Inertial Sensors and Ranging-Based Between-Sensor Distances cs.CV · 2026-06-01 · unverdicted · none · ref 42 · internal anchor
Ultra Diffusion Poser improves sparse inertial human pose estimation by reconstructing 3D sensor layouts from UWB ranging measurements and using UWB-Diffusion Guidance in a diffusion model, claiming up to 22% lower joint position error than prior work.

Mo2Cap2: Real-time Mobile 3D Motion Capture with a Cap-mounted Fisheye Camera

fields

years

verdicts

representative citing papers

citing papers explorer