SAM 3D Animal is the first promptable framework for multi-animal 3D reconstruction from single images, built on SMAL+ and trained on the new Herd3D dataset, achieving SOTA results on Animal3D, APTv2, and Animal Kingdom benchmarks.
End-to-end recovery of human shape and pose
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 4years
2026 4verdicts
UNVERDICTED 4roles
background 3polarities
background 3representative citing papers
A new method accumulates historical pose features across layers in a Transformer network to reach state-of-the-art 3D human pose estimation accuracy.
UniCon3R infers 4D contact from pose and geometry to correct human mesh and scene alignment in monocular video, yielding more physically plausible joint reconstructions than prior feed-forward methods.
RAM outperforms prior methods on PoseTrack and 3DPW for zero-shot multi-person 3D motion tracking and reconstruction by fusing semantic tracking, memory-augmented pose estimation, and predictive fusion.
citing papers explorer
-
SAM 3D Animal: Promptable Animal 3D Reconstruction from Images in the Wild
SAM 3D Animal is the first promptable framework for multi-animal 3D reconstruction from single images, built on SMAL+ and trained on the new Herd3D dataset, achieving SOTA results on Animal3D, APTv2, and Animal Kingdom benchmarks.
-
L2A: Learning to Accumulate Pose History for Accurate 3D Human Pose Estimation
A new method accumulates historical pose features across layers in a Transformer network to reach state-of-the-art 3D human pose estimation accuracy.
-
UniCon3R: Unified Contact-aware 4D Human-Scene Reconstruction from Monocular Video
UniCon3R infers 4D contact from pose and geometry to correct human mesh and scene alignment in monocular video, yielding more physically plausible joint reconstructions than prior feed-forward methods.
-
RAM: Recover Any 3D Human Motion in-the-Wild
RAM outperforms prior methods on PoseTrack and 3DPW for zero-shot multi-person 3D motion tracking and reconstruction by fusing semantic tracking, memory-augmented pose estimation, and predictive fusion.