LHM: large animatable human reconstruction model from a single image in seconds

Lingteng Qiu, Xiaodong Gu, Peihao Li, Qi Zuo, Weichao Shen, Junfei Zhang, Kejie Qiu, Weihao Yuan, Guanying Chen, Zilong Dong, et al · 2025 · arXiv 2503.10625

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

read on arXiv browse 7 citing papers

citation-role summary

background 2 method 1

citation-polarity summary

background 2 use method 1

representative citing papers

HumANDiff: Articulated Noise Diffusion for Motion-Consistent Human Video Generation

cs.CV · 2026-04-07 · unverdicted · novelty 7.0

HumANDiff improves motion consistency in human video generation by sampling diffusion noise on an articulated human body template and adding joint appearance-motion prediction plus a geometric consistency loss.

Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining

cs.CV · 2026-04-02 · unverdicted · novelty 7.0

Pretraining on 1M wild videos followed by post-training on curated data yields high-fidelity feedforward 3D avatars that generalize across identities, clothing, and lighting with emergent relightability and loose-garment support.

VRGaussianAvatar: Integrating 3D Gaussian Avatars into VR

cs.CV · 2026-02-02 · conditional · novelty 7.0

VRGaussianAvatar enables real-time full-body 3D Gaussian Splatting avatars in VR from HMD tracking alone via inverse kinematics and binocular batching for efficient stereo rendering, outperforming mesh baselines in performance and user ratings.

High-Fidelity Single-Image Head Modeling with Industry-Grade Topology

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

A single-image head reconstruction method uses coarse-to-fine optimization with normal consistency, landmarks, and geometry-aware constraints on curvature and conformality to produce meshes with industry-grade topology and preserved facial identity.

Visually-grounded Humanoid Agents

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.

Generator-Refiner-Examiner: A Tri-Module Data Augmentation Framework for 3D Human Avatar Learning from Monocular Videos

cs.CV · 2026-05-22 · unverdicted · novelty 5.0

TrioMan is a tri-module data augmentation framework using a Generator for pose/camera perturbations, a Refiner with one-step diffusion, and an Examiner with dual-branch attention to improve 3D avatar learning from monocular videos, claiming better results than prior methods on two benchmarks.

Human Interaction-Aware 3D Reconstruction from a Single Image

cs.CV · 2026-04-07 · unverdicted · novelty 5.0

HUG3D uses group-instance multi-view diffusion and physics-based optimization to create physically plausible 3D reconstructions of interacting people from a single image.

citing papers explorer

Showing 7 of 7 citing papers.

HumANDiff: Articulated Noise Diffusion for Motion-Consistent Human Video Generation cs.CV · 2026-04-07 · unverdicted · none · ref 47
HumANDiff improves motion consistency in human video generation by sampling diffusion noise on an articulated human body template and adding joint appearance-motion prediction plus a geometric consistency loss.
Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining cs.CV · 2026-04-02 · unverdicted · none · ref 50
Pretraining on 1M wild videos followed by post-training on curated data yields high-fidelity feedforward 3D avatars that generalize across identities, clothing, and lighting with emergent relightability and loose-garment support.
VRGaussianAvatar: Integrating 3D Gaussian Avatars into VR cs.CV · 2026-02-02 · conditional · none · ref 42
VRGaussianAvatar enables real-time full-body 3D Gaussian Splatting avatars in VR from HMD tracking alone via inverse kinematics and binocular batching for efficient stereo rendering, outperforming mesh baselines in performance and user ratings.
High-Fidelity Single-Image Head Modeling with Industry-Grade Topology cs.CV · 2026-05-06 · unverdicted · none · ref 159
A single-image head reconstruction method uses coarse-to-fine optimization with normal consistency, landmarks, and geometry-aware constraints on curvature and conformality to produce meshes with industry-grade topology and preserved facial identity.
Visually-grounded Humanoid Agents cs.CV · 2026-04-09 · unverdicted · none · ref 72
A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.
Generator-Refiner-Examiner: A Tri-Module Data Augmentation Framework for 3D Human Avatar Learning from Monocular Videos cs.CV · 2026-05-22 · unverdicted · none · ref 52
TrioMan is a tri-module data augmentation framework using a Generator for pose/camera perturbations, a Refiner with one-step diffusion, and an Examiner with dual-branch attention to improve 3D avatar learning from monocular videos, claiming better results than prior methods on two benchmarks.
Human Interaction-Aware 3D Reconstruction from a Single Image cs.CV · 2026-04-07 · unverdicted · none · ref 35
HUG3D uses group-instance multi-view diffusion and physics-based optimization to create physically plausible 3D reconstructions of interacting people from a single image.

LHM: large animatable human reconstruction model from a single image in seconds

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer