Pretraining on 1M wild videos followed by post-training on curated data yields high-fidelity feedforward 3D avatars that generalize across identities, clothing, and lighting with emergent relightability and loose-garment support.
Up2you: Fast reconstruc- tion of yourself from unconstrained photo collections
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 3years
2026 3verdicts
UNVERDICTED 3representative citing papers
CoMoVi co-generates 3D human motions and 2D videos synchronously in a single diffusion denoising loop using 3D-to-2D projection and dual-branch diffusion with 3D-2D cross attentions.
Skelebones compresses 4D Gaussian shapes into compact, controllable bones and skeletons, delivering 17.3% PSNR gains over LBS and 21.7% over BoB for unseen poses while preserving reconstruction quality.
citing papers explorer
-
Large-scale Codec Avatars: The Unreasonable Effectiveness of Large-scale Avatar Pretraining
Pretraining on 1M wild videos followed by post-training on curated data yields high-fidelity feedforward 3D avatars that generalize across identities, clothing, and lighting with emergent relightability and loose-garment support.
-
CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos
CoMoVi co-generates 3D human motions and 2D videos synchronously in a single diffusion denoising loop using 3D-to-2D projection and dual-branch diffusion with 3D-2D cross attentions.
-
GaussiAnimate: Reconstruct and Rig Animatable Categories with Level of Dynamics
Skelebones compresses 4D Gaussian shapes into compact, controllable bones and skeletons, delivering 17.3% PSNR gains over LBS and 21.7% over BoB for unseen poses while preserving reconstruction quality.