EchoMotion: Unified Human Video, Motion Generation via Dual-Modality Diffusion Transformer · 2025 · arXiv preprint arXiv:2512.18814

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos

cs.CV · 2026-01-15 · unverdicted · novelty 7.0

CoMoVi co-generates 3D human motions and 2D videos synchronously in a single diffusion denoising loop using 3D-to-2D projection and dual-branch diffusion with 3D-2D cross attentions.

AnyAct: Towards Human Reenactment of Character Motion From Video

cs.CV · 2026-05-15 · unverdicted · novelty 6.0 · 2 refs

AnyAct generates editable human reenactments from character videos via conditional motion generation from transferable sparse local 2D articulated cues, with designs for human-only supervision and global-local decoupling.

citing papers explorer

Showing 2 of 2 citing papers.

CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos · cs.CV · 2026-01-15 · unverdicted · none · ref 106
AnyAct: Towards Human Reenactment of Character Motion From Video · cs.CV · 2026-05-15 · unverdicted · none · ref 64 · 2 links
