CMC decouples trajectory control and text-conditioned motion completion with selective inpainting to achieve state-of-the-art accuracy and quality in multimodal human motion generation.
Hoi-diff: Text-driven syn- thesis of 3d human-object interactions using diffusion mod- els
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 2roles
background 1polarities
background 1representative citing papers
GenHSI is a training-free three-stage pipeline that turns a scene image, character image, and complex HSI prompt into long videos with plausible chained interactions by generating atomic actions, 3D keyframes via 2D inpainting plus optimization, and then feeding them to pre-trained video diffusion.
citing papers explorer
-
Coordinating Multiple Conditions for Trajectory-Controlled Human Motion Generation
CMC decouples trajectory control and text-conditioned motion completion with selective inpainting to achieve state-of-the-art accuracy and quality in multimodal human motion generation.
-
GenHSI: Controllable Generation of Human-Scene Interaction Videos
GenHSI is a training-free three-stage pipeline that turns a scene image, character image, and complex HSI prompt into long videos with plausible chained interactions by generating atomic actions, 3D keyframes via 2D inpainting plus optimization, and then feeding them to pre-trained video diffusion.