MotionHiFlow generates text-aligned 3D human motions using hierarchical flow matching across temporal scales, cross-scale transitions, a Text-Motion Diffusion Transformer, and a topology-aware Motion VAE, achieving state-of-the-art results on HumanML3D and KIT-ML.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Tora3 uses shared object trajectories as kinematic priors to jointly guide visual motion and acoustic events in audio-video generation, improving realism and synchronization.
citing papers explorer
-
MotionHiFlow: Text-to-motion via hierarchical flow matching
MotionHiFlow generates text-aligned 3D human motions using hierarchical flow matching across temporal scales, cross-scale transitions, a Text-Motion Diffusion Transformer, and a topology-aware Motion VAE, achieving state-of-the-art results on HumanML3D and KIT-ML.
-
Tora3: Trajectory-Guided Audio-Video Generation with Physical Coherence
Tora3 uses shared object trajectories as kinematic priors to jointly guide visual motion and acoustic events in audio-video generation, improving realism and synchronization.