Mogents: Motion generation based on spatial-temporal joint modeling
1 Pith paper cites this work. Polarity classification is still being indexed.
fields: cs.CV (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Motion-Adapter: A Diffusion Model Adapter for Text-to-Motion Generation of Compound Actions
Motion-Adapter improves text-to-motion diffusion models for compound actions by using decoupled cross-attention maps as structural masks during denoising to produce more coherent full-body motions.