Multimodal Diffusion Forcing trains a diffusion model on partially masked multimodal robot trajectories to learn temporal and cross-modal dependencies for forceful manipulation.
Video diffusion models,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2025 2verdicts
UNVERDICTED 2representative citing papers
FCVSR is a frequency-aware deep model for compressed video super-resolution using MGAA and MFFR modules plus contrastive loss, achieving up to 0.14 dB PSNR gain on three public datasets.
citing papers explorer
-
Multimodal Diffusion Forcing for Forceful Manipulation
Multimodal Diffusion Forcing trains a diffusion model on partially masked multimodal robot trajectories to learn temporal and cross-modal dependencies for forceful manipulation.
-
FCVSR: A Frequency-aware Method for Compressed Video Super-Resolution
FCVSR is a frequency-aware deep model for compressed video super-resolution using MGAA and MFFR modules plus contrastive loss, achieving up to 0.14 dB PSNR gain on three public datasets.