The paper presents a multimodal framework, dataset, and reconstruction pipeline to create immersive volumetric videos supporting large 6-DoF audiovisual interaction from real multi-view captures.
MoDGS : Dynamic gaussian splatting from causually-captured monocular videos
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.CV 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
PD-4DGS decomposes 4DGS into static scaffold, global deformation, and local refinement layers using hierarchical decomposition and custom losses, achieving over 60% bitstream reduction and reducing first-frame latency to about 1.7 seconds on 2 Mbps links.
By fine-tuning DUST3R to output per-timestep pointmaps on scarce dynamic video datasets, MonST3R achieves stronger video depth and pose estimation without explicit motion modeling.
citing papers explorer
-
Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagement
The paper presents a multimodal framework, dataset, and reconstruction pipeline to create immersive volumetric videos supporting large 6-DoF audiovisual interaction from real multi-view captures.
-
PD-4DGS:Progressive Decomposition of 4D Gaussian Splatting for Bandwidth-Adaptive Dynamic Scene Streaming
PD-4DGS decomposes 4DGS into static scaffold, global deformation, and local refinement layers using hierarchical decomposition and custom losses, achieving over 60% bitstream reduction and reducing first-frame latency to about 1.7 seconds on 2 Mbps links.
-
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
By fine-tuning DUST3R to output per-timestep pointmaps on scarce dynamic video datasets, MonST3R achieves stronger video depth and pose estimation without explicit motion modeling.