Dynamic 3D Gaussian Fields for Urban Areas

Jonas Kulhanek; Lorenzo Porzi; Marc Pollefeys; Peter Kontschieder; Samuel Rota Bul\`o; Tobias Fischer

arxiv: 2406.03175 · v2 · pith:CBE4DN4Ynew · submitted 2024-06-05 · 💻 cs.CV

Dynamic 3D Gaussian Fields for Urban Areas

Tobias Fischer , Jonas Kulhanek , Samuel Rota Bul\`o , Lorenzo Porzi , Marc Pollefeys , Peter Kontschieder This is my paper

classification 💻 cs.CV

keywords sceneareasdynamicneuralrenderingspeedsurbanappearance

0 comments

read the original abstract

We present an efficient neural 3D scene representation for novel-view synthesis (NVS) in large-scale, dynamic urban areas. Existing works are not well suited for applications like mixed-reality or closed-loop simulation due to their limited visual quality and non-interactive rendering speeds. Recently, rasterization-based approaches have achieved high-quality NVS at impressive speeds. However, these methods are limited to small-scale, homogeneous data, i.e. they cannot handle severe appearance and geometry variations due to weather, season, and lighting and do not scale to larger, dynamic areas with thousands of images. We propose 4DGF, a neural scene representation that scales to large-scale dynamic urban areas, handles heterogeneous input data, and substantially improves rendering speeds. We use 3D Gaussians as an efficient geometry scaffold while relying on neural fields as a compact and flexible appearance model. We integrate scene dynamics via a scene graph at global scale while modeling articulated motions on a local level via deformations. This decomposed approach enables flexible scene composition suitable for real-world applications. In experiments, we surpass the state-of-the-art by over 3 dB in PSNR and more than 200 times in rendering speed.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

ProDiG: Progressive Diffusion-Guided Gaussian Splatting for Aerial to Ground Reconstruction
cs.CV 2026-04 unverdicted novelty 7.0

ProDiG progressively transforms aerial Gaussian splats into coherent ground-level 3D reconstructions via diffusion guidance and specialized attention modules.
Effective Multi-sensor Conditioning for Street-view Novel-view Synthesis
cs.CV 2026-06 unverdicted novelty 6.0

StreetNVS presents a multi-sensor conditioned video diffusion framework for street-view novel view synthesis that outperforms baselines with sparse LiDAR and handles extreme out-of-trajectory paths on the Waymo dataset.
Visually-grounded Humanoid Agents
cs.CV 2026-04 unverdicted novelty 6.0

A coupled world-agent framework uses 3D Gaussian reconstruction and first-person RGB-D perception with iterative planning to enable goal-directed, collision-avoiding humanoid behavior in novel reconstructed scenes.
FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting
cs.CV 2025-11 unverdicted novelty 6.0

FACT-GS allocates higher texture sampling density to high-frequency areas in 2D Gaussian Splatting through a learnable deformation field, recovering sharper details at the same parameter budget.