pith. machine review for the scientific record. sign in

arxiv: 2603.13783 · v2 · submitted 2026-03-14 · 💻 cs.CV

RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting

Pith reviewed 2026-05-15 11:45 UTC · model grok-4.3

classification 💻 cs.CV
keywords 4D Gaussian Splattingcontinuous-time reconstructiontemporal retimingdynamic scene renderingoptical flow supervisionghost-free interpolationnon-rigid deformation
0
0 comments X

The pith

RetimeGS defines explicit temporal behavior for 3D Gaussians to enable continuous-time rendering of dynamic scenes without ghosting.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Existing 4D Gaussian Splatting methods overfit to discrete frame indices and produce ghosting artifacts when interpolating to intermediate timestamps. RetimeGS treats this as temporal aliasing and introduces a representation that explicitly defines how each 3D Gaussian evolves over time. Optical flow-guided initialization, triple-rendering supervision, and related strategies are added to enforce smooth and consistent interpolation. The result matters for slow-motion playback, temporal editing, and post-production, where scenes must be rendered at arbitrary continuous times rather than only at captured frames. Tests on data with fast motion, non-rigid deformation, and heavy occlusions show improved quality and coherence compared with prior 4DGS approaches.

Core claim

RetimeGS is a 4D Gaussian Splatting representation that explicitly defines the temporal behavior of the 3D Gaussian and mitigates temporal aliasing. Optical flow-guided initialization and triple-rendering supervision, together with other targeted strategies, enable ghost-free, temporally coherent rendering at arbitrary timestamps even under large motions.

What carries the argument

Explicit temporal definition of each 3D Gaussian, supported by optical flow initialization and triple-rendering supervision, which together prevent discrete-frame overfitting and enforce consistent continuous-time interpolation.

If this is right

  • Rendering becomes possible at any continuous timestamp rather than only at discrete captured frames.
  • Temporal coherence holds under fast motion, non-rigid deformation, and severe occlusions where prior methods fail.
  • Applications such as slow-motion playback and temporal video editing gain direct support.
  • Interpolation between frames no longer requires separate post-processing to suppress artifacts.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same explicit-timing idea could be applied to other continuous scene representations beyond Gaussians.
  • If training cost remains moderate, the method could support interactive temporal editing in consumer video tools.
  • Scenes reconstructed from fewer cameras might benefit if the temporal supervision reduces reliance on dense view coverage.

Load-bearing premise

Temporal aliasing from discrete-frame overfitting is the primary cause of ghosting, and the added optical-flow and supervision strategies will eliminate it reliably across varied scenes without introducing new artifacts.

What would settle it

If rendering RetimeGS at timestamps between training frames on a fast-motion sequence still produces visible ghosting, the claim that the method achieves effective continuous-time reconstruction would be false.

Figures

Figures reproduced from arXiv: 2603.13783 by Li Ma, Pedro V. Sander, Xuezhen Wang, Yulin Shen, Zeyu Wang.

Figure 1
Figure 1. Figure 1: We introduce a 4DGS representation with tailored training strategies that enables interpolating arbitrary intermediate frames, [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Illustration of temporal overfitting to input frames [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Pipeline Overview. We represent a dynamic scene using a novel 4D representation that combines regularized temporal opacity with smooth spline-based spatial positioning. By leveraging tailored training strategies using RGB images and bidirectional optical flow, our method can reconstruct arbitrary intermediate frames under sparse temporal sampling and large motion. Specifically, at initialization, the tempo… view at source ↗
Figure 4
Figure 4. Figure 4: Qualitative comparison. Results on DNA-Rendering Dataset [4] (w/o GT held-out views, left) and Stage-Capture Dataset (w/ GT held-out views, right). The red boxes show the corresponding zoomed-in views for detailed comparison. distorted textures. In the bottom-right scene, where the fin￾gers dynamically emerge from the clothing, the method fails to estimate even coarse correspondences, causing the hand to s… view at source ↗
Figure 5
Figure 5. Figure 5: (5a) The texture is distorted when flow-related components are removed. (5b) Without triple rendering, the two adjacent primitive groups each capture only part of the content in the input frames they jointly cover: the previous group reconstructs the right texture in the circled region, while the next group reconstructs the left texture [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Ablation on dynamic stretching. The magenta is ren￾dered using static stretched primitives, and the teal is rendered us￾ing dynamic primitives (with static background removed). our dynamic stretching in [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 8
Figure 8. Figure 8: Failure case under extremely low capture FPS. Our method struggles to interpolate intermediate frames when the inter-frame motion becomes too large due to low temporal sam￾pling or large motion. C. More Experiment Results C.1. Per Scene Breakdown In [PITH_FULL_IMAGE:figures/full_fig_p012_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: Comparison on rendered and pseudo-GT flow map. [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗
Figure 9
Figure 9. Figure 9: Qualitative comparison on Neural3DV: Scene with fire (rapid opacity changes) and non-stage-capture setting. [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗
read the original abstract

Temporal retiming, the ability to reconstruct and render dynamic scenes at arbitrary timestamps, is crucial for applications such as slow-motion playback, temporal editing, and post-production. However, most existing 4D Gaussian Splatting (4DGS) methods overfit at discrete frame indices but struggle to represent continuous-time frames, leading to ghosting artifacts when interpolating between timestamps. We identify this limitation as a form of temporal aliasing and propose RetimeGS, a simple yet effective 4DGS representation that explicitly defines the temporal behavior of the 3D Gaussian and mitigates temporal aliasing. To achieve smooth and consistent interpolation, we incorporate optical flow-guided initialization and supervision, triple-rendering supervision, and other targeted strategies. Together, these components enable ghost-free, temporally coherent rendering even under large motions. Experiments on datasets featuring fast motion, non-rigid deformation, and severe occlusions demonstrate that RetimeGS achieves superior quality and coherence over state-of-the-art methods.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces RetimeGS, a 4D Gaussian Splatting method that explicitly defines continuous-time parameters for 3D Gaussians to mitigate temporal aliasing and enable ghost-free rendering at arbitrary timestamps. It augments prior 4DGS with optical flow-guided initialization, triple-rendering supervision, and related strategies, claiming superior quality and temporal coherence over state-of-the-art methods on datasets with fast motion, non-rigid deformation, and severe occlusions.

Significance. If the central claims hold with robust validation, the work would meaningfully advance continuous-time dynamic scene reconstruction, directly benefiting applications such as slow-motion playback and temporal editing. The targeted fixes for aliasing in 4DGS represent a practical incremental contribution, though its significance depends on demonstrating that gains are not artifacts of flow-dependent supervision.

major comments (2)
  1. [Method and Experiments] The central construction relies on optical-flow-guided initialization and flow-based loss terms to suppress temporal aliasing, yet the manuscript provides no ablation isolating performance under accurate versus noisy or failed flow estimates (known to occur precisely on the fast-motion and occlusion datasets highlighted in the abstract). This leaves open whether reported gains are robust or flow-dependent; a concrete test with synthetic flow degradation or alternative initializers is needed to support the superiority claim.
  2. [Abstract and §4] No quantitative metrics, error analysis, or ablation tables are referenced in the abstract or early sections to substantiate the 'superior quality and coherence' claim; the soundness assessment requires explicit reporting of PSNR/SSIM/LPIPS deltas versus baselines on the targeted regimes, with statistical significance.
minor comments (2)
  1. [Method] Notation for continuous-time Gaussian parameters (e.g., time-dependent means and covariances) should be introduced with explicit equations early in the method section to clarify how they differ from discrete-frame 4DGS.
  2. [Figures] Figure captions for qualitative results should include the specific timestamps used for interpolation and note any failure cases observed.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the major comments point-by-point below and will incorporate revisions to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Method and Experiments] The central construction relies on optical-flow-guided initialization and flow-based loss terms to suppress temporal aliasing, yet the manuscript provides no ablation isolating performance under accurate versus noisy or failed flow estimates (known to occur precisely on the fast-motion and occlusion datasets highlighted in the abstract). This leaves open whether reported gains are robust or flow-dependent; a concrete test with synthetic flow degradation or alternative initializers is needed to support the superiority claim.

    Authors: We agree that robustness to flow estimation errors warrants explicit validation. In the revised manuscript we will add a dedicated ablation that synthetically degrades the input optical flow (via additive Gaussian noise and simulated occlusions on fast-motion regions) and reports the resulting changes in PSNR/SSIM/LPIPS. This will isolate the contribution of the explicit continuous-time Gaussian parameterization and triple-rendering supervision, which are intended to provide complementary robustness beyond flow guidance. revision: yes

  2. Referee: [Abstract and §4] No quantitative metrics, error analysis, or ablation tables are referenced in the abstract or early sections to substantiate the 'superior quality and coherence' claim; the soundness assessment requires explicit reporting of PSNR/SSIM/LPIPS deltas versus baselines on the targeted regimes, with statistical significance.

    Authors: We will revise the abstract and introduction to include the key quantitative deltas (average PSNR, SSIM, and LPIPS improvements versus the strongest baselines on the fast-motion and occlusion subsets). We will also add error analysis with standard deviations computed across scenes and multiple random seeds to support statistical significance. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation builds on prior 4DGS with independent external signals

full rationale

The paper identifies temporal aliasing in existing 4DGS methods as the source of ghosting during interpolation and introduces RetimeGS, which explicitly defines continuous-time behavior for 3D Gaussians. It adds optical flow-guided initialization, triple-rendering supervision, and related strategies. These components rely on external inputs (optical flow estimators) and supervision signals that are not defined in terms of the method's own outputs or predictions. No equations reduce by construction to fitted parameters renamed as predictions, no load-bearing uniqueness theorems are imported via self-citation, and no ansatz is smuggled through prior work by the same authors. The central claims remain independent of the reported results.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review based solely on abstract; no explicit free parameters, axioms, or invented entities are named. The method appears to rest on standard Gaussian splatting assumptions plus optical flow from prior work.

pith-pipeline@v0.9.0 · 5475 in / 1106 out tokens · 43219 ms · 2026-05-15T11:45:55.421976+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · 1 internal anchor

  1. [1]

    Per-Gaussian Embedding- Based Deformation for Deformable 3D Gaussian Splatting

    Jeongmin Bae, Seoha Kim, Youngsik Yun, Hahyun Lee, Gun Bang, and Youngjung Uh. Per-Gaussian Embedding- Based Deformation for Deformable 3D Gaussian Splatting. InEuropean Conference on Computer Vision, pages 321–

  2. [2]

    A Class of Local Interpo- lating Splines

    Edwin Catmull and Raphael Rom. A Class of Local Interpo- lating Splines. InComputer Aided Geometric Design, pages 317–326. Elsevier, 1974. 2, 4, 5

  3. [3]

    4DSloMo: 4D Reconstruc- tion for High Speed Scene with Asynchronous Capture.Pro- ceedings of the SIGGRAPH Asia 2025 Conference Papers, pages 1–11, 2025

    Yutian Chen, Shi Guo, Tianshuo Yang, Lihe Ding, Xiuyuan Yu, Jinwei Gu, and Tianfan Xue. 4DSloMo: 4D Reconstruc- tion for High Speed Scene with Asynchronous Capture.Pro- ceedings of the SIGGRAPH Asia 2025 Conference Papers, pages 1–11, 2025. 3

  4. [4]

    DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-Centric Render- ing

    Wei Cheng, Ruixiang Chen, Siming Fan, Wanqi Yin, Keyu Chen, Zhongang Cai, Jingbo Wang, Yang Gao, Zheng- ming Yu, Zhengyu Lin, Daxuan Ren, Lei Yang, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Bo Dai, and Kwan-Yee Lin. DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-Centric Render- ing. InProceedings of the IEEE/CVF...

  5. [5]

    4D-Rotor Gaussian Splat- ting: Towards Efficient Novel View Synthesis for Dynamic Scenes

    Yuanxing Duan, Fangyin Wei, Qiyu Dai, Yuhang He, Wen- zheng Chen, and Baoquan Chen. 4D-Rotor Gaussian Splat- ting: Towards Efficient Novel View Synthesis for Dynamic Scenes. InACM SIGGRAPH Conference Papers, pages 1– 11, 2024. 2

  6. [6]

    GaussianFlow: Splatting Gaussian Dynamics for 4D Con- tent Creation.Transactions on Machine Learning Research,

    Quankai Gao, Qiangeng Xu, Zhe Cao, Ben Mildenhall, Wen- chao Ma, Le Chen, Danhang Tang, and Ulrich Neumann. GaussianFlow: Splatting Gaussian Dynamics for 4D Con- tent Creation.Transactions on Machine Learning Research,

  7. [7]

    Motion-Aware 3D Gaussian Splatting for Effi- cient Dynamic Scene Reconstruction.IEEE Transactions on Circuits and Systems for Video Technology, 2024

    Zhiyang Guo, Wengang Zhou, Li Li, Min Wang, and Houqiang Li. Motion-Aware 3D Gaussian Splatting for Effi- cient Dynamic Scene Reconstruction.IEEE Transactions on Circuits and Systems for Video Technology, 2024. 2

  8. [8]

    Forge4D: Feed-Forward 4D Human Reconstruction and In- terpolation from Uncalibrated Sparse-view Videos.arXiv preprint arXiv:2509.24209, 2025

    Yingdong Hu, Yisheng He, Jinnan Chen, Weihao Yuan, Ke- jie Qiu, Zehong Lin, Siyu Zhu, Zilong Dong, and Jun Zhang. Forge4D: Feed-Forward 4D Human Reconstruction and In- terpolation from Uncalibrated Sparse-view Videos.arXiv preprint arXiv:2509.24209, 2025. 3

  9. [9]

    SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

    Yi-Hua Huang, Yang-Tian Sun, Ziyi Yang, Xiaoyang Lyu, Yan-Pei Cao, and Xiaojuan Qi. SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes. InPro- ceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4220–4230, 2024. 2

  10. [10]

    Co- Tracker: It is Better to Track Together

    Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, and Christian Rupprecht. Co- Tracker: It is Better to Track Together. InEuropean Confer- ence on Computer Vision, pages 18–35, 2024. 2

  11. [11]

    3D Gaussian Splatting for Real-Time Radiance Field Rendering.ACM Transactions on Graphics (TOG), 42(4), 2023

    Bernhard Kerbl, Georgios Kopanas, Thomas Leimkuehler, and George Drettakis. 3D Gaussian Splatting for Real-Time Radiance Field Rendering.ACM Transactions on Graphics (TOG), 42(4), 2023. 3

  12. [12]

    3D Gaussian Splatting as Markov Chain Monte Carlo.Advances in Neural Information Processing Systems, 2024

    Shakiba Kheradmand, Daniel Rebain, Gopal Sharma, Wei- wei Sun, Yang-Che Tseng, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, and Kwang Moo Yi. 3D Gaussian Splatting as Markov Chain Monte Carlo.Advances in Neural Information Processing Systems, 2024. 5, 8, 1, 3

  13. [13]

    AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation.arXiv preprint arXiv:2506.01061, 2025

    Dahyeon Kye, Changhyun Roh, Sukhun Ko, Chanho Eom, and Jihyong Oh. AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation.arXiv preprint arXiv:2506.01061, 2025. 3

  14. [14]

    DGD: Dynamic 3D Gaussians Distillation

    Isaac Labe, Noam Issachar, Itai Lang, and Sagie Benaim. DGD: Dynamic 3D Gaussians Distillation. InEuropean Conference on Computer Vision, pages 361–378, 2024. 2

  15. [15]

    Fully Explicit Dynamic Gaussian Splat- ting.Advances in Neural Information Processing Systems, 37:5384–5409, 2024

    Junoh Lee, ChangYeon Won, Hyunjun Jung, Inhwan Bae, and Hae-Gon Jeon. Fully Explicit Dynamic Gaussian Splat- ting.Advances in Neural Information Processing Systems, 37:5384–5409, 2024. 2, 3

  16. [16]

    Harley, Leonidas Guibas, and Kostas Daniilidis

    Jiahui Lei, Yijia Weng, Adam W. Harley, Leonidas Guibas, and Kostas Daniilidis. MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds. InProceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 6165–6177, 2025. 2

  17. [17]

    Neural 3D Video Synthesis from Multi-View Video

    Tianye Li, Mira Slavcheva, Michael Zollhoefer, Simon Green, Christoph Lassner, Changil Kim, Tanner Schmidt, Steven Lovegrove, Michael Goesele, Richard Newcombe, et al. Neural 3D Video Synthesis from Multi-View Video. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5521–5531, 2022. 2, 3

  18. [18]

    Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis

    Zhan Li, Zhang Chen, Zhong Li, and Yi Xu. Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8508– 8520, 2024. 2, 3, 6, 1, 4

  19. [19]

    GauFRe: Gaussian Deformation Fields for Real-time Dy- namic Novel View Synthesis

    Yiqing Liang, Numair Khan, Zhengqin Li, Thu Nguyen- Phuoc, Douglas Lanman, James Tompkin, and Lei Xiao. GauFRe: Gaussian Deformation Fields for Real-time Dy- namic Novel View Synthesis. InProc. IEEE/CVF Winter Conference on Applications of Computer Vision, 2025. 2

  20. [20]

    DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos.Advances in Neural Information Pro- cessing Systems, 2025

    Chieh Hubert Lin, Zhaoyang Lv, Songyin Wu, Zhen Xu, Thu Nguyen-Phuoc, Hung-Yu Tseng, Julian Straub, Nu- mair Khan, Lei Xiao, Ming-Hsuan Yang, et al. DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos.Advances in Neural Information Pro- cessing Systems, 2025. 3 9

  21. [21]

    Global Motion Corresponder for 3D Point-Based Scene Interpola- tion under Large Motion

    Junru Lin, Chirag Vashist, Mikaela Angelina Uy, Colton Stearns, Xuan Luo, Leonidas Guibas, and Ke Li. Global Motion Corresponder for 3D Point-Based Scene Interpola- tion under Large Motion. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 7884– 7893, 2025. 3

  22. [22]

    3D Geometry-Aware Deformable Gaussian Splatting for Dynamic View Synthe- sis

    Zhicheng Lu, Xiang Guo, Le Hui, Tianrui Chen, Ming Yang, Xiao Tang, Feng Zhu, and Yuchao Dai. 3D Geometry-Aware Deformable Gaussian Splatting for Dynamic View Synthe- sis. InProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition, 2024. 2

  23. [23]

    Dynamic 3D Gaussians: Tracking by Persis- tent Dynamic View Synthesis

    Jonathon Luiten, Georgios Kopanas, Bastian Leibe, and Deva Ramanan. Dynamic 3D Gaussians: Tracking by Persis- tent Dynamic View Synthesis. InInternational Conference on 3D Vision, pages 800–809. IEEE, 2024. 2

  24. [24]

    In-2-4D: Inbetweening from Two Sin- gle View Images to 4D Generation.arXiv preprint arXiv:2504.08366, 2025

    Sauradip Nag, Daniel Cohen-Or, Hao Zhang, and Ali Mahdavi-Amiri. In-2-4D: Inbetweening from Two Sin- gle View Images to 4D Generation.arXiv preprint arXiv:2504.08366, 2025. 3

  25. [25]

    SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video

    Park, Jongmin and Bui, Minh-Quan Viet and Bello, Juan Luis Gonzalez and Moon, Jaeho and Oh, Jihyong and Kim, Munchurl. SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video. InProceedings of the Computer Vision and Pattern Recogni- tion Conference, pages 26866–26875, 2025. 2

  26. [26]

    PyTorch: An Imperative Style, High-Performance Deep Learning Library.Advances in Neural Information Process- ing Systems, pages 8026–8037, 2019

    Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, and Alban Desmai- son. PyTorch: An Imperative Style, High-Performance Deep Learning Library.Advances in Neural Information Process- ing Systems, pages 8026–8037, 2019. 6

  27. [27]

    PAPR in Motion: Seamless Point-level 3D Scene Interpolation

    Shichong Peng, Yanshu Zhang, and Ke Li. PAPR in Motion: Seamless Point-level 3D Scene Interpolation. InProceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21007–21016, 2024. 3

  28. [28]

    FILM: Frame In- terpolation for Large Motion

    Fitsum Reda, Janne Kontkanen, Eric Tabellion, Deqing Sun, Caroline Pantofaru, and Brian Curless. FILM: Frame In- terpolation for Large Motion. InEuropean Conference on Computer Vision, 2022. 6, 1, 2, 3

  29. [29]

    L4GM: Large 4D Gaussian Reconstruction Model.Advances in Neural Information Processing Systems, 37:56828–56858, 2024

    Jiawei Ren, Cheng Xie, Ashkan Mirzaei, Karsten Kreis, Zi- wei Liu, Antonio Torralba, Sanja Fidler, Seung Wook Kim, Huan Ling, et al. L4GM: Large 4D Gaussian Reconstruction Model.Advances in Neural Information Processing Systems, 37:56828–56858, 2024. 3

  30. [30]

    SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting

    Richard Shaw, Michal Nazarczuk, Jifei Song, Arthur Moreau, Sibi Catley-Chandar, Helisa Dhamo, and Eduardo P´erez-Pellitero. SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting. InEuropean Conference on Computer Vision, pages 37–54. Springer, 2024. 2

  31. [31]

    Wan: Open and Advanced Large-Scale Video Generative Models

    Team Wan, Ang Wang, Baole Ai, Bin Wen, Chaojie Mao, Chen-Wei Xie, Di Chen, Feiwu Yu, Haiming Zhao, Jianx- iao Yang, Jianyuan Zeng, Jiayu Wang, Jingfeng Zhang, Jin- gren Zhou, Jinkai Wang, Jixuan Chen, Kai Zhu, Kang Zhao, Keyu Yan, Lianghua Huang, Mengyang Feng, Ningyi Zhang, Pandeng Li, Pingyu Wu, Ruihang Chu, Ruili Feng, Shiwei Zhang, Siyang Sun, Tao Fan...

  32. [32]

    Effect of Frame Rate on User Experience, Performance, and Simulator Sickness in Virtual Reality.IEEE Transactions on Visualization and Computer Graphics, 29(5):2478–2488, 2023

    Jialin Wang, Rongkai Shi, Wenxuan Zheng, Weijie Xie, Do- minic Kao, and Hai-Ning Liang. Effect of Frame Rate on User Experience, Performance, and Simulator Sickness in Virtual Reality.IEEE Transactions on Visualization and Computer Graphics, 29(5):2478–2488, 2023. 1

  33. [33]

    VGGT: Visual Geometry Grounded Transformer

    Jianyuan Wang, Minghao Chen, Nikita Karaev, Andrea Vedaldi, Christian Rupprecht, and David Novotny. VGGT: Visual Geometry Grounded Transformer. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025. 6

  34. [34]

    Shape of Mo- tion: 4D Reconstruction from a Single Video

    Qianqian Wang, Vickie Ye, Hang Gao, Weijia Zeng, Jake Austin, Zhengqi Li, and Angjoo Kanazawa. Shape of Mo- tion: 4D Reconstruction from a Single Video. InInterna- tional Conference on Computer Vision (ICCV), 2025. 2

  35. [35]

    W AFT: Warping-Alone Field Transforms for Optical Flow.arXiv preprint arXiv:2506.21526, 2025

    Yihan Wang and Jia Deng. W AFT: Warping-Alone Field Transforms for Optical Flow.arXiv preprint arXiv:2506.21526, 2025. 3, 4

  36. [36]

    SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

    Yihan Wang, Lahav Lipson, and Jia Deng. SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow. In European Conference on Computer Vision, pages 36–54. Springer, 2024. 3, 4

  37. [37]

    FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction

    Yifan Wang, Peishan Yang, Zhen Xu, Jiaming Sun, Zhan- hua Zhang, Yong Chen, Hujun Bao, Sida Peng, and Xiaowei Zhou. FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction. InProceed- ings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21750–21760, 2025. 2, 5, 8

  38. [38]

    Bovik, H.R

    Zhou Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. Image Quality Assessment: From Error Visibility to Struc- tural Similarity.IEEE Transactions on Image Processing, 13 (4):600–612, 2004. 6

  39. [39]

    4D Gaussian Splatting for Real-Time Dynamic Scene Ren- dering

    Guanjun Wu, Taoran Yi, Jiemin Fang, Lingxi Xie, Xiaopeng Zhang, Wei Wei, Wenyu Liu, Qi Tian, and Xinggang Wang. 4D Gaussian Splatting for Real-Time Dynamic Scene Ren- dering. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20310– 20320, 2024. 2, 6, 3

  40. [40]

    StreamSplat: Towards Online Dynamic 3D Recon- struction from Uncalibrated Video Streams.arXiv preprint arXiv:2506.08862, 2025

    Zike Wu, Qi Yan, Xuanyu Yi, Lele Wang, and Renjie Liao. StreamSplat: Towards Online Dynamic 3D Recon- struction from Uncalibrated Video Streams.arXiv preprint arXiv:2506.08862, 2025. 3

  41. [41]

    Representing Long V olumet- ric Video with Temporal Gaussian Hierarchy.ACM Transac- tions on Graphics (TOG), 43(6):1–18, 2024

    Zhen Xu, Yinghao Xu, Zhiyuan Yu, Sida Peng, Jiaming Sun, Hujun Bao, and Xiaowei Zhou. Representing Long V olumet- ric Video with Temporal Gaussian Hierarchy.ACM Transac- tions on Graphics (TOG), 43(6):1–18, 2024. 2

  42. [42]

    4DGT: Learning a 4D Gaus- sian Transformer Using Real-World Monocular Videos.Ad- vances in Neural Information Processing Systems, 2025

    Zhen Xu, Zhengqin Li, Zhao Dong, Xiaowei Zhou, Richard Newcombe, and Zhaoyang Lv. 4DGT: Learning a 4D Gaus- sian Transformer Using Real-World Monocular Videos.Ad- vances in Neural Information Processing Systems, 2025. 3 10

  43. [43]

    Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction

    Ziyi Yang, Xinyu Gao, Wen Zhou, Shaohui Jiao, Yuqing Zhang, and Xiaogang Jin. Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vi- sion and Pattern Recognition, pages 20331–20341, 2024. 2

  44. [44]

    Real- time Photorealistic Dynamic Scene Representation and Ren- dering with 4D Gaussian Splatting

    Zeyu Yang, Hongye Yang, Zijie Pan, and Li Zhang. Real- time Photorealistic Dynamic Scene Representation and Ren- dering with 4D Gaussian Splatting. InInternational Confer- ence on Learning Representations, 2024. 2

  45. [45]

    gsplat: An Open-Source Library for Gaussian Splatting.Journal of Ma- chine Learning Research, 26(34):1–17, 2025

    Vickie Ye, Ruilong Li, Justin Kerr, Matias Turkulainen, Brent Yi, Zhuoyang Pan, Otto Seiskari, Jianbo Ye, Jeffrey Hu, Matthew Tancik, and Angjoo Kanazawa. gsplat: An Open-Source Library for Gaussian Splatting.Journal of Ma- chine Learning Research, 26(34):1–17, 2025. 1

  46. [46]

    TrackerSplat: Exploiting Point Tracking for Fast and Robust Dynamic 3D Gaussians Reconstruction

    Daheng Yin, Isaac Ding, Yili Jin, Jianxin Shi, and Jiangchuan Liu. TrackerSplat: Exploiting Point Tracking for Fast and Robust Dynamic 3D Gaussians Reconstruction. In SIGGRAPH Asia 2025 Conference Papers, 2025. 3

  47. [47]

    Mip-Splatting: Alias-free 3D Gaussian Splatting

    Zehao Yu, Anpei Chen, Binbin Huang, Torsten Sattler, and Andreas Geiger. Mip-Splatting: Alias-free 3D Gaussian Splatting. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 19447– 19456, 2024. 2

  48. [48]

    The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

    Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shecht- man, and Oliver Wang. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 586–595, 2018. 6

  49. [49]

    Dynamic Scene Reconstruc- tion: Recent Advance in Real-time Rendering and Stream- ing.arXiv preprint arXiv:2503.08166, 2025

    Jiaxuan Zhu and Hao Tang. Dynamic Scene Reconstruc- tion: Recent Advance in Real-time Rendering and Stream- ing.arXiv preprint arXiv:2503.08166, 2025. 2

  50. [50]

    temporal sum

    Ruijie Zhu, Yanzhe Liang, Hanzhi Chang, Jiacheng Deng, Jiahao Lu, Wenfei Yang, Tianzhu Zhang, and Yongdong Zhang. MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting.Advances in Neural In- formation Processing Systems, 37:101790–101817, 2024. 2 11 RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting Supplementary...