hub Canonical reference

3d gaussian splatting for real-time radiance field rendering.ACM Trans

Bernhard Kerbl, Georgios Kopanas, Thomas Leimk ¨uhler, George Drettakis

Canonical reference. 71% of citing Pith papers cite this work as background.

58 Pith papers citing it

Background 71% of classified citations

browse 58 citing papers

hub tools

JSON dossier citing papers JSON

citation-role summary

background 5 baseline 1 method 1

citation-polarity summary

background 5 baseline 1 use method 1

representative citing papers

3DReflecNet: A Large-Scale Dataset for 3D Reconstruction of Reflective, Transparent, and Low-Texture Objects

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

3DReflecNet is a 22 TB+ dataset of over 120,000 synthetic and 1,000 real objects with millions of multi-view frames for benchmarking 3D reconstruction on reflective, transparent, and low-texture surfaces.

ULF-Loc: Unbiased Landmark Feature for Robust Visual Localization with 3D Gaussian Splatting

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

ULF-Loc removes bias from 3DGS landmark features via geometry-weighted fusion and consistency checks, cutting median translation error 17% while using 1/10 training time and 1/6 GPU memory of prior state-of-the-art.

PAGaS: Pixel-Aligned 1DoF Gaussian Splatting for Depth Refinement

cs.CV · 2026-04-24 · unverdicted · novelty 7.0

PAGaS refines multi-view stereo depths by optimizing 1DoF Gaussians whose positions and sizes are fixed by back-projected pixel volumes, producing detailed depth maps that outperform reference baselines on 3D reconstruction benchmarks.

TokenGS: Decoupling 3D Gaussian Prediction from Pixels with Learnable Tokens

cs.CV · 2026-04-16 · unverdicted · novelty 7.0

TokenGS uses learnable Gaussian tokens in an encoder-decoder architecture to regress 3D means directly, achieving SOTA feed-forward reconstruction on static and dynamic scenes with better robustness.

ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction

cs.CV · 2026-04-15 · unverdicted · novelty 7.0

ClipGStream enables scalable flicker-free reconstruction of long dynamic multi-view videos by performing stream optimization at the clip level with clip-independent spatio-temporal fields, residual anchor compensation, and inter-clip inherited anchors.

Neural 3D Reconstruction of Planetary Surfaces from Descent-Phase Wide-Angle Imagery

cs.CV · 2026-04-14 · unverdicted · novelty 7.0

A novel explicit neural height field method for descent-phase wide-angle imagery achieves greater spatial coverage than multi-view stereo while preserving estimation accuracy on simulated planetary terrains.

DreamStereo: Towards Real-Time Stereo Inpainting for HD Videos

cs.CV · 2026-04-14 · unverdicted · novelty 7.0

DreamStereo uses GAPW, PBDP, and SASI to enable real-time stereo video inpainting at 25 FPS for HD videos by reducing over 70% redundant computation while maintaining quality.

AnchorSplat: Feed-Forward 3D Gaussian Splatting with 3D Geometric Priors

cs.CV · 2026-04-08 · unverdicted · novelty 7.0

AnchorSplat uses anchor-aligned 3D Gaussians guided by geometric priors for feed-forward scene reconstruction, achieving SOTA novel view synthesis on ScanNet++ with fewer primitives and better view consistency.

AvatarPointillist: AutoRegressive 4D Gaussian Avatarization

cs.CV · 2026-04-06 · unverdicted · novelty 7.0

AvatarPointillist autoregressively generates adaptive 3D point clouds via Transformer for photorealistic 4D Gaussian avatars from one image, jointly predicting animation bindings and using a conditioned Gaussian decoder.

Learning 3D Reconstruction with Priors in Test Time

cs.CV · 2026-04-04 · unverdicted · novelty 7.0

Test-time constrained optimization incorporates priors into pre-trained multiview transformers via self-supervised losses and penalty terms to improve 3D reconstruction accuracy.

THOM: Generating Physically Plausible Hand-Object Meshes From Text

cs.CV · 2026-04-03 · unverdicted · novelty 7.0

THOM is a training-free two-stage framework that generates physically plausible hand-object 3D meshes directly from text by combining text-guided Gaussians with contact-aware physics optimization and VLM refinement.

ProDiG: Progressive Diffusion-Guided Gaussian Splatting for Aerial to Ground Reconstruction

cs.CV · 2026-04-02 · unverdicted · novelty 7.0

ProDiG progressively transforms aerial Gaussian splats into coherent ground-level 3D reconstructions via diffusion guidance and specialized attention modules.

Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping

cs.CV · 2026-02-25 · unverdicted · novelty 7.0

MoGaF groups Gaussians by motion in 4D splatting representations to enable stable long-term forecasting of dynamic scenes.

PerpetualWonder: Long-Horizon Action-Conditioned 4D Scene Generation

cs.CV · 2026-02-04 · unverdicted · novelty 7.0

PerpetualWonder introduces a closed-loop generative simulator with a unified physical-visual representation for long-horizon action-conditioned 4D scene generation from one image.

AGILE: Hand-Object Interaction Reconstruction from Video via Agentic Generation

cs.CV · 2026-02-04 · unverdicted · novelty 7.0

AGILE generates complete object meshes via VLM-guided synthesis and tracks poses with anchor-and-track plus contact-aware optimization to achieve robust hand-object reconstruction from video.

ART: Articulated Reconstruction Transformer

cs.CV · 2025-12-16 · unverdicted · novelty 7.0

ART is a category-agnostic transformer that maps sparse multi-state RGB images to per-part 3D geometry, texture, and articulation parameters via learnable part slots.

RDSplat: Robust Watermarking for 3D Gaussian Splatting Against 2D and 3D Diffusion Editing

cs.CV · 2025-12-07 · conditional · novelty 7.0

RDSplat is the first 3D Gaussian Splatting watermarking method that maintains 0.701 bit accuracy against both 2D and 3D diffusion editing by embedding only in low-frequency primitives selected via FAPS.

Z-Order Transformer for Feed-Forward Gaussian Splatting

cs.CV · 2026-05-13 · unverdicted · novelty 6.0

A Z-order transformer organizes unstructured Gaussians for sparse attention, enabling feed-forward prediction of high-quality 3D splats with fewer primitives.

HumanSplatHMR: Closing the Loop Between Human Mesh Recovery and Gaussian Splatting Avatar

cs.CV · 2026-05-04 · unverdicted · novelty 6.0 · 2 refs

HumanSplatHMR jointly refines 3D human poses and learns Gaussian Splatting avatars by backpropagating photometric, segmentation, and depth losses through a differentiable renderer to improve novel-view and novel-pose synthesis from in-the-wild video.

High-Fidelity Mobile Avatars with Pruned Local Blendshapes

cs.CV · 2026-05-03 · unverdicted · novelty 6.0

Pruned local linear blendshapes on Gaussians capture pose-dependent appearance changes to deliver high-quality mobile avatars at 120 FPS from multi-view video without pretrained models.

Multi-Scale Gaussian-Language Map for Zero-shot Embodied Navigation and Reasoning

cs.CV · 2026-05-03 · unverdicted · novelty 6.0

GLMap combines explicit 3D Gaussians with multi-scale language semantics in a dual-modality structure and uses an analytical Gaussian Estimator for incremental map building, improving zero-shot performance on navigation and reasoning tasks.

Generalizable Sparse-View 3D Reconstruction from Unconstrained Images

cs.CV · 2026-04-30 · unverdicted · novelty 6.0

GenWildSplat is a feed-forward model that reconstructs 3D Gaussians from sparse unposed unconstrained images by predicting depth and poses with learned priors, an appearance adapter, and semantic segmentation for transients.

Color-Encoded Illumination for High-Speed Volumetric Scene Reconstruction

cs.CV · 2026-04-29 · unverdicted · novelty 6.0

Color-encoded illumination combined with dynamic Gaussian Splatting enables first-of-a-kind high-speed volumetric reconstruction from unaugmented low-speed multi-view cameras.

Generalizable Human Gaussian Splatting via Multi-view Semantic Consistency

cs.CV · 2026-04-28 · unverdicted · novelty 6.0

Unprojecting latent embeddings via depth maps and recalibrating with cross-view attention improves 3D Gaussian localization for generalizable sparse-view human rendering.

citing papers explorer

Showing 16 of 16 citing papers after filters.

ART: Articulated Reconstruction Transformer cs.CV · 2025-12-16 · unverdicted · none · ref 24
ART is a category-agnostic transformer that maps sparse multi-state RGB images to per-part 3D geometry, texture, and articulation parameters via learnable part slots.
RDSplat: Robust Watermarking for 3D Gaussian Splatting Against 2D and 3D Diffusion Editing cs.CV · 2025-12-07 · conditional · none · ref 26
RDSplat is the first 3D Gaussian Splatting watermarking method that maintains 0.701 bit accuracy against both 2D and 3D diffusion editing by embedding only in low-frequency primitives selected via FAPS.
GaussianDWM: 3D Gaussian Driving World Model for Unified Scene Understanding and Multi-Modal Generation cs.CV · 2025-12-29 · unverdicted · none · ref 25
GaussianDWM uses 3D Gaussians with embedded linguistic features, language-guided sampling, and dual-condition generation for unified scene understanding and multi-modal output in driving world models.
Chorus: Multi-Teacher Pretraining for Holistic 3D Gaussian Scene Encoding cs.CV · 2025-12-19 · unverdicted · none · ref 22
Chorus pretrains a shared 3D Gaussian scene encoder via multi-teacher distillation to capture holistic features from high-level semantics to fine-grained structure, with strong transfer on segmentation and point-cloud tasks using far fewer scenes.
FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision cs.CV · 2025-12-17 · unverdicted · none · ref 22
FlexAvatar introduces bias sinks in a transformer to unify monocular and multi-view training, yielding complete 3D head avatars with strong generalization and view extrapolation from single images.
Native and Compact Structured Latents for 3D Generation cs.CV · 2025-12-16 · unverdicted · none · ref 27
Introduces O-Voxel omni-voxel representation and Sparse Compression VAE for structured native 3D latents, enabling efficient training of large flow-matching models that produce higher-quality geometry and materials than prior methods.
From Orbit to Ground: Generative City Photogrammetry from Extreme Off-Nadir Satellite Images cs.CV · 2025-12-08 · unverdicted · none · ref 22
A technique reconstructs large urban areas from sparse extreme off-nadir satellite images by modeling geometry as a Z-monotonic 2.5D height map SDF and applying a generative network to restore plausible textures on the resulting mesh.
C3G: Learning Compact 3D Representations with 2K Gaussians cs.CV · 2025-12-03 · unverdicted · none · ref 30
C3G creates compact 3D Gaussian representations with 2K points by guiding placement via learnable tokens that aggregate multi-view features through attention, yielding better efficiency and performance than dense methods.
ShelfGaussian: Shelf-Supervised Open-Vocabulary Gaussian-based 3D Scene Understanding cs.CV · 2025-12-03 · unverdicted · none · ref 37
ShelfGaussian achieves state-of-the-art zero-shot semantic occupancy prediction on Occ3D-nuScenes by jointly supervising Gaussian representations with vision foundation model features at 2D image and 3D scene levels.
FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting cs.CV · 2025-11-28 · unverdicted · none · ref 17
FACT-GS allocates higher texture sampling density to high-frequency areas in 2D Gaussian Splatting through a learnable deformation field, recovering sharper details at the same parameter budget.
GRLoc: Geometric Representation Regression for Visual Localization cs.CV · 2025-11-17 · unverdicted · none · ref 28
The paper reformulates absolute pose regression as regressing disentangled world-coordinate raymaps and pointmaps from images, then recovering pose via a differentiable solver, claiming SOTA results on 7-Scenes and Cambridge Landmarks.
MedGS: Gaussian Splatting for Multi-Modal 3D Medical Imaging cs.CV · 2025-09-20 · conditional · none · ref 21
MedGS extends Gaussian Splatting with a relightable model tailored to endoscopic imaging where light and camera are co-located, achieving better novel-view synthesis and tissue editing than baselines.
SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model cs.CV · 2025-11-27 · unverdicted · none · ref 13
A sparse transformer predicts multi-frame 3D occupancy from images without BEV or VAE tokenization and reports SOTA results on nuScenes for 1-3s forecasting under arbitrary trajectories.
MetroGS: Efficient and Stable Reconstruction of Geometrically Accurate High-Fidelity Large-Scale Scenes cs.CV · 2025-11-24 · unverdicted · none · ref 22
MetroGS combines distributed 2D Gaussian Splatting with structured dense enhancement, progressive hybrid optimization, and depth-guided appearance modeling to deliver higher geometric accuracy and stability in large-scale urban reconstruction.
SING3R-SLAM: Submap-based Indoor Monocular Gaussian SLAM with 3D Reconstruction Priors cs.CV · 2025-11-21 · unverdicted · none · ref 11
SING3R-SLAM adds submap-level global alignment and reconstruction priors to a Gaussian map to reduce drift and improve local geometry in monocular indoor SLAM.
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling cs.CV · 2025-12-16 · unreviewed · ref 25

3d gaussian splatting for real-time radiance field rendering.ACM Trans

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer