arXiv preprint arXiv:1812.04605

2018 · arXiv 1812.04605

8 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Improved monocular depth prediction using distance transform over pre-semantic contours with self-supervised neural networks

eess.IV · 2026-05-08 · unverdicted · novelty 7.0

Self-supervised monocular depth estimation improves in low-texture regions by using distance transforms on jointly estimated pre-semantic contours to create more informative loss signals.

Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

Tango3D unifies dense pixel-to-point 2D-3D alignment and global retrieval in one shared space using a geometry-aware 2D backbone, 3D VAE tokens, and three-stage progressive training.

Depth Anything 3: Recovering the Visual Space from Any Views

cs.CV · 2025-11-13 · unverdicted · novelty 6.0

DA3 recovers consistent visual geometry from arbitrary views via a vanilla DINO transformer and depth-ray target, setting new SOTA on a visual geometry benchmark while outperforming DA2 on monocular depth.

Flow4DGS-SLAM: Optical Flow-Guided 4D Gaussian Splatting SLAM

cs.CV · 2026-04-24 · unverdicted · novelty 5.0

Flow4DGS-SLAM uses optical flow to generate motion masks, initialize poses, and guide 4D Gaussian modeling with scene flow and GMM for temporal properties, claiming SOTA results in dynamic tracking and reconstruction.

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

cs.CV · 2024-10-04 · unverdicted · novelty 5.0

By fine-tuning DUSt3R to output per-timestep pointmaps on scarce dynamic video datasets, MonST3R achieves stronger video depth and pose estimation without explicit motion modeling.

VGGT-SLAM++

cs.CV · 2026-04-08 · unverdicted · novelty 4.0

VGGT-SLAM++ improves on prior transformer SLAM by adding dense DEM submap graphs and high-cadence local optimization, achieving SOTA accuracy with reduced drift and bounded memory on benchmarks.

VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences

cs.CV · 2025-07-22 · conditional · novelty 4.0

VGGT-Long extends VGGT with chunking, overlap alignment, and loop closure to produce consistent kilometer-scale 3D reconstructions from monocular RGB sequences without retraining or extra supervision.

PRISM-SLAM: Probabilistic Ray-Grounded Inference for Scale-aware Metric SLAM

cs.RO · 2026-05-19

citing papers explorer


Improved monocular depth prediction using distance transform over pre-semantic contours with self-supervised neural networks

eess.IV · 2026-05-08 · unverdicted · none · ref 70

Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence

cs.CV · 2026-05-19 · unverdicted · none · ref 33

Depth Anything 3: Recovering the Visual Space from Any Views

cs.CV · 2025-11-13 · unverdicted · none · ref 86

Flow4DGS-SLAM: Optical Flow-Guided 4D Gaussian Splatting SLAM

cs.CV · 2026-04-24 · unverdicted · none · ref 34

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

cs.CV · 2024-10-04 · unverdicted · none · ref 148

VGGT-SLAM++

cs.CV · 2026-04-08 · unverdicted · none · ref 75

VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences

cs.CV · 2025-07-22 · conditional · none · ref 32

PRISM-SLAM: Probabilistic Ray-Grounded Inference for Scale-aware Metric SLAM

cs.RO · 2026-05-19 · unreviewed · ref 42
