Ba-net: Dense bundle ad- justment network

Chengzhou Tang, Ping Tan · 2019 · arXiv 1806.04807

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

read on arXiv browse 8 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Accelerating Transformer-Based Monocular SLAM via Geometric Utility Scoring

cs.CV · 2026-04-09 · unverdicted · novelty 7.0

LeanGate is a lightweight feed-forward network that predicts geometric utility scores to skip over 90% of redundant frames in GFM-based monocular SLAM, reducing tracking FLOPs by 85% and achieving 5x speedup while maintaining accuracy.

Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

Tango3D unifies dense pixel-to-point 2D-3D alignment and global retrieval in one shared space using a geometry-aware 2D backbone, 3D VAE tokens, and three-stage progressive training.

Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

Scal3R achieves better accuracy and consistency in large-scale 3D scene reconstruction by maintaining a compressed global context through test-time adaptation of lightweight neural networks on long video sequences.

PAGE-4D: VGGT-4D Perception via Disentangled Pose and Geometry Estimation

cs.CV · 2025-10-20 · unverdicted · novelty 6.0

PAGE-4D is a feedforward extension of VGGT that uses a dynamics-aware aggregator and mask to disentangle pose estimation from geometry reconstruction in videos with moving objects.

Efficient 3D Content Reconstruction and Generation

cs.CV · 2026-05-18 · unverdicted · novelty 5.0

Presents Instant3D for rapid text/image-to-3D generation via multi-view diffusion plus feed-forward reconstruction, and FastMap for 10x faster structure-from-motion with comparable accuracy.

TTT3R: 3D Reconstruction as Test-Time Training

cs.CV · 2025-09-30 · unverdicted · novelty 5.0

TTT3R derives a closed-form learning rate from memory-observation alignment confidence to boost length generalization in RNN-based 3D reconstruction by 2x in global pose estimation.

MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion

cs.CV · 2024-10-04 · unverdicted · novelty 5.0

By fine-tuning DUST3R to output per-timestep pointmaps on scarce dynamic video datasets, MonST3R achieves stronger video depth and pose estimation without explicit motion modeling.

VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences

cs.CV · 2025-07-22 · conditional · novelty 4.0

VGGT-Long extends VGGT with chunking, overlap alignment, and loop closure to produce consistent kilometer-scale 3D reconstructions from monocular RGB sequences without retraining or extra supervision.

citing papers explorer

Showing 8 of 8 citing papers.

Accelerating Transformer-Based Monocular SLAM via Geometric Utility Scoring cs.CV · 2026-04-09 · unverdicted · none · ref 35
LeanGate is a lightweight feed-forward network that predicts geometric utility scores to skip over 90% of redundant frames in GFM-based monocular SLAM, reducing tracking FLOPs by 85% and achieving 5x speedup while maintaining accuracy.
Tango3D: Towards Alignment for Global and Local 2D-3D Correspondence cs.CV · 2026-05-19 · unverdicted · none · ref 30
Tango3D unifies dense pixel-to-point 2D-3D alignment and global retrieval in one shared space using a geometry-aware 2D backbone, 3D VAE tokens, and three-stage progressive training.
Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction cs.CV · 2026-04-09 · unverdicted · none · ref 68
Scal3R achieves better accuracy and consistency in large-scale 3D scene reconstruction by maintaining a compressed global context through test-time adaptation of lightweight neural networks on long video sequences.
PAGE-4D: VGGT-4D Perception via Disentangled Pose and Geometry Estimation cs.CV · 2025-10-20 · unverdicted · none · ref 13
PAGE-4D is a feedforward extension of VGGT that uses a dynamics-aware aggregator and mask to disentangle pose estimation from geometry reconstruction in videos with moving objects.
Efficient 3D Content Reconstruction and Generation cs.CV · 2026-05-18 · unverdicted · none · ref 240
Presents Instant3D for rapid text/image-to-3D generation via multi-view diffusion plus feed-forward reconstruction, and FastMap for 10x faster structure-from-motion with comparable accuracy.
TTT3R: 3D Reconstruction as Test-Time Training cs.CV · 2025-09-30 · unverdicted · none · ref 77
TTT3R derives a closed-form learning rate from memory-observation alignment confidence to boost length generalization in RNN-based 3D reconstruction by 2x in global pose estimation.
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion cs.CV · 2024-10-04 · unverdicted · none · ref 147
By fine-tuning DUST3R to output per-timestep pointmaps on scarce dynamic video datasets, MonST3R achieves stronger video depth and pose estimation without explicit motion modeling.
VGGT-Long: Chunk it, Loop it, Align it -- Pushing VGGT's Limits on Kilometer-scale Long RGB Sequences cs.CV · 2025-07-22 · conditional · none · ref 31
VGGT-Long extends VGGT with chunking, overlap alignment, and loop closure to produce consistent kilometer-scale 3D reconstructions from monocular RGB sequences without retraining or extra supervision.

Ba-net: Dense bundle ad- justment network

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer