Bundle Adjustment in the Eager Mode
Pith reviewed 2026-05-23 20:51 UTC · model grok-4.3
The pith
A PyTorch eager-mode bundle adjustment library achieves average speedups of 18.5x to 23x on GPU over GTSAM, g2o, and Ceres.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Our eager-mode BA on GPU demonstrates substantial runtime efficiency, achieving an average speedup of 18.5×, 22×, and 23× across all benchmarks compared to GTSAM, g²o, and Ceres, respectively, by means of a sparsity-aware auto-differentiation design and GPU-accelerated sparse operations designed for 2nd-order optimization.
What carries the argument
Sparsity-aware auto-differentiation design combined with GPU-accelerated sparse operations for second-order optimization inside PyTorch.
If this is right
- Bundle adjustment can be called directly from PyTorch code without data transfer to external C++ libraries.
- Robotic perception systems gain faster second-order optimization on GPU hardware for tasks such as SLAM.
- Implementation and debugging of bundle adjustment become simpler within the PyTorch ecosystem.
- Second-order methods for camera pose and landmark estimation become more accessible to deep learning workflows.
Where Pith is reading between the lines
- The integration opens the possibility of embedding bundle adjustment as a differentiable layer inside larger neural networks for joint training.
- Similar sparsity handling could be applied to other second-order problems that arise in robotics beyond pure bundle adjustment.
- The reported speedups suggest that real-time bundle adjustment on embedded GPU devices becomes more feasible when paired with learned components.
Load-bearing premise
A sparsity-aware auto-differentiation design realized in PyTorch can deliver the reported GPU speedups while preserving the same numerical correctness and convergence behavior as the C++ solvers.
What would settle it
Running identical benchmark problems on the same GPU hardware and finding that the PyTorch version either converges to different parameter values or takes longer wall-clock time than GTSAM, g2o, or Ceres.
Figures
read the original abstract
Bundle adjustment (BA) is a critical technique in various robotic applications such as simultaneous localization and mapping (SLAM), augmented reality (AR), and photogrammetry. BA optimizes parameters such as camera poses and 3D landmarks to align them with observations. With the growing importance of deep learning in perception systems, there is an increasing need to integrate BA with deep learning frameworks for enhanced reliability and performance. However, widely-used C++-based BA libraries, such as GTSAM, g$^2$o, and Ceres Solver, lack native integration with modern deep learning libraries like PyTorch. This limitation affects their flexibility, ease of debugging, and overall implementation efficiency. To address this gap, we introduce an eager-mode BA library seamlessly integrated with PyTorch with high efficiency. Our approach includes a sparsity-aware auto-differentiation design and GPU-accelerated sparse operations designed for 2nd-order optimization. Our eager-mode BA on GPU demonstrates substantial runtime efficiency, achieving an average speedup of 18.5$\times$, 22$\times$, and 23$\times$ across all benchmarks compared to GTSAM, g$^2$o, and Ceres, respectively.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces an eager-mode bundle adjustment (BA) library integrated with PyTorch. It employs a sparsity-aware auto-differentiation design and GPU-accelerated sparse operations for second-order optimization, claiming average speedups of 18.5× versus GTSAM, 22× versus g²o, and 23× versus Ceres across all benchmarks.
Significance. If the reported speedups are achieved while preserving numerical equivalence and convergence behavior to the C++ baselines, the work would enable tighter integration of BA with deep-learning pipelines in robotics and vision, improving flexibility for end-to-end systems in SLAM and AR.
major comments (2)
- [Abstract] Abstract: the central speedup claims (18.5×, 22×, 23×) are presented without any accompanying implementation details, benchmark descriptions, accuracy metrics, error analysis, or convergence criteria, so it is impossible to determine whether the data or design supports the numbers.
- [Approach (inferred from abstract claims)] The manuscript assumes a sparsity-aware autodiff design in eager-mode PyTorch combined with GPU sparse linear algebra can realize the claimed wall-clock gains and numerically equivalent iterates; no description of the custom kernels, Schur-complement handling, or precision safeguards is supplied to substantiate this premise.
minor comments (1)
- [Abstract] Notation for the g²o library is inconsistent (g$^2$o in the abstract).
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address the two major comments below and will revise the manuscript to improve clarity on the abstract claims and technical details of the approach.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central speedup claims (18.5×, 22×, 23×) are presented without any accompanying implementation details, benchmark descriptions, accuracy metrics, error analysis, or convergence criteria, so it is impossible to determine whether the data or design supports the numbers.
Authors: We agree the abstract is concise and omits these supporting elements due to length limits. The manuscript body (Experiments section) contains the benchmark descriptions, accuracy metrics, error analysis, and convergence criteria, along with evidence of numerical equivalence to the C++ baselines. We will revise the abstract to briefly reference the benchmark setup and equivalence results. revision: yes
-
Referee: [Approach (inferred from abstract claims)] The manuscript assumes a sparsity-aware autodiff design in eager-mode PyTorch combined with GPU sparse linear algebra can realize the claimed wall-clock gains and numerically equivalent iterates; no description of the custom kernels, Schur-complement handling, or precision safeguards is supplied to substantiate this premise.
Authors: Section 3 of the manuscript outlines the sparsity-aware autodiff design and GPU sparse operations. To directly address the concern, we will expand this section in the revision with additional details on the custom kernels, Schur-complement implementation, and precision safeguards that maintain numerical equivalence and convergence behavior. revision: yes
Circularity Check
No circularity: engineering implementation with external benchmarks
full rationale
The paper presents a PyTorch-based eager-mode BA implementation using sparsity-aware autodiff and GPU sparse ops, with performance claims benchmarked against independent external solvers (GTSAM, g²o, Ceres). No equations, derivations, or fitted parameters are present that reduce to self-definitions or self-citations. The central claims rest on reported wall-clock measurements rather than any load-bearing mathematical chain, satisfying the self-contained criterion.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption PyTorch autograd and GPU sparse matrix operations can be extended to support efficient second-order bundle adjustment without loss of correctness
Reference graph
Works this paper leans on
-
[1]
Vr-gs: A physical dynamics-aware interactive gaussian splatting system in virtual reality,
Y . Jiang, C. Yu, T. Xie, X. Li, Y . Feng, H. Wang, M. Li, H. Lau, F. Gao, Y . Yang, and C. Jiang, “VR-GS: A physical dynamics-aware interactive gaussian splatting system in virtual reality,” arXiv preprint arXiv:2401.16663, 2024
-
[2]
Detector-free structure from motion,
X. He, J. Sun, Y . Wang, S. Peng, Q. Huang, H. Bao, and X. Zhou, “Detector-free structure from motion,” CVPR, 2024
work page 2024
-
[3]
AirSLAM: An efficient and illumination-robust point-line visual slam system,
K. Xu, Y . Hao, S. Yuan, C. Wang, and L. Xie, “AirSLAM: An efficient and illumination-robust point-line visual slam system,” IEEE Transactions on Robotics (T-RO) , 2025. [Online]. Available: https://arxiv.org/abs/2408.03520
-
[4]
iMatching: Imperative correspondence learning,
Z. Zhan, D. Gao, Y .-J. Lin, Y . Xia, and C. Wang, “iMatching: Imperative correspondence learning,” in European Conference on Computer Vision (ECCV), 2024. [Online]. Available: https://arxiv.org/abs/2312.02141
-
[5]
Droid-slam: Deep visual slam for monocular, stereo, and rgb-d cameras,
Z. Teed and J. Deng, “Droid-slam: Deep visual slam for monocular, stereo, and rgb-d cameras,” Advances in Neural Information Processing Systems, vol. 34, pp. 16 558–16 569, 2021
work page 2021
-
[6]
Imperative learning: A self-supervised neuro- symbolic learning framework for robot autonomy,
C. Wang, K. Ji, J. Geng, Z. Ren, T. Fu, F. Yang, Y . Guo, H. He, X. Chen, Z. Zhan, Q. Du, S. Su, B. Li, Y . Qiu, Y . Du, Q. Li, Y . Yang, X. Lin, and Z. Zhao, “Imperative learning: A self-supervised neuro- symbolic learning framework for robot autonomy,” The International Journal of Robotics Research (IJRR) , 2025. [Online]. Available: https://arxiv.org/a...
-
[7]
T. Fu, S. Su, Y . Lu, and C. Wang, “iSLAM: Imperative SLAM,” IEEE Robotics and Automation Letters (RA-L) , 2024. [Online]. Available: https://arxiv.org/abs/2306.07894
-
[8]
Vggt: Visual geometry grounded transformer,
J. Wang, M. Chen, N. Karaev, A. Vedaldi, C. Rupprecht, and D. Novotny, “Vggt: Visual geometry grounded transformer,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2025
work page 2025
-
[9]
J. Ansel, E. Yang, H. He, N. Gimelshein, A. Jain, M. V oznesensky, B. Bao, P. Bell, D. Berard, E. Burovski, G. Chauhan, A. Chourdia, W. Constable, A. Desmaison, Z. DeVito, E. Ellison, W. Feng, J. Gong, M. Gschwind, B. Hirsh, S. Huang, K. Kalambarkar, L. Kirsch, M. Lazos, M. Lezcano, Y . Liang, J. Liang, Y . Lu, C. K. Luk, B. Maher, Y . Pan, C. Puhrsch, M....
-
[10]
PyTorch: An Imperative Style, High- Performance Deep Learning Library,
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala, “PyTorch: An Imperative Style, High- Performance Deep Learning Library,” inAdvances in Neural Information Processing S...
work page 2019
-
[11]
TensorFlow: A system for large-scale machine learning
M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, M. Kudlur, J. Levenberg, R. Monga, S. Moore, D. G. Murray, B. Steiner, P. A. Tucker, V . Vasudevan, P. Warden, M. Wicke, Y . Yu, and X. Zhang, “Tensorflow: A system for large-scale machine learning,” CoRR, vol. abs/1605.08695, 2016. [Online]. Available: h...
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[12]
The state of machine learning frameworks in 2019,
H. He, “The state of machine learning frameworks in 2019,” The Gradient, 2019
work page 2019
-
[13]
F. Dellaert and Contributors, “borglab/gtsam,” May 2022. [Online]. Available: https://github.com/borglab/gtsam
work page 2022
- [14]
-
[15]
Available: https://github.com/ceres-solver/ceres-solver
[Online]. Available: https://github.com/ceres-solver/ceres-solver
-
[16]
G2o: A general framework for graph optimization,
R. K ¨ummerle, G. Grisetti, H. Strasdat, K. Konolige, and W. Burgard, “G2o: A general framework for graph optimization,” in IEEE Int. Conf. on Robotics and Automation (ICRA) , 06 2011, pp. 3607 – 3613
work page 2011
-
[17]
Theseus: A Library for Differentiable Nonlinear Optimization,
L. Pineda, T. Fan, M. Monge, S. Venkataraman, P. Sodhi, R. T. Chen, J. Ortiz, D. DeTone, A. Wang, S. Anderson, J. Dong, B. Amos, and M. Mukadam, “Theseus: A Library for Differentiable Nonlinear Optimization,” Advances in Neural Information Processing Systems , 2022
work page 2022
-
[18]
Robust bundle adjustment revisited,
C. Zach, “Robust bundle adjustment revisited,” in Computer Vision – ECCV 2014 , D. Fleet, T. Pajdla, B. Schiele, and T. Tuytelaars, Eds. Cham: Springer International Publishing, 2014, pp. 772–787
work page 2014
-
[19]
A micro lie theory for state estimation in robotics,
J. Sol `a, J. Deray, and D. Atchuthan, “A micro lie theory for state estimation in robotics,” 2021. [Online]. Available: https: //arxiv.org/abs/1812.01537
-
[20]
Bundle adjustment in the large,
S. Agarwal, N. Snavely, S. M. Seitz, and R. Szeliski, “Bundle adjustment in the large,” in European Conference on Computer Vision (ECCV) . Springer, 2010, pp. 29–42
work page 2010
-
[21]
PyPose v0.6: The imperative programming interface for robotics,
Z. Zhan, X. Li, Q. Li, H. He, A. Pandey, H. Xiao, Y . Xu, X. Chen, K. Xu, K. Cao, Z. Zhao, Z. Wang, H. Xu, Z. Fang, Y . Chen, W. Wang, X. Fang, Y . Du, T. Wu, X. Lin, Y . Qiu, F. Yang, J. Shi, S. Su, Y . Lu, T. Fu, K. Dantu, J. Wu, L. Xie, M. Hutter, L. Carlone, S. Scherer, D. Huang, Y . Hu, J. Geng, and C. Wang, “PyPose v0.6: The imperative programming i...
-
[22]
Tensorflow: A system for large-scale machine learning,
M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, M. Kudlur, J. Levenberg, R. Monga, S. Moore, D. G. Murray, B. Steiner, P. Tucker, V . Vasudevan, P. Warden, M. Wicke, Y . Yu, and X. Zheng, “Tensorflow: A system for large-scale machine learning,” in Proceedings of the 12th USENIX Symposium on Operating S...
work page 2016
-
[23]
TensorFlow Eager: A Multi-Stage, Python-Embedded DSL for Machine Learning
A. Agrawal, A. N. Modi, A. Passos, A. Lavoie, A. Agarwal, A. Shankar, I. Ganichev, J. Levenberg, M. Hong, R. Monga, and S. Cai, “Tensorflow eager: A multi-stage, python-embedded dsl for machine learning,” 2019. [Online]. Available: https://arxiv.org/abs/1903.01855
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[24]
PyPose: A library for robot learning with physics-based optimization,
C. Wang, D. Gao, K. Xu, J. Geng, Y . Hu, Y . Qiu, B. Li, F. Yang, B. Moon, A. Pandey, Aryan, J. Xu, T. Wu, H. He, D. Huang, Z. Ren, S. Zhao, T. Fu, P. Reddy, X. Lin, W. Wang, J. Shi, R. Talak, K. Cao, Y . Du, H. Wang, H. Yu, S. Wang, S. Chen, A. Kashyap, R. Bandaru, K. Dantu, J. Wu, L. Xie, L. Carlone, M. Hutter, and S. Scherer, “PyPose: A library for rob...
-
[25]
▽slam: Dense slam meets automatic differentiation,
K. M. Jatavallabhula, G. Iyer, and L. Paull, “ ▽slam: Dense slam meets automatic differentiation,” in 2020 IEEE International Conference on Robotics and Automation (ICRA) . IEEE, 2020, pp. 2130–2137. 17
work page 2020
-
[26]
J. Huang, S. Huang, and M. Sun, “Deeplm: Large-scale nonlinear least squares on deep learning frameworks using stochastic domain decomposition,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2021, pp. 10 308–10 317
work page 2021
-
[27]
OpenMP application program interface version 3.0,
OpenMP Architecture Review Board, “OpenMP application program interface version 3.0,” May 2008. [Online]. Available: http://www. openmp.org/mp-documents/spec30.pdf
work page 2008
-
[28]
C. Wu, S. Agarwal, B. Curless, and S. M. Seitz, “Multicore bundle adjustment,” in Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition , ser. CVPR ’11. USA: IEEE Computer Society, 2011, p. 3057–3064. [Online]. Available: https://doi.org/10.1109/CVPR.2011.5995552
-
[29]
Decentralization and acceleration enables large-scale bundle adjustment,
T. Fan, J. Ortiz, M. Hsiao, M. Monge, J. Dong, T. Murphey, and M. Mukadam, “Decentralization and acceleration enables large-scale bundle adjustment,” arXiv:2305.07026, 2023
-
[30]
C. Wang, K. M. Jatavallabhula, and M. Mukadam, Differentiable Optimization . Cambridge University Press, 2025. [Online]. Available: https://github.com/SLAM-Handbook-contributors/ slam-handbook-public-release/
work page 2025
-
[31]
pypose.optim.levenbergmarquardt,
“pypose.optim.levenbergmarquardt,” https://pypose.org/docs/main/ generated/pypose.optim.LevenbergMarquardt/
-
[32]
M. Zheng, N. Chen, J. Zhu, X. Zeng, H. Qiu, Y . Jiang, X. Lu, and H. Qu, “Distributed bundle adjustment with block-based sparse matrix compression for super large scale datasets,” in IEEE/CVF International Conference on Computer Vision (ICCV) , 2023. [Online]. Available: https://arxiv.org/abs/2307.08383
-
[33]
PyTorch sparse bsr tensor documentation,
“PyTorch sparse bsr tensor documentation,” https://pytorch.org/docs/ stable/sparse.html#sparse-bsr-tensor, 2024, accessed: 2024-09-12
work page 2024
-
[34]
Sparse Matrix - Coordinate List (COO),
“Sparse Matrix - Coordinate List (COO),” https://en.wikipedia.org/wiki/ Sparse matrix#Coordinate list (COO), 2024, accessed: 2024-09-12
work page 2024
-
[35]
“Tensor Indexing API,” https://pytorch.org/cppdocs/notes/tensor indexing.html, 2024, accessed: 2024-09-13
work page 2024
-
[36]
Very high-speed computing systems,
M. Flynn, “Very high-speed computing systems,” Proceedings of the IEEE, vol. 54, no. 12, pp. 1901–1909, 1966
work page 1901
-
[37]
Automatic differentiation with torch.autograd,
“Automatic differentiation with torch.autograd,” https://pytorch.org/ tutorials/beginner/basics/autogradqs tutorial.html, accessed: 2024-09- 12
work page 2024
-
[38]
PyTorch Sparse CSR Tensor documentation,
“PyTorch Sparse CSR Tensor documentation,” https://pytorch.org/docs/ stable/sparse.html#sparse-csr-tensor, 2024, accessed: 2024-09-12
work page 2024
-
[39]
Two fast algorithms for sparse matrices: Multiplication and permuted transposition,
F. G. Gustavson, “Two fast algorithms for sparse matrices: Multiplication and permuted transposition,” ACM Trans. Math. Softw., vol. 4, no. 3, p. 250–269, sep 1978. [Online]. Available: https://doi.org/10.1145/355791. 355796
-
[40]
Optimizing sparse matrix–matrix multiplication for the gpu,
S. Dalton, N. Bell, and L. N. Olson, “Optimizing sparse matrix–matrix multiplication for the gpu,” ACM Transactions on Mathematical Soft- ware, vol. 41, no. 4, pp. 1–20, 2015
work page 2015
-
[41]
NVIDIA Corporation, “cusparse library,” https://docs.nvidia.com/cuda/ cusparse/index.html, 2024, accessed: 2024-09-13
work page 2024
-
[42]
Warp: A high-performance python framework for gpu simulation and graphics,
M. Macklin, “Warp: A high-performance python framework for gpu simulation and graphics,” https://github.com/nvidia/warp, March 2022, NVIDIA GPU Technology Conference (GTC)
work page 2022
-
[43]
J. H. Wilkinson and C. B. Moler, Matrix computations . GBR: John Wiley and Sons Ltd., 2003, p. 1103–1109
work page 2003
-
[44]
Triton Contributors, “Triton language and compiler,” https://github.com/ triton-lang/triton, 2024, accessed: 2024-09-13
work page 2024
-
[45]
A.-L. Cholesky, “Note sur une m ´ethode de r ´esolution des ´equations normales provenant de l’application de la m ´ethode des moindres carr ´es a un syst `eme d’ ´equations lin ´eaires en nombre inf ´erieur a celui des inconnues. —application de la m ´ethode a la r ´esolution d’un syst `eme defini d’´equations lin ´eaires,” Bulletin g ´eod´esique, vol. 2...
-
[46]
cudss: A high-performance direct linear solver library,
NVIDIA Corporation, “cudss: A high-performance direct linear solver library,” 2025, accessed: 2025-06-15. [Online]. Available: https://docs.nvidia.com/cuda/cudss/
work page 2025
-
[47]
Block preconditioning for the conjugate gradient method,
P. Concus, G. Golub, and G. Meurant, “Block preconditioning for the conjugate gradient method,” LBL Publications , no. LBL-14856, 1982. [Online]. Available: https://escholarship.org/uc/item/0j60b61v
work page 1982
-
[48]
NVIDIA Corporation, “Cuda graphs,” 2025, accessed: 2025-06-15. [Online]. Available: https://docs.nvidia.com/cuda/ cuda-c-programming-guide/index.html#cuda-graphs
work page 2025
-
[49]
“Pypose linear solver,” https://pypose.org/docs/main/generated/pypose. optim.solver.PINV/
-
[50]
Newton’s method with a model trust-region modification,
D. C. Sorensen, “Newton’s method with a model trust-region modification,” University of North Texas Libraries, UNT Digital Library, Tech. Rep., September 1980, accessed: September 13, 2024. [Online]. Available: https://digital.library.unt.edu/ark:/67531/metadc283479/
work page 1980
-
[51]
Robust global translations with 1dsfm,
K. Wilson and N. Snavely, “Robust global translations with 1dsfm,” in Proceedings of the European Conference on Computer Vision (ECCV) , 2014
work page 2014
-
[52]
Common Objects in 3D: Large-scale learning and evaluation of real-life 3D category reconstruction,
J. Reizenstein, R. Shapovalov, P. Henzler, L. Sbordone, P. Labatut, and D. Novotny, “Common Objects in 3D: Large-scale learning and evaluation of real-life 3D category reconstruction,” in Proc. ICCV, 2021
work page 2021
-
[53]
oneapi threading building blocks (onetbb),
Intel Corporation, “oneapi threading building blocks (onetbb),” https:// www.intel.com/content/www/us/en/developer/tools/oneapi/onetbb.html, 2021, version 2021.5
work page 2021
-
[54]
Superglue: Learning feature matching with graph neural networks,
P.-E. Sarlin, D. DeTone, T. Malisiewicz, and A. Rabinovich, “Superglue: Learning feature matching with graph neural networks,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 4938–4947
work page 2020
-
[55]
Pixel-perfect structure-from-motion with featuremetric refinement,
P. Lindenberger, P. Sarlin, V . Larsson, and M. Pollefeys, “Pixel-perfect structure-from-motion with featuremetric refinement,” arXiv.cs, vol. abs/2108.08291, 2021
-
[56]
PoseDiffusion: solving pose estimation via diffusion-aided bundle adjustment,
J. Wang, C. Rupprecht, and D. Novotny, “PoseDiffusion: solving pose estimation via diffusion-aided bundle adjustment,” in Proc. ICCV, 2023
work page 2023
-
[57]
DUSt3R: Geometric 3D vision made easy,
S. Wang, V . Leroy, Y . Cabon, B. Chidlovskii, and J. Revaud, “DUSt3R: Geometric 3D vision made easy,” in Proc. CVPR, 2024
work page 2024
-
[58]
Grounding image matching in 3d with mast3r, 2024
V . Leroy, Y . Cabon, and J. Revaud, “Grounding image matching in 3d with mast3r,” arXiv preprint arXiv:2406.09756 , 2024
-
[59]
VGGSfM: visual geometry grounded deep structure from motion,
J. Wang, N. Karaev, C. Rupprecht, and D. Novotny, “VGGSfM: visual geometry grounded deep structure from motion,” in Proc. CVPR, 2024
work page 2024
-
[60]
Mv-dust3r+: Single-stage scene reconstruction from sparse views in 2 seconds,
Z. Tang, Y . Fan, D. Wang, H. Xu, R. Ranjan, A. Schwing, and Z. Yan, “Mv-dust3r+: Single-stage scene reconstruction from sparse views in 2 seconds,” arXiv preprint arXiv:2412.06974 , 2024
-
[61]
Continuous 3d perception model with persistent state,
Q. Wang, Y . Zhang, A. Holynski, A. A. Efros, and A. Kanazawa, “Continuous 3d perception model with persistent state,” 2025
work page 2025
-
[62]
Flare: Feed-forward geometry, appearance and camera estimation from uncalibrated sparse views, 2026
S. Zhang, J. Wang, Y . Xu, N. Xue, C. Rupprecht, X. Zhou, Y . Shen, and G. Wetzstein, “Flare: Feed-forward geometry, appearance and camera estimation from uncalibrated sparse views,” 2025. [Online]. Available: https://arxiv.org/abs/2502.12138
-
[63]
arXiv preprint arXiv:2501.13928 (2025)
J. Yang, A. Sax, K. J. Liang, M. Henaff, H. Tang, A. Cao, J. Chai, F. Meier, and M. Feiszli, “Fast3r: Towards 3d reconstruction of 1000+ images in one forward pass,” arXiv preprint arXiv:2501.13928 , 2025
-
[64]
Baspacho: Direct solver for sparse spd matrices for nonlinear optimization,
Facebook Research, “Baspacho: Direct solver for sparse spd matrices for nonlinear optimization,” https://github.com/facebookresearch/baspacho, 2025, accessed: February 19, 2025
work page 2025
-
[65]
Scannet: Richly-annotated 3d reconstructions of indoor scenes,
A. Dai, A. X. Chang, M. Savva, M. Halber, T. Funkhouser, and M. Nießner, “Scannet: Richly-annotated 3d reconstructions of indoor scenes,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 5828–5839
work page 2017
-
[66]
C. Lu, F. Yin, X. Chen, T. Chen, G. Yu, and J. Fan, “A large-scale outdoor multi-modal dataset and benchmark for novel view synthesis and implicit scene reconstruction,” arXiv preprint arXiv:2301.06782 , 2023
-
[67]
KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d,
Y . Liao, J. Xie, and A. Geiger, “KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d,” Pattern Analysis and Machine Intelligence (PAMI) , 2022
work page 2022
-
[68]
BAD SLAM: Bundle adjusted direct RGB-D SLAM,
T. Sch ¨ops, T. Sattler, and M. Pollefeys, “BAD SLAM: Bundle adjusted direct RGB-D SLAM,” in Conference on Computer Vision and Pattern Recognition (CVPR), 2019
work page 2019
-
[69]
Learning feature descriptors using camera pose supervision,
Q. Wang, X. Zhou, B. Hariharan, and N. Snavely, “Learning feature descriptors using camera pose supervision,” in European Conference on Computer Vision. Springer, 2020, pp. 757–774
work page 2020
-
[70]
Aspanformer: Detector-free image matching with adaptive span transformer,
H. Chen, Z. Luo, L. Zhou, Y . Tian, M. Zhen, T. Fang, D. McKinnon, Y . Tsin, and L. Quan, “Aspanformer: Detector-free image matching with adaptive span transformer,” in Computer Vision–ECCV 2022: 17th Eu- ropean Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XXXII. Springer, 2022, pp. 20–36
work page 2022
-
[71]
DKM: Dense kernelized feature matching for geometry estimation,
J. Edstedt, I. Athanasiadis, M. Wadenb ¨ack, and M. Felsberg, “DKM: Dense kernelized feature matching for geometry estimation,” in IEEE Conference on Computer Vision and Pattern Recognition , 2023
work page 2023
-
[72]
Self-supervised ge- ometric perception,
H. Yang, W. Dong, L. Carlone, and V . Koltun, “Self-supervised ge- ometric perception,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , 2021, pp. 14 350–14 361
work page 2021
-
[73]
Scalability in perception for autonomous driving: Waymo open dataset,
P. Sun, H. Kretzschmar, X. Dotiwalla, A. Chouard, V . Patnaik, P. Tsui, J. Guo, Y . Zhou, Y . Chai, B. Caine, V . Vasudevan, W. Han, J. Ngiam, H. Zhao, A. Timofeev, S. Ettinger, M. Krivokon, A. Gao, A. Joshi, Y . Zhang, J. Shlens, Z. Chen, and D. Anguelov, “Scalability in perception for autonomous driving: Waymo open dataset,” in Proceedings of the IEEE/C...
work page 2020
-
[74]
VERTIGO: Versatile Extensions for Robust Inference using Graphical Odometry,
OpenSLAM-org, “VERTIGO: Versatile Extensions for Robust Inference using Graphical Odometry,” https://openslam-org.github.io/vertigo.html, accessed: 2025-04-11
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.