Sonar-GPS Fusion for Seabed Mapping in Turbid Shallow Waters with an Autonomous Surface Vehicle
Pith reviewed 2026-05-09 17:23 UTC · model grok-4.3
The pith
Integrating sonar alignment with GPS-IMU EKF optimization reduces seabed mapping drift by 9.5 percent versus sonar-only methods.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that a drift-resilient seabed mapping framework integrating local forward-looking sonar frame alignment via Fourier-Mellin transform with global extended Kalman filter optimization fusing GPS, IMU, and compass data, together with variance-based blending, achieves 9.5% RMSE drift reduction over FMT-only baseline, sub-meter reconstruction accuracy, and preservation of high-resolution textures in field trials on a structured oyster farm site.
What carries the argument
The integration of Fourier-Mellin transform for local forward-looking sonar alignment with extended Kalman filter trajectory optimization fusing GPS, IMU, and compass data, plus variance-based image blending.
If this is right
- Long-trajectory mapping becomes feasible in low-visibility waters where optical cameras fail.
- Sub-meter accuracy supports detailed habitat monitoring and infrastructure inspection.
- Preserved high-resolution textures enable automated inventory estimation such as oyster counts from the maps.
- The method offers a practical alternative for autonomous surface vehicles in aquaculture and coastal monitoring.
Where Pith is reading between the lines
- The multi-sensor fusion could be adapted to other autonomous platforms facing similar drift and visibility challenges.
- Further tests in open coastal or varying-turbidity settings would clarify how far the EKF model extends beyond the tested farm.
- If accuracy holds, missions could extend longer between GPS updates without losing map quality.
Load-bearing premise
Performance gains from one structured oyster farm site will generalize to other turbid shallow waters and the FMT-only baseline is a fair untuned comparison.
What would settle it
Repeat field trials in other turbid shallow sites that show RMSE drift reduction below 9.5 percent or reconstruction accuracy worse than one meter would disprove the claimed benefits of the fusion.
Figures
read the original abstract
Accurate seabed mapping is essential for habitat monitoring and infrastructure inspection. In turbid, shallow coastal waters, such as shellfish aquaculture farms, the effectiveness of traditional optical methods is limited. Autonomous surface vehicles (ASVs) equipped with forward-looking sonar (FLS) offer a promising alternative. However, existing sonar-based systems face challenges in achieving fine resolution mapping over long trajectories due to low-resolution positioning measurements and accumulated drift over long trajectories. In this paper, we present a drift-resilient seabed mapping framework that integrates local FLS frame alignment using the Fourier-Mellin transform (FMT) with global trajectory optimization based on an extended Kalman filter (EKF) that fuses global positioning system (GPS), inertial measurement unit (IMU), and compass data. A variance-based image blending strategy is used to further reduce visual artifacts in overlapping regions. Field trials on a structured oyster farm site show that our framework helps reduce drift in RMSE by 9.5% relative to the FMT-only baseline. This framework also enables sub-meter reconstruction accuracy and preservation of high-resolution textures needed for oyster inventory estimation within the mapped areas.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents a drift-resilient seabed mapping framework for autonomous surface vehicles in turbid shallow waters. It integrates local forward-looking sonar (FLS) frame alignment via the Fourier-Mellin transform (FMT) with global trajectory optimization using an extended Kalman filter (EKF) that fuses GPS, IMU, and compass measurements, plus a variance-based image blending step. Field trials on a structured oyster farm site are reported to yield a 9.5% RMSE reduction in drift relative to an FMT-only baseline, sub-meter reconstruction accuracy, and preservation of high-resolution textures suitable for oyster inventory estimation.
Significance. If the reported quantitative gains are robust, the work offers a practical advance for sonar-based mapping in environments where optical methods are ineffective, with direct relevance to shellfish aquaculture monitoring and coastal habitat inspection. The combination of local FMT alignment with EKF-based global fusion and blending addresses accumulated drift over long trajectories, a known limitation in existing sonar systems. The field-trial results provide concrete evidence of feasibility on a real structured site.
major comments (2)
- [Field trials] Field trials section: the 9.5% RMSE drift reduction and sub-meter accuracy claims rest on comparison to an FMT-only baseline and an unspecified ground-truth reference. The manuscript must clarify whether the baseline used identical sensor noise models, loop-closure thresholds, blending parameters, and processing pipeline as the EKF-fused system; otherwise the delta cannot be attributed to the fusion step rather than implementation differences.
- [Experimental results] Experimental results: independent ground-truth trajectory or map data used to compute RMSE and sub-meter accuracy must be described in detail, including acquisition method, coverage, and any potential correlation with the proposed pipeline. Single-site trials on one structured oyster farm limit generalizability; additional sites or conditions should be discussed or provided.
minor comments (2)
- [Method] Notation for the EKF state vector and measurement models should be introduced with explicit equations rather than prose descriptions only.
- [Figures] Figure captions for the reconstructed maps should include scale bars, RMSE values, and direct visual comparison to the FMT-only result.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We address each major point below, outlining the clarifications and revisions we will incorporate to strengthen the presentation of the field trials and experimental results.
read point-by-point responses
-
Referee: [Field trials] Field trials section: the 9.5% RMSE drift reduction and sub-meter accuracy claims rest on comparison to an FMT-only baseline and an unspecified ground-truth reference. The manuscript must clarify whether the baseline used identical sensor noise models, loop-closure thresholds, blending parameters, and processing pipeline as the EKF-fused system; otherwise the delta cannot be attributed to the fusion step rather than implementation differences.
Authors: We agree that the baseline comparison requires explicit confirmation to attribute the performance gain to the EKF fusion. In the revised manuscript, we will add a detailed description of the FMT-only baseline implementation, explicitly stating that it employed identical sensor noise models, loop-closure thresholds, blending parameters, and the same processing pipeline as the proposed system. This will ensure the 9.5% RMSE reduction is clearly attributable to the global trajectory optimization step. revision: yes
-
Referee: [Experimental results] Experimental results: independent ground-truth trajectory or map data used to compute RMSE and sub-meter accuracy must be described in detail, including acquisition method, coverage, and any potential correlation with the proposed pipeline. Single-site trials on one structured oyster farm limit generalizability; additional sites or conditions should be discussed or provided.
Authors: We will expand the Experimental results section to describe the independent ground-truth data in detail, including the acquisition method (high-precision RTK-GPS surveying), spatial coverage, and evaluation of any potential correlations with the sonar-GPS fusion pipeline. For generalizability, we will add a discussion acknowledging the single-site limitation while explaining how the FMT local alignment, EKF global fusion, and variance-based blending are formulated to apply to other turbid shallow-water environments; we note that multi-site validation remains valuable for future extensions. revision: partial
Circularity Check
No circularity: empirical claims rest on external field measurements
full rationale
The paper presents an engineering framework (FMT local alignment + EKF GPS/IMU/compass fusion + variance blending) and validates it via field trials on one oyster farm site. The headline quantitative results (9.5% RMSE drift reduction vs. FMT-only baseline, sub-meter accuracy) are reported as direct outcomes of those external measurements, not as predictions derived from fitted parameters or self-referential equations. No derivation chain, uniqueness theorem, or ansatz is invoked that reduces to the inputs by construction. Self-citations, if any, are not load-bearing for the central claims. This is a standard experimental robotics paper whose correctness can be assessed against the reported ground-truth data rather than internal consistency of a derivation.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
On 3-d motion estimation from feature tracks in 2-d fs sonar video,
S. Negahdaripour, “On 3-d motion estimation from feature tracks in 2-d fs sonar video,”IEEE Transactions on Robotics, vol. 29, no. 4, pp. 1016–1030, 2013
work page 2013
-
[2]
Evaluation of registration methods on two-dimensional forward-looking sonar imagery,
N. Hurt ´os, S. Nagappa, X. Cufi, Y . Petillot, and J. Salvi, “Evaluation of registration methods on two-dimensional forward-looking sonar imagery,” in2013 MTS/IEEE OCEANS - Bergen, 2013, pp. 1–8
work page 2013
-
[3]
On feature matching and image registration for two-dimensional forward-scan sonar imaging,
M. D. Aykin and S. Negahdaripour, “On feature matching and image registration for two-dimensional forward-scan sonar imaging,”Journal of Field Robotics, vol. 30, no. 4, pp. 602–623, 2013
work page 2013
-
[4]
Fourier-based registrations for two-dimensional forward-looking sonar image mosaicing,
N. Hurt ´os, Y . Petillot, J. Salviet al., “Fourier-based registrations for two-dimensional forward-looking sonar image mosaicing,” in2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. Ieee, 2012, pp. 5298–5305
work page 2012
-
[5]
N. Hurt ´os, D. Ribas, X. Cufi, Y . Petillot, and J. Salvi, “Fourier- based registration for robust forward-looking sonar mosaicing in low- visibility underwater environments,”Journal of Field Robotics, vol. 32, no. 1, pp. 123–151, 2015
work page 2015
-
[6]
Riverine coverage with an autonomous surface vehicle over known environments,
N. Karapetyan, A. Braude, J. Moulton, J. A. Burstein, S. White, J. M. O’Kane, and I. Rekleitis, “Riverine coverage with an autonomous surface vehicle over known environments,” in2019 IEEE/RSJ Interna- tional Conference on Intelligent Robots and Systems (IROS). IEEE, 2019, pp. 3098–3104
work page 2019
-
[7]
Machine learning-based classification of on-bottom oyster farms using a mechanical scanning sonar,
M. A. Xu, N. Karapetyan, K. Rajasekaran, A. Williams, D. A. Pattillo, P. Tokekar, B. Hudson, Y . Tao, D. Webster, M. Gray, and M. Yu, “Machine learning-based classification of on-bottom oyster farms using a mechanical scanning sonar,”IEEE Sensors Journal, vol. 25, no. 12, pp. 22 068–22 079, 2025
work page 2025
-
[8]
X. Liu, J. Yang, C. Lu, E. Zhang, and W. Xu, “Robust forward-looking sonar-image mosaicking without external sensors for autonomous deep-sea mining,”Journal of Marine Science and Engineering, vol. 13, no. 7, p. 1291, 2025
work page 2025
-
[9]
Realtime registration for phased-array submarine sonar image,
J. Li, P. Wang, H. Zhang, Y . Yang, and B. Liu, “Realtime registration for phased-array submarine sonar image,”OCEANS 2018 MTS/IEEE Charleston, pp. 1–6, 2018
work page 2018
-
[10]
Method for registration of 3-d shapes,
P. J. Besl and N. D. McKay, “Method for registration of 3-d shapes,” inSensor fusion IV: control paradigms and data structures, vol. 1611. Spie, 1992, pp. 586–606
work page 1992
-
[11]
Iterative point matching for registration of free-form curves and surfaces,
Z. Zhang, “Iterative point matching for registration of free-form curves and surfaces,”International journal of computer vision, vol. 13, no. 2, pp. 119–152, 1994
work page 1994
-
[12]
Underwater sonar and aerial images data fusion for robot localization,
W. Dos Santos, T. Oliverira, and E. Brunskill, “Underwater sonar and aerial images data fusion for robot localization,”Robotics and Autonomous Systems, vol. 121, pp. 103–115, 2019
work page 2019
-
[13]
Pose-graph slam using forward-looking sonar,
J. Li, M. Kaess, R. M. Eustice, and M. Johnson-Roberson, “Pose-graph slam using forward-looking sonar,”IEEE Robotics and Automation Letters, vol. 3, no. 3, pp. 2330–2337, 2018
work page 2018
-
[14]
Imaging sonar-aided navigation for autonomous underwater harbor surveillance,
H. Johannsson, M. Kaess, B. Englot, F. Hover, and J. Leonard, “Imaging sonar-aided navigation for autonomous underwater harbor surveillance,” in2010 IEEE/RSJ International Conference on Intelli- gent Robots and Systems. IEEE, 2010, pp. 4396–4403
work page 2010
-
[15]
isam2: Incremental smoothing and mapping using the bayes tree,
M. Kaess, H. Johannsson, R. Roberts, V . Ila, J. J. Leonard, and F. Dellaert, “isam2: Incremental smoothing and mapping using the bayes tree,”The International Journal of Robotics Research, vol. 31, no. 2, pp. 216–235, 2012
work page 2012
-
[16]
Robust imaging sonar- based place recognition and localization in underwater environments,
H. Kim, G. Kang, S. Jeong, S. Ma, and Y . Cho, “Robust imaging sonar- based place recognition and localization in underwater environments,” in2023 IEEE International Conference on Robotics and Automation (ICRA), 2023, pp. 1083–1089
work page 2023
-
[17]
Research on a high-precision real-time 3d sonar image registration algorithm,
Q. Wu, G. He, C. Liu, J. Li, B. Zhang, and C. Tao, “Research on a high-precision real-time 3d sonar image registration algorithm,” in2023 IEEE International Conference on Control, Electronics and Computer Technology (ICCECT), 2023, pp. 413–416
work page 2023
-
[18]
Non-gaussian slam utilizing synthetic aperture sonar,
M. Y . Cheung, D. Fourie, N. R. Rypkema, P. V . Teixeira, H. Schmidt, and J. Leonard, “Non-gaussian slam utilizing synthetic aperture sonar,” in2019 International Conference on Robotics and Automation (ICRA), 2019, pp. 3457–3463
work page 2019
-
[19]
J. McConnell, Y . Huang, P. Szenher, I. Collado-Gonzalez, and B. En- glot, “Draco-slam: Distributed robust acoustic communication-efficient slam for imaging sonar equipped underwater robot teams,” in2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022, pp. 8457–8464
work page 2022
-
[20]
Y . Huang, J. McConnell, X. Lin, and B. Englot, “Draco-slam2: Distributed robust acoustic communication-efficient slam for imaging sonar equipped underwater robot teams with object graph matching,” in2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025, pp. 3587–3594
work page 2025
-
[21]
Overhead image factors for underwater sonar-based slam,
J. McConnell, F. Chen, and B. Englot, “Overhead image factors for underwater sonar-based slam,”IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 4901–4908, 2022
work page 2022
-
[22]
Large- scale dense 3-d mapping using submaps derived from orthogonal imaging sonars,
J. McConnell, I. Collado-Gonzalez, P. Szenher, and B. Englot, “Large- scale dense 3-d mapping using submaps derived from orthogonal imaging sonars,”IEEE Journal of Oceanic Engineering, 2024
work page 2024
-
[23]
Oysterbot: Autonomous surface vehicle based system for aquaculture monitoring operations,
N. Karapetyan, M. Xu, Y . Zhang, K. Joshi, N. Anil, S. Mayenkar, X. Lin, Y . Aloimonos, N. Chopra, P. Maloneet al., “Oysterbot: Autonomous surface vehicle based system for aquaculture monitoring operations,” inOCEANS 2025-Great Lakes. IEEE, 2025, pp. 1–7
work page 2025
-
[24]
L. J. Poppe, R. Lewis, H. J. Knebel, E. A. Haase, K. Parolski, and M. DiGiacomo-Cohen, “Sidescan sonar images, surficial geologic interpretations, and bathymetry of the long island sound sea floor in new haven harbor and new haven dumping ground, connecticut,” US Geological Survey, Tech. Rep., 2001
work page 2001
-
[25]
Image fusion methods: A survey
S. Masood, M. Sharif, M. Yasmin, M. A. Shahid, and A. Rehman, “Image fusion methods: A survey.”Journal of Engineering Science & Technology Review, vol. 10, no. 6, 2017
work page 2017
-
[26]
A novel blending technique for two- dimensional forward-looking sonar mosaicing,
N. Hurt ´os, X. Cufi, and J. Salvi, “A novel blending technique for two- dimensional forward-looking sonar mosaicing,” in2013 OCEANS - San Diego, 2013, pp. 1–7
work page 2013
-
[27]
A blending method for forward-looking sonar mosaicing handling intra- and inter-frame artifacts,
J. Su, H. Li, J. Qian, X. An, F. Qu, and Y . Wei, “A blending method for forward-looking sonar mosaicing handling intra- and inter-frame artifacts,”Ocean Engineering, vol. 298, p. 117249, 2024
work page 2024
-
[28]
A non-local algorithm for image denoising,
A. Buades, B. Coll, and J.-M. Morel, “A non-local algorithm for image denoising,” in2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), vol. 2, 2005, pp. 60–65
work page 2005
-
[29]
Zuiderveld,Contrast limited adaptive histogram equalization
K. Zuiderveld,Contrast limited adaptive histogram equalization. USA: Academic Press Professional, Inc., 1994, p. 474–485
work page 1994
-
[30]
A mathematical theory of communication,
C. E. Shannon, “A mathematical theory of communication,”The Bell System Technical Journal, vol. 27, no. 3, pp. 379–423, 1948
work page 1948
-
[31]
E. Peli, “Contrast in complex images,”Journal of the Optical Society of America A, vol. 7, no. 10, pp. 2032–2040, 1990
work page 2032
- [32]
-
[33]
Diatom autofocusing in brightfield microscopy: a comparative study,
J. Pech-Pacheco, G. Cristobal, J. Chamorro-Martinez, and J. Fernandez-Valdivia, “Diatom autofocusing in brightfield microscopy: a comparative study,” inProceedings 15th International Conference on Pattern Recognition. ICPR-2000, vol. 3, 2000, pp. 314–317
work page 2000
-
[34]
An fft-based technique for translation, rotation, and scale-invariant image registration,
B. Reddy and B. Chatterji, “An fft-based technique for translation, rotation, and scale-invariant image registration,”IEEE Transactions on Image Processing, vol. 5, no. 8, pp. 1266–1271, 1996
work page 1996
-
[35]
Orb: An efficient alternative to sift or surf,
E. Rublee, V . Rabaud, K. Konolige, and G. Bradski, “Orb: An efficient alternative to sift or surf,” in2011 International Conference on Computer Vision, 2011, pp. 2564–2571
work page 2011
-
[36]
M. A. Fischler and R. C. Bolles, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,”Commun. ACM, vol. 24, no. 6, p. 381–395, Jun. 1981
work page 1981
-
[37]
Distinctive image features from scale-invariant key- points,
D. G. Lowe, “Distinctive image features from scale-invariant key- points,”International journal of computer vision, vol. 60, no. 2, pp. 91–110, 2004
work page 2004
-
[38]
Parametric image alignment using enhanced correlation coefficient maximization,
G. D. Evangelidis and E. Z. Psarakis, “Parametric image alignment using enhanced correlation coefficient maximization,”IEEE transac- tions on pattern analysis and machine intelligence, vol. 30, no. 10, pp. 1858–1865, 2008
work page 2008
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.