pith. sign in

arxiv: 2605.09093 · v1 · submitted 2026-05-09 · 💻 cs.RO · cs.SY· eess.SY

HyDRA Scorpion: A Cost-effective and Modular ROV for Real-Time Underwater Inspection, Intervention, and Object Detection

Pith reviewed 2026-05-12 02:43 UTC · model grok-4.3

classification 💻 cs.RO cs.SYeess.SY
keywords ROVunderwater inspectionobject detectionAI in roboticsmodular vehiclepressure testingmanipulator arms
0
0 comments X

The pith

A modular low-cost ROV integrates AI to achieve real-time underwater object detection and stable intervention capabilities.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents HyDRA Scorpion as an affordable, locally manufacturable ROV that combines 4-DoF movement, dual manipulators, and an AI perception system for underwater tasks. It shows through tests that the vehicle can maintain position within 0.15 meters, operate without leaks at simulated depths of 300 meters, and detect objects with 0.89 mean average precision in real time. A sympathetic reader would care because this approach reduces the high costs of commercial systems and bridges the gap between basic mechanical platforms and intelligent ones. If successful, such designs could expand access to underwater robotics for inspection, intervention, and research.

Core claim

HyDRA Scorpion is a cost-effective modular ROV that incorporates an advanced AI-driven perception stack with in-situ measurement capabilities on a low-cost platform featuring 4-DoF maneuverability, dual manipulators, and a custom pressure-tested housing, achieving leak-free operation at 4 bar, station-keeping within ±0.15 meters, and object detection mAP of 0.89 with real-time inference.

What carries the argument

The onboard AI module for object detection and 3D-mapping distance measurement, integrated with the pressure-resistant electronics housing and dual-function manipulators allowing 360-degree rotation.

If this is right

  • Real-time inference enables immediate object detection during inspections.
  • Stable station-keeping supports precise intervention tasks with the manipulators.
  • Pressure testing to 4 bar confirms suitability for operations up to approximately 300 meters.
  • Modular design allows for local manufacturing and customization.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Such a platform might enable broader participation in marine research by lowering entry costs for academic and citizen science groups.
  • Extending the AI capabilities to include more complex tasks like autonomous navigation could further reduce operator workload.
  • Testing in actual ocean environments with biofouling and variable lighting would be a natural next step to validate the simulated results.

Load-bearing premise

The performance metrics from simulated pressure tests and object detection on the datasets used will hold in unpredictable real ocean conditions with changing visibility, currents, and marine growth.

What would settle it

A field test in open water showing either water leakage into the housing or object detection mAP falling below 0.7 under typical underwater visibility conditions would disprove the claims of robustness and performance.

Figures

Figures reproduced from arXiv: 2605.09093 by A.K.M. Muzahidul Islam, A. M. Zayed Abdullah, Anika Tabassum Orchi, Fahim Hafiz, Iftekharul Islam, Mahbubul Islam, Md Alamgir, Md Darain Khan, Md Farhan Zaman, Md. Jobayer Rahman, Md. Kutub Al Baki, Md Mehrab Hossain Khan, Md Muzahidul Islam, Md Sadique Hossain, M. Rezwan Khan, Nusrat Jahan Piyal, Shah Mohammad Seaman, Shakil Ahmed, Shekh Md. Saifur Rahman.

Figure 1
Figure 1. Figure 1: (a) HyDRA Scorpion ROV navigates by leveraging the AI-based [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 3
Figure 3. Figure 3: The prototype of HyDRA Scorpion Beyond vehicle platforms, recent work [13], [14] on subsea autonomy emphasises semantic mapping and intervention. Semantic mapping frameworks provide higher-level situational awareness and task reasoning, but they presuppose reliable object detection and quantitative measurements as inputs. Similarly, efforts in underwater exploration ROVs and search￾and-rescue prototypes hi… view at source ↗
Figure 4
Figure 4. Figure 4: HyDRA Scorpion system overview. A central, 6-inch acrylic pressure vessel, fabricated in￾house, houses the core electronics and is mounted within an open-frame chassis that supports all propulsion, manipulation, and sensory payloads. Power is delivered from the surface at 48V to minimize transmission losses and converted onboard. The ROV’s vectored eight-thruster configuration provides full 6-DoF authority… view at source ↗
Figure 5
Figure 5. Figure 5: Mechanical structure of HyDRA-Scorpion. a) Frame and Hydrodynamics: The chassis is fabricated from 10mm carbon black reinforced high-density polyethylene (HDPE) sheets. This material was selected over traditional aluminum alloys for its excellent corrosion resistance in salt￾water, its near-neutral buoyancy (95% of water density), which significantly simplifies vehicle trimming, and its environmental recyc… view at source ↗
Figure 7
Figure 7. Figure 7: Rovmaker t200 thrusters Pixhawk 4 Leak Sensor Bar Sensor Sensors Control System Raspberry Pi 5 On-board Computer Front Facing Night Vision Camera (Pi Cam v3 NoIR) Down Facing Night Vision Camera (Pi Cam v3 NoIR) Visual Sensors Flight Controller Telemetry Interface Software Interface [PITH_FULL_IMAGE:figures/full_fig_p005_7.png] view at source ↗
Figure 6
Figure 6. Figure 6: Cross-section of the primary manipulator, detailing the coaxial dual [PITH_FULL_IMAGE:figures/full_fig_p005_6.png] view at source ↗
Figure 8
Figure 8. Figure 8: Vision Pipelines of object detection and length measurement [PITH_FULL_IMAGE:figures/full_fig_p006_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Sample output images from field tests of the AI vision system, showing [PITH_FULL_IMAGE:figures/full_fig_p007_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Sample outputs from the vision-based measurement system. The [PITH_FULL_IMAGE:figures/full_fig_p007_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: A full operation of the Manipulator Arm in the test bed [PITH_FULL_IMAGE:figures/full_fig_p008_11.png] view at source ↗
read the original abstract

A Remotely Operated Vehicle (ROV) is a tethered underwater robot used for tasks like inspection and intervention. While essential tools for underwater science, the high cost of commercial ROVs and a persistent gap between mechanically capable platforms and those with integrated intelligence create a significant barrier to access. HyDRA Scorpion differs from conventional systems by addressing these challenges, integrating an advanced, AI-driven perception stack with in-situ measurement capabilities onto a low-cost, locally manufacturable platform. The system combines 4-DoF maneuverability, dual manipulators, and a custom pressure-tested housing. Experimental results validate the system's robustness and performance. Leak-free operation was confirmed through prolonged pressure testing of the electronics housing to 4 bar, equivalent to the pressure of a 304.8-meter water depth approximately in a simulated environment, with no moisture ingress detected. The vehicle also demonstrated stable station-keeping, maintaining its position within a tight tolerance of $\(\pm\)0.15$ meters under external disturbances. The onboard AI module achieved underwater object detection mean Average Precision (mAP) of 0.89 with real-time inference, length and 3D-mapping based distance measurement. Also, 4-DoF manipulator arm can grip and maintain dual-function manipulator feature which support 360 degree tangle-free rotation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript presents HyDRA Scorpion, a low-cost modular ROV with 4-DoF maneuverability, dual manipulators, a custom pressure-tested electronics housing, and an integrated AI perception stack for real-time underwater object detection (reported mAP 0.89), length/3D-mapping distance measurement, and intervention tasks. Experimental claims include leak-free housing performance at 4 bar (simulating ~305 m depth) in a controlled environment and station-keeping within ±0.15 m under external disturbances.

Significance. If the controlled-test results generalize, the platform could meaningfully lower barriers to underwater robotics by combining accessible mechanical design with onboard AI capabilities. The work demonstrates practical integration of perception and manipulation on a locally manufacturable base, but its significance is constrained by the absence of open-water validation data under variable visibility, currents, and biofouling.

major comments (3)
  1. [Abstract] Abstract: the mAP of 0.89 for underwater object detection is reported without any description of the training/test datasets, object classes, baseline detectors, number of trials, error bars, or inference hardware/FPS, preventing assessment of whether the result supports the claim of robust real-time performance.
  2. [Abstract] Abstract: the station-keeping result of ±0.15 m is given without specifying the magnitude, type, or duration of external disturbances, the control law employed, sensor suite, or number of repeated trials, which are load-bearing for the 4-DoF maneuverability claim.
  3. [Abstract] Abstract: the pressure test to 4 bar is described only as 'prolonged' in a 'simulated environment' with no quantitative details on test duration, pressure ramp profile, or any accompanying open-water deployment data, leaving the generalization to 304.8 m ocean depth unsupported.
minor comments (2)
  1. [Abstract] Abstract: the phrase 'length and 3D-mapping based distance measurement' is ambiguous and should be clarified with respect to the measurement method and accuracy.
  2. [Abstract] Abstract: the final sentence on the manipulator ('4-DoF manipulator arm can grip and maintain dual-function manipulator feature which support 360 degree tangle-free rotation') contains grammatical errors and unclear terminology; rephrase for precision.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We agree that the abstract would benefit from greater specificity to support the experimental claims and have prepared revisions to address each point. Below we respond to the major comments individually.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the mAP of 0.89 for underwater object detection is reported without any description of the training/test datasets, object classes, baseline detectors, number of trials, error bars, or inference hardware/FPS, preventing assessment of whether the result supports the claim of robust real-time performance.

    Authors: We agree that these details are necessary for proper evaluation. The full manuscript describes the training and test datasets (including image counts and collection conditions), the specific object classes used for underwater inspection tasks, comparisons against baseline detectors, the number of trials performed, associated error bars, and the onboard inference hardware with measured FPS. In the revised version we will condense this information into the abstract while retaining the 0.89 mAP figure. revision: yes

  2. Referee: [Abstract] Abstract: the station-keeping result of ±0.15 m is given without specifying the magnitude, type, or duration of external disturbances, the control law employed, sensor suite, or number of repeated trials, which are load-bearing for the 4-DoF maneuverability claim.

    Authors: We concur that these parameters are essential. The manuscript details the control law (a 4-DoF PID-based controller), the sensor suite (IMU, depth sensor, and visual odometry), the disturbance types and magnitudes applied during testing, test durations, and the number of repeated trials yielding the ±0.15 m tolerance. We will incorporate a concise summary of these elements into the revised abstract. revision: yes

  3. Referee: [Abstract] Abstract: the pressure test to 4 bar is described only as 'prolonged' in a 'simulated environment' with no quantitative details on test duration, pressure ramp profile, or any accompanying open-water deployment data, leaving the generalization to 304.8 m ocean depth unsupported.

    Authors: We will add the missing quantitative details on test duration and pressure ramp profile to the abstract, drawing directly from the experimental section. The study was conducted in a controlled pressure chamber simulating 4 bar; no open-water deployment data are available in the current work. We will therefore revise the abstract to explicitly state that the result validates housing integrity under simulated pressure and to note the absence of in-situ ocean testing as a limitation, with future open-water validation planned. revision: partial

Circularity Check

0 steps flagged

No circularity: all claims are direct empirical reports from hardware tests

full rationale

The paper contains no mathematical derivations, equations, fitted parameters, or predictions. All load-bearing claims (pressure testing to 4 bar with no moisture ingress, station-keeping within ±0.15 m, AI mAP of 0.89 with real-time inference) are presented as direct experimental outcomes from controlled/simulated conditions. No self-citations, ansatzes, or uniqueness theorems are invoked to justify results. The validation chain is self-contained empirical reporting without any reduction of outputs to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, axioms, or invented entities are introduced; the work relies on standard engineering practices and off-the-shelf components without new theoretical foundations.

pith-pipeline@v0.9.0 · 5659 in / 1115 out tokens · 38479 ms · 2026-05-12T02:43:08.534675+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

24 extracted references · 24 canonical work pages

  1. [1]

    A review of subsea auv technology,

    J. Zhou, Y . Si, and Y . Chen, “A review of subsea auv technology,” Journal of Marine Science and Engineering, vol. 11, no. 6, p. 1119, 2023

  2. [2]

    A hybrid underwater robot for multidisciplinary investigation of the ocean twilight zone,

    D. R. Yoerger, A. F. Govindarajan, J. C. Howland, J. K. Llopiz, P. H. Wiebe, M. Curran, J. Fujii, D. Gomez-Ibanez, K. Katija, B. H. Robison et al., “A hybrid underwater robot for multidisciplinary investigation of the ocean twilight zone,”Science Robotics, vol. 6, no. 55, p. eabe1901, 2021

  3. [3]

    Autonomous underwater vehicles-challenging developments and technological maturity towards strategic swarm robotics systems,

    N. Vedachalam, R. Ramesh, V . B. N. Jyothi, V . Doss Prakash, and G. Ra- madass, “Autonomous underwater vehicles-challenging developments and technological maturity towards strategic swarm robotics systems,” Marine Georesources & Geotechnology, vol. 37, no. 5, pp. 525–538, 2019

  4. [4]

    Analytical approach to sampling estimation of underwater tunnels using mechanical profiling sonars,

    V . A. Machado Jorge, P. D. d. C. Gava, J. R. B. d. F. Silva, T. M. Mancilha, W. Vieira, G. J. Adabo, and C. L. Nascimento Jr, “Analytical approach to sampling estimation of underwater tunnels using mechanical profiling sonars,”Sensors, vol. 21, no. 5, p. 1900, 2021

  5. [5]

    Oceanic challenges to technological solu- tions: A review of autonomous underwater vehicle path technologies in biomimicry, control, navigation, and sensing,

    K. Hasan, S. Ahmad, A. F. Liaf, M. Karimi, T. Ahmed, M. A. Shawon, and S. Mekhilef, “Oceanic challenges to technological solu- tions: A review of autonomous underwater vehicle path technologies in biomimicry, control, navigation, and sensing,”IEEE Access, vol. 12, pp. 46 202–46 231, 2024

  6. [6]

    Underwater and scuba diving accidents,

    D. Lambert, M. Binkley, and Z. Gaskill, “Underwater and scuba diving accidents,”Emergency medicine clinics of North America, vol. 42, no. 3, pp. 551–563, 2024

  7. [7]

    Scuba diving operators’ perspective of scuba diving tourism business in nusa penida, indonesia,

    A. Gerungan and K. W. Chia, “Scuba diving operators’ perspective of scuba diving tourism business in nusa penida, indonesia,”Journal of Outdoor Recreation and Tourism, vol. 31, p. 100328, 2020

  8. [8]

    Hy- perspectral 3d mapping of underwater environments,

    M. Ferrera, A. Arnaubec, K. Isteni ˇc, N. Gracias, and T. Bajjouk, “Hy- perspectral 3d mapping of underwater environments,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 3703–3712

  9. [9]

    Review on unmanned underwater robotics, structure de- signs, materials, sensors, actuators, and navigation control,

    J. Neira, C. Sequeiros, R. Huamani, E. Machaca, P. Fonseca, and W. Nina, “Review on unmanned underwater robotics, structure de- signs, materials, sensors, actuators, and navigation control,”Journal of Robotics, vol. 2021, no. 1, p. 5542920, 2021

  10. [10]

    Maresye: A hybrid imaging system for underwater robotic applications,

    A. M. Pinto and A. C. Matos, “Maresye: A hybrid imaging system for underwater robotic applications,”Information Fusion, vol. 55, pp. 16– 29, 2020

  11. [11]

    Fully actuated model-based control with six-degree-of-freedom coupled dynamical plant models for under- water vehicles: Theory and experimental evaluation,

    S. C. Martin and L. L. Whitcomb, “Fully actuated model-based control with six-degree-of-freedom coupled dynamical plant models for under- water vehicles: Theory and experimental evaluation,”The International Journal of Robotics Research, vol. 35, no. 10, pp. 1164–1184, 2016

  12. [12]

    Design and imple- mentation of a six-degrees-of-freedom underwater remotely operated vehicle,

    K. M. Salem, M. Rady, H. Aly, and H. Elshimy, “Design and imple- mentation of a six-degrees-of-freedom underwater remotely operated vehicle,”Applied Sciences, vol. 13, no. 12, p. 6870, 2023

  13. [13]

    Odyssee: Oyster detection yielded by sensor systems on edge electronics,

    X. Lin, V . Mange, A. Suresh, B. Neuberger, A. Palnitkar, B. Campbell, A. Williams, K. Baxevani, J. Mallette, A. Veraet al., “Odyssee: Oyster detection yielded by sensor systems on edge electronics,” in2025 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2025, pp. 5290–5297

  14. [14]

    Stonefish: supporting machine learning research in marine robotics (2025),

    M. Grimaldi, P. Cieslak, E. Ochoa, V . Bharti, H. Rajani, I. Carlucho, M. Koskinopoulou, Y . Petillot, and N. Gracias, “Stonefish: supporting machine learning research in marine robotics (2025),”URL: https://arxiv. org/abs/2502.11887

  15. [15]

    Design and modeling of an experimental rov with six degrees of freedom,

    A. Kabanov, V . Kramar, and I. Ermakov, “Design and modeling of an experimental rov with six degrees of freedom,”Drones, vol. 5, no. 4, p. 113, 2021

  16. [16]

    Dual refinement underwater object detection network,

    B. Fan, W. Chen, Y . Cong, and J. Tian, “Dual refinement underwater object detection network,” inEuropean Conference on Computer Vision. Springer, 2020, pp. 275–291

  17. [17]

    Enyolo: a real-time framework for domain-adaptive underwater object detection with image enhancement,

    J. Wen, J. Cui, B. Zhao, B. Han, X. Liu, Z. Gao, and B. M. Chen, “Enyolo: a real-time framework for domain-adaptive underwater object detection with image enhancement,” in2024 IEEE International Con- ference on Robotics and Automation (ICRA). IEEE, 2024, pp. 12 613– 12 619

  18. [18]

    Lightweight yet high-performance defect detector for uav-based large-scale infrastructure real-time inspection,

    B. Zhao, Q. Duan, G. Yang, J. Tang, Z. Song, J. Wen, X. Liu, Q. Li, L. Lei, J. Zhanget al., “Lightweight yet high-performance defect detector for uav-based large-scale infrastructure real-time inspection,” in2025 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2025, pp. 13 675–13 682

  19. [19]

    Lightweight underwater object detection based on yolo v4 and multi-scale attentional feature fusion,

    M. Zhang, S. Xu, W. Song, Q. He, and Q. Wei, “Lightweight underwater object detection based on yolo v4 and multi-scale attentional feature fusion,”Remote Sensing, vol. 13, no. 22, p. 4706, 2021

  20. [20]

    An improved yolo algorithm for fast and accurate underwater object detection,

    S. Zhao, J. Zheng, S. Sun, and L. Zhang, “An improved yolo algorithm for fast and accurate underwater object detection,”Symmetry, vol. 14, no. 8, p. 1669, 2022

  21. [21]

    Underwater image processing and object detection based on deep cnn method,

    F. Han, J. Yao, H. Zhu, and C. Wang, “Underwater image processing and object detection based on deep cnn method,”Journal of Sensors, vol. 2020, no. 1, p. 6707328, 2020

  22. [22]

    Fast underwater image enhancement for improved visual perception,

    M. J. Islam, Y . Xia, and J. Sattar, “Fast underwater image enhancement for improved visual perception,”IEEE robotics and automation letters, vol. 5, no. 2, pp. 3227–3234, 2020

  23. [23]

    Real-time flying object detection with yolov8,

    D. Reis, J. Kupec, J. Hong, and A. Daoudi, “Real-time flying object detection with yolov8,”arXiv preprint arXiv:2305.09972, 2023

  24. [24]

    Rethinking pascal-voc and ms-coco dataset for small object detection,

    K. Tong and Y . Wu, “Rethinking pascal-voc and ms-coco dataset for small object detection,”Journal of visual communication and image representation, vol. 93, p. 103830, 2023