Contextual Multi-Task Reinforcement Learning for Autonomous Reef Monitoring

· 2026 · cs.RO · arXiv 2604.12645

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Although autonomous underwater vehicles promise the capability of marine ecosystem monitoring, their deployment is fundamentally limited by the difficulty of controlling vehicles under highly uncertain and non-stationary underwater dynamics. To address these challenges, we employ a data-driven reinforcement learning approach to compensate for unknown dynamics and task variations.Traditional single-task reinforcement learning has a tendency to overfit the training environment, thus, limit the long-term usefulness of the learnt policy. Hence, we propose to use a contextual multi-task reinforcement learning paradigm instead, allowing us to learn controllers that can be reused for various tasks, e.g., detecting oysters in one reef and detecting corals in another. We evaluate whether contextual multi-task reinforcement learning can efficiently learn robust and generalisable control policies for autonomous underwater reef monitoring. We train a single context-dependent policy that is able to solve multiple related monitoring tasks in a simulated reef environment in HoloOcean. In our experiments, we empirically evaluate the contextual policies regarding sample-efficiency, zero-shot generalisation to unseen tasks, and robustness to varying water currents. By utilising multi-task reinforcement learning, we aim to improve the training effectiveness, as well as the reusability of learnt policies to take a step towards more sustainable procedures in autonomous reef monitoring.

representative citing papers

Task-specific Subnetwork Discovery in Reinforcement Learning for Autonomous Underwater Navigation

cs.LG · 2026-04-23 · unverdicted · novelty 5.0

Contextual multi-task RL for underwater navigation uses just 1.5% of network weights for task differentiation, mostly from context-variable connections to the first hidden layer.

citing papers explorer

Showing 1 of 1 citing paper.

Task-specific Subnetwork Discovery in Reinforcement Learning for Autonomous Underwater Navigation cs.LG · 2026-04-23 · unverdicted · none · ref 17 · internal anchor
Contextual multi-task RL for underwater navigation uses just 1.5% of network weights for task differentiation, mostly from context-variable connections to the first hidden layer.

Contextual Multi-Task Reinforcement Learning for Autonomous Reef Monitoring

fields

years

verdicts

representative citing papers

citing papers explorer