Multi-Objective-Optimization Assisted Data Collection Framework for IoUT Based on Offline Reinforcement
abstract
Information Updating Networks (IUNs) offer significant potential for ocean exploration but face challenges from dynamic underwater environments and severe system attenuation. Current methods, which rely on Autonomous Underwater Vehicles (AUVs) based on online reinforcement learning (RL), incur high computational costs and low data utilization. To address these issues and the constraints of turbulent ocean environments, we propose a multi-AUV assisted data collection framework for IUNs based on multi-agent offline RL. Using environmental and equipment status data, the framework maximizes data rate and the value of information (VoI), minimizes energy consumption, and ensures collision avoidance. To solve the resulting problem, we introduce a semi-communication decentralized training with decentralized execution (SC-DTDE) paradigm and a multi-agent independent conservative Q-learning algorithm (MAICQL). Extensive simulations demonstrate the high applicability, robustness, and data collection efficiency of the proposed framework.
fields: eess.SY (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
AoI-MDP: An AoI Optimized Markov Decision Process (Student Abstract)
AoI-MDP integrates age of information into MDP state, action, and reward to optimize decision-making under observation delays for underwater autonomous vehicles.