Deep Reinforcement Learning-Based Multi-AUV Task Allocation Algorithm in Underwater Wireless Sensor Networks

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2024-12-04 DOI:10.1109/JSEN.2024.3507796

Zhibin Liu;Chunfeng Liu;Wenyu Qu;Tie Qiu;Zhao Zhao;Yansheng Hu;Huiyong Dong

{"title":"Deep Reinforcement Learning-Based Multi-AUV Task Allocation Algorithm in Underwater Wireless Sensor Networks","authors":"Zhibin Liu;Chunfeng Liu;Wenyu Qu;Tie Qiu;Zhao Zhao;Yansheng Hu;Huiyong Dong","doi":"10.1109/JSEN.2024.3507796","DOIUrl":null,"url":null,"abstract":"Autonomous underwater vehicle (AUV)-based data collection can bring significant advantages to underwater wireless sensor networks (UWSNs). Collaborative collection based on multi-AUV task allocation is an effective way to reduce delay. However, the existing research work seldom considers the real current environment in the task allocation, which leads to the large delay and yaw of AUVs. The introduction of ocean currents makes the existing task allocation algorithms no longer applicable due to the poor solving ability and long convergence time. Therefore, we propose an efficient task allocation algorithm named genetic algorithm N-step reinforcement learning improved DQN (GA-NDQN) by combining the genetic algorithm (GA) and N-step reinforcement learning (RL) nature of DQN to minimize the data collection delay. In our work, to minimize the impact of ocean currents on the AUV’s travel, the specific trajectory optimization problem between adjacent nodes is considered and modeled as a minimum weight sum problem (MWSP). To complete the entire data collection process, we performed path planning for AUVs and modeled it as an asymmetric traveling salesman problem (ATSP). A* algorithm and the Lin-Kernighan–Helsgaun (LKH) algorithm are designed to solve these problems, which are further nested in GA-NDQN to optimize the task allocation strategy for data collection. Finally, the effectiveness of the proposed scheme is verified by extensive simulation results.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 2","pages":"3909-3922"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10778250/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Autonomous underwater vehicle (AUV)-based data collection can bring significant advantages to underwater wireless sensor networks (UWSNs). Collaborative collection based on multi-AUV task allocation is an effective way to reduce delay. However, the existing research work seldom considers the real current environment in the task allocation, which leads to the large delay and yaw of AUVs. The introduction of ocean currents makes the existing task allocation algorithms no longer applicable due to the poor solving ability and long convergence time. Therefore, we propose an efficient task allocation algorithm named genetic algorithm N-step reinforcement learning improved DQN (GA-NDQN) by combining the genetic algorithm (GA) and N-step reinforcement learning (RL) nature of DQN to minimize the data collection delay. In our work, to minimize the impact of ocean currents on the AUV’s travel, the specific trajectory optimization problem between adjacent nodes is considered and modeled as a minimum weight sum problem (MWSP). To complete the entire data collection process, we performed path planning for AUVs and modeled it as an asymmetric traveling salesman problem (ATSP). A* algorithm and the Lin-Kernighan–Helsgaun (LKH) algorithm are designed to solve these problems, which are further nested in GA-NDQN to optimize the task allocation strategy for data collection. Finally, the effectiveness of the proposed scheme is verified by extensive simulation results.

查看原文本刊更多论文

基于深度强化学习的水下无线传感器网络多auv任务分配算法

基于自主水下航行器（AUV）的数据采集可以为水下无线传感器网络（UWSNs）带来显著的优势。基于多auv任务分配的协同采集是减少延迟的有效途径。然而，现有的研究工作在任务分配中很少考虑真实的当前环境，这导致了auv的大延迟和偏航。洋流的引入使得现有的任务分配算法由于求解能力差、收敛时间长而不再适用。因此，我们结合遗传算法（GA）和DQN的n步强化学习（RL）特性，提出了一种高效的任务分配算法——遗传算法n步强化学习改进DQN (GA- ndqn)，以最小化数据收集延迟。在我们的工作中，为了最大限度地减少洋流对AUV航行的影响，考虑了相邻节点之间的特定轨迹优化问题，并将其建模为最小权值和问题（MWSP）。为了完成整个数据收集过程，我们对auv进行了路径规划，并将其建模为不对称旅行推销员问题（ATSP）。针对这些问题设计了A*算法和Lin-Kernighan-Helsgaun （LKH）算法，并将其进一步嵌套在GA-NDQN中，优化数据采集的任务分配策略。最后，通过大量的仿真结果验证了所提方案的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice