{"title":"基于深度强化学习的水下无线传感器网络多auv任务分配算法","authors":"Zhibin Liu;Chunfeng Liu;Wenyu Qu;Tie Qiu;Zhao Zhao;Yansheng Hu;Huiyong Dong","doi":"10.1109/JSEN.2024.3507796","DOIUrl":null,"url":null,"abstract":"Autonomous underwater vehicle (AUV)-based data collection can bring significant advantages to underwater wireless sensor networks (UWSNs). Collaborative collection based on multi-AUV task allocation is an effective way to reduce delay. However, the existing research work seldom considers the real current environment in the task allocation, which leads to the large delay and yaw of AUVs. The introduction of ocean currents makes the existing task allocation algorithms no longer applicable due to the poor solving ability and long convergence time. Therefore, we propose an efficient task allocation algorithm named genetic algorithm N-step reinforcement learning improved DQN (GA-NDQN) by combining the genetic algorithm (GA) and N-step reinforcement learning (RL) nature of DQN to minimize the data collection delay. In our work, to minimize the impact of ocean currents on the AUV’s travel, the specific trajectory optimization problem between adjacent nodes is considered and modeled as a minimum weight sum problem (MWSP). To complete the entire data collection process, we performed path planning for AUVs and modeled it as an asymmetric traveling salesman problem (ATSP). A* algorithm and the Lin-Kernighan–Helsgaun (LKH) algorithm are designed to solve these problems, which are further nested in GA-NDQN to optimize the task allocation strategy for data collection. Finally, the effectiveness of the proposed scheme is verified by extensive simulation results.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 2","pages":"3909-3922"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Reinforcement Learning-Based Multi-AUV Task Allocation Algorithm in Underwater Wireless Sensor Networks\",\"authors\":\"Zhibin Liu;Chunfeng Liu;Wenyu Qu;Tie Qiu;Zhao Zhao;Yansheng Hu;Huiyong Dong\",\"doi\":\"10.1109/JSEN.2024.3507796\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Autonomous underwater vehicle (AUV)-based data collection can bring significant advantages to underwater wireless sensor networks (UWSNs). Collaborative collection based on multi-AUV task allocation is an effective way to reduce delay. However, the existing research work seldom considers the real current environment in the task allocation, which leads to the large delay and yaw of AUVs. The introduction of ocean currents makes the existing task allocation algorithms no longer applicable due to the poor solving ability and long convergence time. Therefore, we propose an efficient task allocation algorithm named genetic algorithm N-step reinforcement learning improved DQN (GA-NDQN) by combining the genetic algorithm (GA) and N-step reinforcement learning (RL) nature of DQN to minimize the data collection delay. In our work, to minimize the impact of ocean currents on the AUV’s travel, the specific trajectory optimization problem between adjacent nodes is considered and modeled as a minimum weight sum problem (MWSP). To complete the entire data collection process, we performed path planning for AUVs and modeled it as an asymmetric traveling salesman problem (ATSP). A* algorithm and the Lin-Kernighan–Helsgaun (LKH) algorithm are designed to solve these problems, which are further nested in GA-NDQN to optimize the task allocation strategy for data collection. Finally, the effectiveness of the proposed scheme is verified by extensive simulation results.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 2\",\"pages\":\"3909-3922\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10778250/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10778250/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Deep Reinforcement Learning-Based Multi-AUV Task Allocation Algorithm in Underwater Wireless Sensor Networks
Autonomous underwater vehicle (AUV)-based data collection can bring significant advantages to underwater wireless sensor networks (UWSNs). Collaborative collection based on multi-AUV task allocation is an effective way to reduce delay. However, the existing research work seldom considers the real current environment in the task allocation, which leads to the large delay and yaw of AUVs. The introduction of ocean currents makes the existing task allocation algorithms no longer applicable due to the poor solving ability and long convergence time. Therefore, we propose an efficient task allocation algorithm named genetic algorithm N-step reinforcement learning improved DQN (GA-NDQN) by combining the genetic algorithm (GA) and N-step reinforcement learning (RL) nature of DQN to minimize the data collection delay. In our work, to minimize the impact of ocean currents on the AUV’s travel, the specific trajectory optimization problem between adjacent nodes is considered and modeled as a minimum weight sum problem (MWSP). To complete the entire data collection process, we performed path planning for AUVs and modeled it as an asymmetric traveling salesman problem (ATSP). A* algorithm and the Lin-Kernighan–Helsgaun (LKH) algorithm are designed to solve these problems, which are further nested in GA-NDQN to optimize the task allocation strategy for data collection. Finally, the effectiveness of the proposed scheme is verified by extensive simulation results.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice