A novel underwater target tracking method in UASNs via collaborative deep reinforcement learning

IF 15.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Information Fusion Pub Date : 2025-10-02 DOI:10.1016/j.inffus.2025.103797

Linyao Zheng , Meiqin Liu , Senlin Zhang , Shanling Dong

{"title":"A novel underwater target tracking method in UASNs via collaborative deep reinforcement learning","authors":"Linyao Zheng , Meiqin Liu , Senlin Zhang , Shanling Dong","doi":"10.1016/j.inffus.2025.103797","DOIUrl":null,"url":null,"abstract":"<div><div>Modern underwater acoustic sensor networks (UASNs), as vital infrastructure for marine surveillance, face dual challenges in energy-efficient sensor scheduling and correlation-aware data fusion for underewater target tracking under resource-constrained conditions. Existing UASNs-based target tracking methods suffer from key limitations, including environment-dependent scheduling with poor adaptability, reliance on predefined correlation models for multi-sensor fusion, and the separate optimization of inherently coupled tasks. To address these issues, we develop a cooperative deep reinforcement learning (CDRL)-based framework for underwater target tracking that performs joint optimization through coordinated policy design. In this framework, a scheduling agent adaptively selects energy-efficient sensing platforms under dynamic conditions, while a fusion agent implements a model-free strategy to alleviate the need for precise correlation models. Both agents are trained using Proximal Policy Optimization (PPO) within a multi-agent coordinate architecture equipped with a global critic, enabling collaborative decision-making across tasks. In addition, a mock data method is introduced to reduce reliance on accurate ground truth, enhancing robustness against non-cooperative targets. Numerical simulation and real-world experiment confirm that the proposed framework consistently outperforms conventional approaches, achieving no less than a 15<span><math><mo>%</mo></math></span> improvement in energy efficiency.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"127 ","pages":"Article 103797"},"PeriodicalIF":15.5000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525008590","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Modern underwater acoustic sensor networks (UASNs), as vital infrastructure for marine surveillance, face dual challenges in energy-efficient sensor scheduling and correlation-aware data fusion for underewater target tracking under resource-constrained conditions. Existing UASNs-based target tracking methods suffer from key limitations, including environment-dependent scheduling with poor adaptability, reliance on predefined correlation models for multi-sensor fusion, and the separate optimization of inherently coupled tasks. To address these issues, we develop a cooperative deep reinforcement learning (CDRL)-based framework for underwater target tracking that performs joint optimization through coordinated policy design. In this framework, a scheduling agent adaptively selects energy-efficient sensing platforms under dynamic conditions, while a fusion agent implements a model-free strategy to alleviate the need for precise correlation models. Both agents are trained using Proximal Policy Optimization (PPO) within a multi-agent coordinate architecture equipped with a global critic, enabling collaborative decision-making across tasks. In addition, a mock data method is introduced to reduce reliance on accurate ground truth, enhancing robustness against non-cooperative targets. Numerical simulation and real-world experiment confirm that the proposed framework consistently outperforms conventional approaches, achieving no less than a 15

%

improvement in energy efficiency.

查看原文本刊更多论文

一种基于协同深度强化学习的水下目标跟踪方法

现代水声传感器网络作为海洋监测的重要基础设施，在资源受限条件下，面临着传感器节能调度和相关感知数据融合的双重挑战。现有的基于usns的目标跟踪方法存在环境依赖调度、适应性差、多传感器融合依赖于预定义的相关模型以及固有耦合任务的单独优化等主要局限性。为了解决这些问题，我们开发了一个基于合作深度强化学习（CDRL）的水下目标跟踪框架，该框架通过协调策略设计进行联合优化。在该框架中，调度智能体在动态条件下自适应选择节能感知平台，融合智能体采用无模型策略，减少了对精确关联模型的需求。这两个智能体都使用多智能体坐标体系结构中的近端策略优化（PPO）进行训练，该体系结构配备了全局评论家，从而实现了跨任务的协作决策。此外，还引入了模拟数据方法，以减少对准确地面真值的依赖，增强对非合作目标的鲁棒性。数值模拟和现实世界的实验证实，所提出的框架始终优于传统方法，实现了不少于15%的能源效率提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Fusion 工程技术-计算机：理论方法

CiteScore

33.20

自引率

4.30%

发文量

161

审稿时长

7.9 months

期刊介绍： Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.