基于强化学习的水下目标运动自动分析

IF 5.5 2区工程技术 Q1 ENGINEERING, CIVIL

Ocean Engineering Pub Date : 2025-10-03 DOI:10.1016/j.oceaneng.2025.122946

Seunghwan Jang , Junha Shin , Dasol Kim , Juhyun Lee , Hyunsuk Ko

{"title":"基于强化学习的水下目标运动自动分析","authors":"Seunghwan Jang , Junha Shin , Dasol Kim , Juhyun Lee , Hyunsuk Ko","doi":"10.1016/j.oceaneng.2025.122946","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an automated target motion analysis (TMA) framework that leverages deep reinforcement learning (DRL) to enhance the accuracy and reliability of target state estimation from SONAR-derived bearing-only measurements in underwater environments. Traditional TMA methods-such as the manual 10-point divider and batch estimation-rely heavily on operator expertise and are susceptible to inaccuracies due to environmental noise and human error. To address these limitations, we employ a Proximal Policy Optimization (PPO)-based agent to automatically and robustly estimate the target speed. A customized TMA simulator was developed to generate diverse underwater scenarios, incorporating variations in target motion and noise levels to ensure the model’s generalization capability. The PPO agent learns to infer target speed directly from sequential bearing data, achieving a strong balance between exploration and exploitation. Experimental results demonstrate that the trained agent provides highly accurate and robust speed estimates, even under realistic noise conditions. This work contributes to the advancement of autonomous maritime surveillance and defense systems by significantly reducing human dependency and improving operational reliability.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"342 ","pages":"Article 122946"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning-based automated target motion analysis in underwater environments\",\"authors\":\"Seunghwan Jang , Junha Shin , Dasol Kim , Juhyun Lee , Hyunsuk Ko\",\"doi\":\"10.1016/j.oceaneng.2025.122946\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study presents an automated target motion analysis (TMA) framework that leverages deep reinforcement learning (DRL) to enhance the accuracy and reliability of target state estimation from SONAR-derived bearing-only measurements in underwater environments. Traditional TMA methods-such as the manual 10-point divider and batch estimation-rely heavily on operator expertise and are susceptible to inaccuracies due to environmental noise and human error. To address these limitations, we employ a Proximal Policy Optimization (PPO)-based agent to automatically and robustly estimate the target speed. A customized TMA simulator was developed to generate diverse underwater scenarios, incorporating variations in target motion and noise levels to ensure the model’s generalization capability. The PPO agent learns to infer target speed directly from sequential bearing data, achieving a strong balance between exploration and exploitation. Experimental results demonstrate that the trained agent provides highly accurate and robust speed estimates, even under realistic noise conditions. This work contributes to the advancement of autonomous maritime surveillance and defense systems by significantly reducing human dependency and improving operational reliability.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"342 \",\"pages\":\"Article 122946\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825026290\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825026290","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}

引用次数: 0

摘要

本研究提出了一个自动目标运动分析（TMA）框架，该框架利用深度强化学习（DRL）来提高水下环境中基于声纳的纯方位测量的目标状态估计的准确性和可靠性。传统的TMA方法，如手动10点分配器和批量估计，严重依赖于操作员的专业知识，容易受到环境噪声和人为错误的影响。为了解决这些限制，我们使用了一个基于近端策略优化（PPO）的代理来自动和鲁棒地估计目标速度。开发了一个定制的TMA模拟器来生成不同的水下场景，包括目标运动和噪声水平的变化，以确保模型的泛化能力。PPO智能体学习直接从顺序方位数据推断目标速度，在勘探和开采之间实现了强有力的平衡。实验结果表明，即使在实际噪声条件下，训练后的智能体也能提供高度准确和鲁棒的速度估计。这项工作通过显著减少对人的依赖和提高操作可靠性，有助于自主海上监视和防御系统的发展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Reinforcement learning-based automated target motion analysis in underwater environments

This study presents an automated target motion analysis (TMA) framework that leverages deep reinforcement learning (DRL) to enhance the accuracy and reliability of target state estimation from SONAR-derived bearing-only measurements in underwater environments. Traditional TMA methods-such as the manual 10-point divider and batch estimation-rely heavily on operator expertise and are susceptible to inaccuracies due to environmental noise and human error. To address these limitations, we employ a Proximal Policy Optimization (PPO)-based agent to automatically and robustly estimate the target speed. A customized TMA simulator was developed to generate diverse underwater scenarios, incorporating variations in target motion and noise levels to ensure the model’s generalization capability. The PPO agent learns to infer target speed directly from sequential bearing data, achieving a strong balance between exploration and exploitation. Experimental results demonstrate that the trained agent provides highly accurate and robust speed estimates, even under realistic noise conditions. This work contributes to the advancement of autonomous maritime surveillance and defense systems by significantly reducing human dependency and improving operational reliability.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ocean Engineering 工程技术-工程：大洋

CiteScore

7.30

自引率

34.00%

发文量

2379

审稿时长

8.1 months

期刊介绍： Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.