Seunghwan Jang , Junha Shin , Dasol Kim , Juhyun Lee , Hyunsuk Ko
{"title":"基于强化学习的水下目标运动自动分析","authors":"Seunghwan Jang , Junha Shin , Dasol Kim , Juhyun Lee , Hyunsuk Ko","doi":"10.1016/j.oceaneng.2025.122946","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an automated target motion analysis (TMA) framework that leverages deep reinforcement learning (DRL) to enhance the accuracy and reliability of target state estimation from SONAR-derived bearing-only measurements in underwater environments. Traditional TMA methods-such as the manual 10-point divider and batch estimation-rely heavily on operator expertise and are susceptible to inaccuracies due to environmental noise and human error. To address these limitations, we employ a Proximal Policy Optimization (PPO)-based agent to automatically and robustly estimate the target speed. A customized TMA simulator was developed to generate diverse underwater scenarios, incorporating variations in target motion and noise levels to ensure the model’s generalization capability. The PPO agent learns to infer target speed directly from sequential bearing data, achieving a strong balance between exploration and exploitation. Experimental results demonstrate that the trained agent provides highly accurate and robust speed estimates, even under realistic noise conditions. This work contributes to the advancement of autonomous maritime surveillance and defense systems by significantly reducing human dependency and improving operational reliability.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"342 ","pages":"Article 122946"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement learning-based automated target motion analysis in underwater environments\",\"authors\":\"Seunghwan Jang , Junha Shin , Dasol Kim , Juhyun Lee , Hyunsuk Ko\",\"doi\":\"10.1016/j.oceaneng.2025.122946\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study presents an automated target motion analysis (TMA) framework that leverages deep reinforcement learning (DRL) to enhance the accuracy and reliability of target state estimation from SONAR-derived bearing-only measurements in underwater environments. Traditional TMA methods-such as the manual 10-point divider and batch estimation-rely heavily on operator expertise and are susceptible to inaccuracies due to environmental noise and human error. To address these limitations, we employ a Proximal Policy Optimization (PPO)-based agent to automatically and robustly estimate the target speed. A customized TMA simulator was developed to generate diverse underwater scenarios, incorporating variations in target motion and noise levels to ensure the model’s generalization capability. The PPO agent learns to infer target speed directly from sequential bearing data, achieving a strong balance between exploration and exploitation. Experimental results demonstrate that the trained agent provides highly accurate and robust speed estimates, even under realistic noise conditions. This work contributes to the advancement of autonomous maritime surveillance and defense systems by significantly reducing human dependency and improving operational reliability.</div></div>\",\"PeriodicalId\":19403,\"journal\":{\"name\":\"Ocean Engineering\",\"volume\":\"342 \",\"pages\":\"Article 122946\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ocean Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0029801825026290\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825026290","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Reinforcement learning-based automated target motion analysis in underwater environments
This study presents an automated target motion analysis (TMA) framework that leverages deep reinforcement learning (DRL) to enhance the accuracy and reliability of target state estimation from SONAR-derived bearing-only measurements in underwater environments. Traditional TMA methods-such as the manual 10-point divider and batch estimation-rely heavily on operator expertise and are susceptible to inaccuracies due to environmental noise and human error. To address these limitations, we employ a Proximal Policy Optimization (PPO)-based agent to automatically and robustly estimate the target speed. A customized TMA simulator was developed to generate diverse underwater scenarios, incorporating variations in target motion and noise levels to ensure the model’s generalization capability. The PPO agent learns to infer target speed directly from sequential bearing data, achieving a strong balance between exploration and exploitation. Experimental results demonstrate that the trained agent provides highly accurate and robust speed estimates, even under realistic noise conditions. This work contributes to the advancement of autonomous maritime surveillance and defense systems by significantly reducing human dependency and improving operational reliability.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.