{"title":"用于 UMV 抗噪运动预测的近端策略优化正则化最小二乘法算法","authors":"Yiming Zhong;Caoyang Yu;Xianbo Xiang;Lian Lian","doi":"10.1109/JOE.2024.3436770","DOIUrl":null,"url":null,"abstract":"To enhance the accuracy of motion prediction in unmanned marine vehicles (UMVs), an innovative proximal policy-optimized regularized least squares (PPO-RLS) algorithm is proposed in this article. This article begins by developing a dynamics model for UMVs that incorporates viscous damping and external forces to minimize modeling errors. However, this model does not account for data noise, making accurate parameter identification difficult when using traditional least squares (LS) algorithms. To overcome this limitation, the PPO-RLS algorithm is proposed, incorporating a regularization term within the LS framework and utilizing proximal policy optimization for adaptive regularization term tuning. The performance of the PPO-RLS algorithm is thoroughly evaluated using both simulation data and lake trial data, demonstrating significant improvements over both the traditional LS algorithm and a state-of-the-art algorithm. Specifically, in simulation tests, the PPO-RLS algorithm achieves a notable reduction in root mean square error for surge velocity (8.49E-03 m/s) and heading angle (2.32\n<inline-formula><tex-math>$^\\circ$</tex-math></inline-formula>\n), markedly outperforming the LS algorithm (2.36E-02 m/s for surge velocity and 4.14\n<inline-formula><tex-math>$^\\circ$</tex-math></inline-formula>\n for heading angle). In addition, the PPO-RLS algorithm displays enhanced stability, as indicated by a more than 50% reduction in condition number (1.46E+04 for PPO-RLS versus 2.89E+06 for LS). These improvements are further validated by lake trial data, confirming the algorithm's advanced motion prediction capabilities with quantitatively lower errors and greater robustness.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"49 4","pages":"1397-1410"},"PeriodicalIF":3.8000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Proximal Policy-Optimized Regularized Least Squares Algorithm for Noise-Resilient Motion Prediction of UMVs\",\"authors\":\"Yiming Zhong;Caoyang Yu;Xianbo Xiang;Lian Lian\",\"doi\":\"10.1109/JOE.2024.3436770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To enhance the accuracy of motion prediction in unmanned marine vehicles (UMVs), an innovative proximal policy-optimized regularized least squares (PPO-RLS) algorithm is proposed in this article. This article begins by developing a dynamics model for UMVs that incorporates viscous damping and external forces to minimize modeling errors. However, this model does not account for data noise, making accurate parameter identification difficult when using traditional least squares (LS) algorithms. To overcome this limitation, the PPO-RLS algorithm is proposed, incorporating a regularization term within the LS framework and utilizing proximal policy optimization for adaptive regularization term tuning. The performance of the PPO-RLS algorithm is thoroughly evaluated using both simulation data and lake trial data, demonstrating significant improvements over both the traditional LS algorithm and a state-of-the-art algorithm. Specifically, in simulation tests, the PPO-RLS algorithm achieves a notable reduction in root mean square error for surge velocity (8.49E-03 m/s) and heading angle (2.32\\n<inline-formula><tex-math>$^\\\\circ$</tex-math></inline-formula>\\n), markedly outperforming the LS algorithm (2.36E-02 m/s for surge velocity and 4.14\\n<inline-formula><tex-math>$^\\\\circ$</tex-math></inline-formula>\\n for heading angle). In addition, the PPO-RLS algorithm displays enhanced stability, as indicated by a more than 50% reduction in condition number (1.46E+04 for PPO-RLS versus 2.89E+06 for LS). These improvements are further validated by lake trial data, confirming the algorithm's advanced motion prediction capabilities with quantitatively lower errors and greater robustness.\",\"PeriodicalId\":13191,\"journal\":{\"name\":\"IEEE Journal of Oceanic Engineering\",\"volume\":\"49 4\",\"pages\":\"1397-1410\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Oceanic Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10680456/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10680456/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
摘要
为了提高无人海洋航行器(UMV)运动预测的准确性,本文提出了一种创新的近端策略优化正则化最小二乘法(PPO-RLS)算法。本文首先为 UMV 建立了一个动力学模型,该模型结合了粘性阻尼和外力,以尽量减少建模误差。然而,该模型没有考虑数据噪声,因此在使用传统的最小二乘(LS)算法时很难准确识别参数。为了克服这一局限性,我们提出了 PPO-RLS 算法,该算法在 LS 框架中加入了正则项,并利用近似策略优化对正则项进行自适应调整。利用模拟数据和湖泊试验数据对 PPO-RLS 算法的性能进行了全面评估,结果表明该算法比传统的 LS 算法和最先进的算法都有显著改进。具体来说,在模拟测试中,PPO-RLS 算法显著降低了浪涌速度(8.49E-03 m/s)和航向角(2.32$^\circ$)的均方根误差,明显优于 LS 算法(浪涌速度为 2.36E-02 m/s,航向角为 4.14$^\circ$)。此外,PPO-RLS 算法显示出更强的稳定性,条件数减少了 50%以上(PPO-RLS 为 1.46E+04 而 LS 为 2.89E+06)。这些改进得到了湖泊试验数据的进一步验证,证实了该算法具有先进的运动预测能力,定量误差更低,鲁棒性更强。
Proximal Policy-Optimized Regularized Least Squares Algorithm for Noise-Resilient Motion Prediction of UMVs
To enhance the accuracy of motion prediction in unmanned marine vehicles (UMVs), an innovative proximal policy-optimized regularized least squares (PPO-RLS) algorithm is proposed in this article. This article begins by developing a dynamics model for UMVs that incorporates viscous damping and external forces to minimize modeling errors. However, this model does not account for data noise, making accurate parameter identification difficult when using traditional least squares (LS) algorithms. To overcome this limitation, the PPO-RLS algorithm is proposed, incorporating a regularization term within the LS framework and utilizing proximal policy optimization for adaptive regularization term tuning. The performance of the PPO-RLS algorithm is thoroughly evaluated using both simulation data and lake trial data, demonstrating significant improvements over both the traditional LS algorithm and a state-of-the-art algorithm. Specifically, in simulation tests, the PPO-RLS algorithm achieves a notable reduction in root mean square error for surge velocity (8.49E-03 m/s) and heading angle (2.32
$^\circ$
), markedly outperforming the LS algorithm (2.36E-02 m/s for surge velocity and 4.14
$^\circ$
for heading angle). In addition, the PPO-RLS algorithm displays enhanced stability, as indicated by a more than 50% reduction in condition number (1.46E+04 for PPO-RLS versus 2.89E+06 for LS). These improvements are further validated by lake trial data, confirming the algorithm's advanced motion prediction capabilities with quantitatively lower errors and greater robustness.
期刊介绍:
The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.