Model-based deep reinforcement learning for data-driven motion control of an under-actuated unmanned surface vehicle: Path following and trajectory tracking

IF 4.2 3区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Journal of The Franklin Institute-engineering and Applied Mathematics Pub Date : 2023-04-01 DOI:10.1016/j.jfranklin.2022.10.020

Zhouhua Peng , Enrong Liu , Chao Pan , Haoliang Wang , Dan Wang , Lu Liu

{"title":"Model-based deep reinforcement learning for data-driven motion control of an under-actuated unmanned surface vehicle: Path following and trajectory tracking","authors":"Zhouhua Peng , Enrong Liu , Chao Pan , Haoliang Wang , Dan Wang , Lu Liu","doi":"10.1016/j.jfranklin.2022.10.020","DOIUrl":null,"url":null,"abstract":"<div>Unmanned surface vehicles (USVs) are a promising marine robotic platform for numerous potential applications in ocean space due to their small size, low cost, and high autonomy. Modelling and control of USVs is a challenging task due to their intrinsic nonlinearities, strong couplings, high uncertainty, under-actuation, and multiple constraints. Well designed motion controllers may not be effective when exposed in the complex and dynamic sea environment. The paper presents a fully data-driven learning-based motion control method for an USV based on model-based deep reinforcement learning. Specifically, we first train a data-driven prediction model based on a deep network for the USV by using recorded input and output data. Based on the learned prediction model, model predictive motion controllers are presented for achieving trajectory tracking and path following tasks. It is shown that after learning with random data collected from the USV, the proposed data-driven motion controller is able to follow trajectories or parameterized paths accurately with excellent sample efficiency. Simulation results are given to illustrate the proposed deep reinforcement learning scheme for fully data-driven motion control without any a priori model information of the USV.</div>","PeriodicalId":17283,"journal":{"name":"Journal of The Franklin Institute-engineering and Applied Mathematics","volume":"360 6","pages":"Pages 4399-4426"},"PeriodicalIF":4.2000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Franklin Institute-engineering and Applied Mathematics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016003222007463","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 2

Abstract

Unmanned surface vehicles (USVs) are a promising marine robotic platform for numerous potential applications in ocean space due to their small size, low cost, and high autonomy. Modelling and control of USVs is a challenging task due to their intrinsic nonlinearities, strong couplings, high uncertainty, under-actuation, and multiple constraints. Well designed motion controllers may not be effective when exposed in the complex and dynamic sea environment. The paper presents a fully data-driven learning-based motion control method for an USV based on model-based deep reinforcement learning. Specifically, we first train a data-driven prediction model based on a deep network for the USV by using recorded input and output data. Based on the learned prediction model, model predictive motion controllers are presented for achieving trajectory tracking and path following tasks. It is shown that after learning with random data collected from the USV, the proposed data-driven motion controller is able to follow trajectories or parameterized paths accurately with excellent sample efficiency. Simulation results are given to illustrate the proposed deep reinforcement learning scheme for fully data-driven motion control without any a priori model information of the USV.

查看原文本刊更多论文

欠驱动无人水面车辆数据驱动运动控制的基于模型的深度强化学习:路径跟踪和轨迹跟踪

无人水面飞行器（USV）由于其体积小、成本低、自主性强，是一种很有前途的海洋机器人平台，在海洋空间中有许多潜在的应用。USV的建模和控制是一项具有挑战性的任务，因为其固有的非线性、强耦合、高不确定性、欠驱动和多重约束。当暴露在复杂和动态的海洋环境中时，精心设计的运动控制器可能是无效的。本文提出了一种基于基于模型的深度强化学习的USV运动控制方法。具体而言，我们首先通过使用记录的输入和输出数据，为USV训练基于深度网络的数据驱动预测模型。在学习预测模型的基础上，提出了用于实现轨迹跟踪和路径跟踪任务的模型预测运动控制器。结果表明，在利用从USV收集的随机数据进行学习后，所提出的数据驱动运动控制器能够以优异的采样效率准确地跟踪轨迹或参数化路径。仿真结果表明，所提出的深度强化学习方案用于完全数据驱动的运动控制，而无需USV的任何先验模型信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of The Franklin Institute-engineering and Applied Mathematics 工程技术-工程：电子与电气

CiteScore

7.30

自引率

14.60%

发文量

586

审稿时长

6.9 months

期刊介绍： The Journal of The Franklin Institute has an established reputation for publishing high-quality papers in the field of engineering and applied mathematics. Its current focus is on control systems, complex networks and dynamic systems, signal processing and communications and their applications. All submitted papers are peer-reviewed. The Journal will publish original research papers and research review papers of substance. Papers and special focus issues are judged upon possible lasting value, which has been and continues to be the strength of the Journal of The Franklin Institute.