Model-based deep reinforcement learning for data-driven motion control of an under-actuated unmanned surface vehicle: Path following and trajectory tracking

IF 4.2 3区 计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS
Zhouhua Peng , Enrong Liu , Chao Pan , Haoliang Wang , Dan Wang , Lu Liu
{"title":"Model-based deep reinforcement learning for data-driven motion control of an under-actuated unmanned surface vehicle: Path following and trajectory tracking","authors":"Zhouhua Peng ,&nbsp;Enrong Liu ,&nbsp;Chao Pan ,&nbsp;Haoliang Wang ,&nbsp;Dan Wang ,&nbsp;Lu Liu","doi":"10.1016/j.jfranklin.2022.10.020","DOIUrl":null,"url":null,"abstract":"<div><p><span><span>Unmanned surface vehicles (USVs) are a promising marine robotic platform<span><span> for numerous potential applications in ocean space due to their small size, low cost, and high autonomy. Modelling and control of USVs is a challenging task due to their intrinsic nonlinearities, strong couplings, high uncertainty, under-actuation, and multiple constraints. Well designed motion controllers may not be effective when exposed in the complex and dynamic sea environment. The paper presents a fully data-driven learning-based motion </span>control method for an USV based on model-based </span></span>deep reinforcement learning<span><span>. Specifically, we first train a data-driven prediction model based on a deep network for the USV by using recorded input and output data. Based on the learned prediction model, model predictive motion controllers are presented for achieving trajectory tracking and path following tasks. It is shown that after learning with random data collected from the USV, the proposed data-driven motion controller is able to follow trajectories or parameterized paths accurately with excellent </span>sample efficiency. Simulation results are given to illustrate the proposed deep reinforcement learning scheme for fully data-driven motion control without any </span></span><em>a priori</em> model information of the USV.</p></div>","PeriodicalId":17283,"journal":{"name":"Journal of The Franklin Institute-engineering and Applied Mathematics","volume":"360 6","pages":"Pages 4399-4426"},"PeriodicalIF":4.2000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Franklin Institute-engineering and Applied Mathematics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016003222007463","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 2

Abstract

Unmanned surface vehicles (USVs) are a promising marine robotic platform for numerous potential applications in ocean space due to their small size, low cost, and high autonomy. Modelling and control of USVs is a challenging task due to their intrinsic nonlinearities, strong couplings, high uncertainty, under-actuation, and multiple constraints. Well designed motion controllers may not be effective when exposed in the complex and dynamic sea environment. The paper presents a fully data-driven learning-based motion control method for an USV based on model-based deep reinforcement learning. Specifically, we first train a data-driven prediction model based on a deep network for the USV by using recorded input and output data. Based on the learned prediction model, model predictive motion controllers are presented for achieving trajectory tracking and path following tasks. It is shown that after learning with random data collected from the USV, the proposed data-driven motion controller is able to follow trajectories or parameterized paths accurately with excellent sample efficiency. Simulation results are given to illustrate the proposed deep reinforcement learning scheme for fully data-driven motion control without any a priori model information of the USV.

欠驱动无人水面车辆数据驱动运动控制的基于模型的深度强化学习:路径跟踪和轨迹跟踪
无人水面飞行器(USV)由于其体积小、成本低、自主性强,是一种很有前途的海洋机器人平台,在海洋空间中有许多潜在的应用。USV的建模和控制是一项具有挑战性的任务,因为其固有的非线性、强耦合、高不确定性、欠驱动和多重约束。当暴露在复杂和动态的海洋环境中时,精心设计的运动控制器可能是无效的。本文提出了一种基于基于模型的深度强化学习的USV运动控制方法。具体而言,我们首先通过使用记录的输入和输出数据,为USV训练基于深度网络的数据驱动预测模型。在学习预测模型的基础上,提出了用于实现轨迹跟踪和路径跟踪任务的模型预测运动控制器。结果表明,在利用从USV收集的随机数据进行学习后,所提出的数据驱动运动控制器能够以优异的采样效率准确地跟踪轨迹或参数化路径。仿真结果表明,所提出的深度强化学习方案用于完全数据驱动的运动控制,而无需USV的任何先验模型信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.30
自引率
14.60%
发文量
586
审稿时长
6.9 months
期刊介绍: The Journal of The Franklin Institute has an established reputation for publishing high-quality papers in the field of engineering and applied mathematics. Its current focus is on control systems, complex networks and dynamic systems, signal processing and communications and their applications. All submitted papers are peer-reviewed. The Journal will publish original research papers and research review papers of substance. Papers and special focus issues are judged upon possible lasting value, which has been and continues to be the strength of the Journal of The Franklin Institute.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信