Work Process Transfer Reinforcement Learning: Feature Extraction and Finetuning in Ship Collision Avoidance

Volume 2: 42nd Computers and Information in Engineering Conference (CIE) Pub Date : 2022-08-14 DOI:10.1115/detc2022-91145

Xinrui Wang, Yan Jin

{"title":"Work Process Transfer Reinforcement Learning: Feature Extraction and Finetuning in Ship Collision Avoidance","authors":"Xinrui Wang, Yan Jin","doi":"10.1115/detc2022-91145","DOIUrl":null,"url":null,"abstract":"\n The advancement of artificial intelligence and machine learning technologies has led to significant changes in work processes. The computer agents are applied to perform not only routine and repetitive jobs but also highly complex tasks such as driving a car and steering a ship. Given the sensory information of the environment, a reinforcement learning method has been applied for agents to learn how to perform complex tasks by trial and error through interactions with the environment. To overcome the issues such as limited and sparse training data, researchers are attempting to reuse the previously learned knowledge in new task situations. In this paper, we investigate how feature extraction and finetuning methods can be combined to allow computer agents to perform transfer reinforcement learning more effectively and efficiently in the context of ship collision avoidance. Taking a computer simulation-based empirical approach, we first develop a ship collision avoidance gameplay environment by introducing the own ship, target ships, and the base case and target cases. A deep neural network including four convolutional layers and three fully connected layers is devised for work process feature capturing through deep reinforcement learning. The case study results have shown that features do exist in work processes, and they can be captured and reused. The similarity between the source case and the target case is a key factor that determines how the feature extract and finetuning methods should be combined for effective task results and efficient learning processes.","PeriodicalId":382970,"journal":{"name":"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2022-91145","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The advancement of artificial intelligence and machine learning technologies has led to significant changes in work processes. The computer agents are applied to perform not only routine and repetitive jobs but also highly complex tasks such as driving a car and steering a ship. Given the sensory information of the environment, a reinforcement learning method has been applied for agents to learn how to perform complex tasks by trial and error through interactions with the environment. To overcome the issues such as limited and sparse training data, researchers are attempting to reuse the previously learned knowledge in new task situations. In this paper, we investigate how feature extraction and finetuning methods can be combined to allow computer agents to perform transfer reinforcement learning more effectively and efficiently in the context of ship collision avoidance. Taking a computer simulation-based empirical approach, we first develop a ship collision avoidance gameplay environment by introducing the own ship, target ships, and the base case and target cases. A deep neural network including four convolutional layers and three fully connected layers is devised for work process feature capturing through deep reinforcement learning. The case study results have shown that features do exist in work processes, and they can be captured and reused. The similarity between the source case and the target case is a key factor that determines how the feature extract and finetuning methods should be combined for effective task results and efficient learning processes.

查看原文本刊更多论文

工作过程迁移强化学习:船舶避碰的特征提取与微调

人工智能和机器学习技术的进步导致了工作流程的重大变化。计算机代理不仅用于执行常规和重复性的工作，而且还用于执行高度复杂的任务，如驾驶汽车和掌舵。给定环境的感官信息，一种强化学习方法被应用于智能体，通过与环境的相互作用，通过试错来学习如何执行复杂的任务。为了克服训练数据有限和稀疏等问题，研究人员试图在新的任务情境中重用以前学习过的知识。在本文中，我们研究了如何将特征提取和微调方法相结合，以允许计算机代理在船舶避撞的背景下更有效地执行转移强化学习。采用基于计算机模拟的经验方法，我们首先通过引入自己的船、目标船、基本情况和目标情况来开发船舶避碰游戏环境。设计了一个包含4个卷积层和3个全连接层的深度神经网络，通过深度强化学习捕获工作过程特征。案例研究结果表明，功能确实存在于工作流程中，并且它们可以被捕获和重用。源案例和目标案例之间的相似性是决定如何将特征提取和微调方法结合起来以获得有效的任务结果和高效的学习过程的关键因素。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Volume 2: 42nd Computers and Information in Engineering Conference (CIE)

自引率

0.00%

发文量