Training a Robot to Attend a Person at Specific Locations using Soft Actor-Critic under Simulated Environment

Chandra Kusuma Dewa, J. Miura
{"title":"Training a Robot to Attend a Person at Specific Locations using Soft Actor-Critic under Simulated Environment","authors":"Chandra Kusuma Dewa, J. Miura","doi":"10.1109/IEEECONF49454.2021.9382716","DOIUrl":null,"url":null,"abstract":"We present the application of soft actor-critic (SAC) learning algorithm to train a mobile robot to attend a target person at specific locations inside a Gazebo simulator. Since our previous study confirmed that the appropriate attending position while the target person is standing or walking is at his left or his right side, we design a novel U-shaped reward function behind the target person’s position with respect to the robot’s position. To make the robot can better portray the surroundings, we also propose a novel SAC architecture which employs 1D convolutional neural networks to extract features from laser scans automatically during the training process. Our preliminary experiment result shows that the robot is able to attend the target person at the designed location using our proposed reward function and SAC architecture.","PeriodicalId":395378,"journal":{"name":"2021 IEEE/SICE International Symposium on System Integration (SII)","volume":"39 10","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/SICE International Symposium on System Integration (SII)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEEECONF49454.2021.9382716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

We present the application of soft actor-critic (SAC) learning algorithm to train a mobile robot to attend a target person at specific locations inside a Gazebo simulator. Since our previous study confirmed that the appropriate attending position while the target person is standing or walking is at his left or his right side, we design a novel U-shaped reward function behind the target person’s position with respect to the robot’s position. To make the robot can better portray the surroundings, we also propose a novel SAC architecture which employs 1D convolutional neural networks to extract features from laser scans automatically during the training process. Our preliminary experiment result shows that the robot is able to attend the target person at the designed location using our proposed reward function and SAC architecture.
在模拟环境下,使用软演员评论家训练机器人在特定地点照顾人
我们提出了应用软演员评论家(SAC)学习算法来训练移动机器人在Gazebo模拟器内的特定位置参加目标人。由于我们之前的研究证实,当目标人站立或行走时,适当的出席位置是在他的左侧或右侧,我们设计了一个新颖的u形奖励函数,在目标人的位置后面,相对于机器人的位置。为了使机器人能够更好地描绘周围环境,我们还提出了一种新的SAC架构,该架构采用1D卷积神经网络在训练过程中自动从激光扫描中提取特征。我们的初步实验结果表明,利用我们提出的奖励函数和SAC结构,机器人能够在设计的位置参加目标人。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信