Training a Robot to Attend a Person at Specific Locations using Soft Actor-Critic under Simulated Environment

2021 IEEE/SICE International Symposium on System Integration (SII) Pub Date : 2021-01-11 DOI:10.1109/IEEECONF49454.2021.9382716

Chandra Kusuma Dewa, J. Miura

引用次数: 1

Abstract

We present the application of soft actor-critic (SAC) learning algorithm to train a mobile robot to attend a target person at specific locations inside a Gazebo simulator. Since our previous study confirmed that the appropriate attending position while the target person is standing or walking is at his left or his right side, we design a novel U-shaped reward function behind the target person’s position with respect to the robot’s position. To make the robot can better portray the surroundings, we also propose a novel SAC architecture which employs 1D convolutional neural networks to extract features from laser scans automatically during the training process. Our preliminary experiment result shows that the robot is able to attend the target person at the designed location using our proposed reward function and SAC architecture.

查看原文本刊更多论文

在模拟环境下，使用软演员评论家训练机器人在特定地点照顾人

我们提出了应用软演员评论家(SAC)学习算法来训练移动机器人在Gazebo模拟器内的特定位置参加目标人。由于我们之前的研究证实，当目标人站立或行走时，适当的出席位置是在他的左侧或右侧，我们设计了一个新颖的u形奖励函数，在目标人的位置后面，相对于机器人的位置。为了使机器人能够更好地描绘周围环境，我们还提出了一种新的SAC架构，该架构采用1D卷积神经网络在训练过程中自动从激光扫描中提取特征。我们的初步实验结果表明，利用我们提出的奖励函数和SAC结构，机器人能够在设计的位置参加目标人。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/SICE International Symposium on System Integration (SII)

自引率

0.00%

发文量