{"title":"Data Driven Non-Verbal Behavior Generation for Humanoid Robots","authors":"Taras Kucherenko","doi":"10.1145/3242969.3264970","DOIUrl":null,"url":null,"abstract":"Social robots need non-verbal behavior to make an interaction pleasant and efficient. Most of the models for generating non-verbal behavior are rule-based and hence can produce a limited set of motions and are tuned to a particular scenario. In contrast, data-driven systems are flexible and easily adjustable. Hence we aim to learn a data-driven model for generating non-verbal behavior (in a form of a 3D motion sequence) for humanoid robots. Our approach is based on a popular and powerful deep generative model: Variation Autoencoder (VAE). Input for our model will be multi-modal and we will iteratively increase its complexity: first, it will only use the speech signal, then also the text transcription and finally - the non-verbal behavior of the conversation partner. We will evaluate our system on the virtual avatars as well as on two humanoid robots with different embodiments: NAO and Furhat. Our model will be easily adapted to a novel domain: this can be done by providing application specific training data.","PeriodicalId":308751,"journal":{"name":"Proceedings of the 20th ACM International Conference on Multimodal Interaction","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 20th ACM International Conference on Multimodal Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3242969.3264970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
Social robots need non-verbal behavior to make interactions pleasant and efficient. Most models for generating non-verbal behavior are rule-based and hence produce a limited set of motions, tuned to a particular scenario. Data-driven systems, in contrast, are flexible and easily adjustable. We therefore aim to learn a data-driven model that generates non-verbal behavior (in the form of a 3D motion sequence) for humanoid robots. Our approach is based on a popular and powerful deep generative model: the Variational Autoencoder (VAE). The input to our model will be multimodal, and we will iteratively increase its complexity: first it will use only the speech signal, then also the text transcription, and finally the non-verbal behavior of the conversation partner. We will evaluate our system on virtual avatars as well as on two humanoid robots with different embodiments: NAO and Furhat. Our model can be easily adapted to a novel domain by providing application-specific training data.
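To make the VAE-based approach concrete, the following is a minimal sketch (not the paper's actual implementation) of a speech-conditioned VAE that decodes frames of 3D motion. All module names, feature dimensions, and layer sizes are assumptions for illustration; a real system would likely use sequence models over whole utterances rather than the per-frame mapping shown here.

```python
# Illustrative sketch only: a VAE mapping per-frame speech features to motion
# frames. Dimensions (speech_dim, motion_dim, latent_dim) are hypothetical.
import torch
import torch.nn as nn

class SpeechToMotionVAE(nn.Module):
    def __init__(self, speech_dim=26, motion_dim=45, latent_dim=32, hidden_dim=128):
        super().__init__()
        # Encoder: compress a speech-feature frame into latent statistics.
        self.encoder = nn.Sequential(
            nn.Linear(speech_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
        )
        self.to_mu = nn.Linear(hidden_dim, latent_dim)
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)
        # Decoder: map a latent sample to one frame of 3D joint parameters.
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, motion_dim),
        )

    def forward(self, speech):
        h = self.encoder(speech)
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        # Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, I).
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        return self.decoder(z), mu, logvar

def vae_loss(motion_pred, motion_true, mu, logvar):
    # Reconstruction term plus KL divergence to the standard-normal prior.
    recon = nn.functional.mse_loss(motion_pred, motion_true)
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl
```

The multimodal extension described in the abstract could, in this sketch, be realized by concatenating text-transcription and partner-behavior features to the speech input before encoding; that design choice is an assumption, not something specified in the abstract.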