{"title":"Human-robot interface based on the mutual assistance between speech and vision","authors":"Mitsutoshi Yoshizaki, Y. Kuno, A. Nakamura","doi":"10.1145/971478.971483","DOIUrl":null,"url":null,"abstract":"This paper presents a user interface for a service robot that can bring the objects asked by the user. Speech-based interface is appropriate for this application. However, it alone is not sufficient. The system needs a vision-based interface to recognize gestures as well. Moreover, it needs vision capabilities to obtain the real world information about the objects mentioned in the user's speech. For example, the robot needs to find the target object ordered by speech to carry out the task. This can be considered that vision assists speech. However, vision sometimes fails to detect the objects. Moreover, there are objects for which vision cannot be expected to work well. In these cases, the robot tells the current status to the user so that he/she can give advice by speech to the robot. This can be considered that speech assists vision through the user. This paper presents how the mutual assistance between speech and vision works and demonstrates promising results through experiments.","PeriodicalId":416822,"journal":{"name":"Workshop on Perceptive User Interfaces","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Perceptive User Interfaces","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/971478.971483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 9
Abstract
This paper presents a user interface for a service robot that can fetch objects requested by the user. A speech-based interface is appropriate for this application, but speech alone is not sufficient. The system also needs a vision-based interface to recognize gestures. Moreover, it needs vision capabilities to obtain real-world information about the objects mentioned in the user's speech; for example, the robot must locate the target object named in a spoken order before it can carry out the task. In this sense, vision assists speech. However, vision sometimes fails to detect objects, and there are objects for which vision cannot be expected to work well. In these cases, the robot reports its current status to the user, who can then advise the robot by speech. In this sense, speech assists vision through the user. This paper describes how this mutual assistance between speech and vision works and demonstrates promising results through experiments.
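To make the mutual-assistance loop concrete, the following is a minimal sketch of the control flow the abstract describes: speech gives vision its target, vision tries to detect it, and on failure the robot reports its status and asks the user for a spoken hint before retrying. This is an illustrative reconstruction, not the authors' implementation; all function names (recognize_order, detect_object, say, listen, fetch) are hypothetical placeholders, and the vision module is a toy stand-in.

```python
# Sketch of the speech-vision mutual-assistance loop described in the
# abstract. All functions are hypothetical placeholders standing in for
# the robot's actual speech and vision modules.

def recognize_order(utterance: str) -> str:
    """Extract the target object name from the user's spoken order."""
    # e.g. "Please bring me the red cup" -> "red cup"
    return utterance.lower().replace("please bring me the ", "").strip(" .")

def detect_object(target: str, hints: list[str]) -> bool:
    """Toy stand-in for the vision module: here detection 'succeeds' only
    after the user has supplied at least one spoken hint, simulating a
    case where vision alone fails (e.g. the object is occluded)."""
    return len(hints) > 0

def say(message: str) -> None:
    """Stand-in for speech synthesis: report the robot's status."""
    print(f"[robot] {message}")

def listen() -> str:
    """Stand-in for speech recognition: receive the user's advice."""
    return input("[user] ")

def fetch(utterance: str, max_attempts: int = 3) -> bool:
    target = recognize_order(utterance)       # speech sets vision's goal
    hints: list[str] = []
    for _ in range(max_attempts):
        if detect_object(target, hints):      # vision assists speech
            say(f"I found the {target}. Bringing it now.")
            return True
        # Vision failed: report status so the user can assist by speech.
        say(f"I cannot find the {target}. Can you tell me where it is?")
        hints.append(listen())                # speech assists vision
    say(f"Sorry, I still cannot find the {target}.")
    return False

if __name__ == "__main__":
    fetch("Please bring me the red cup")
```

The key design point is the feedback edge: instead of failing silently when detection is unreliable, the robot exposes its internal state to the user, whose spoken advice re-enters the vision pipeline as a search constraint.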