Identification of the Driver's Interest Point using a Head Pose Trajectory for Situated Dialog Systems

Proceedings of the 16th International Conference on Multimodal Interaction. Pub Date: 2014-11-12. DOI: 10.1145/2663204.2663230

Young-Ho Kim, Teruhisa Misu

Citations: 8

Abstract

This paper addresses problems in situated language understanding in a moving car. In particular, we propose a method for understanding user queries about specific target buildings in the surroundings, based on the driver's head pose and speech information. To pick out the head pose motion that relates to the user query from among the spontaneous motions that occur while driving, we construct a model that describes the relationship between sequences of a driver's head pose and the relative direction to an interest point, using Gaussian process regression. We also account for a time-varying interest point using kernel density estimation. We collected situated queries from subject drivers using our research system embedded in a real car. The proposed method improves the target identification rate by 14% in the user-independent training condition and by 27% in the user-dependent training condition over a method that uses the head motion at the start-of-speech timing.
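As a rough illustration of the kernel density estimation idea only, the sketch below scores candidate buildings by how densely a window of head yaw samples clusters around each building's relative bearing. It is not the authors' implementation: the function names, the bandwidth, the yaw samples, and the candidate bearings are all hypothetical, and the Gaussian process regression step described in the abstract is omitted.

```python
import math


def gaussian_kde(samples, bandwidth):
    """Return a 1-D Gaussian kernel density estimate over samples (degrees)."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        # Sum one Gaussian bump per sample, then normalise.
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density


def pick_target(yaw_samples, candidate_bearings, bandwidth=5.0):
    """Pick the candidate whose bearing has the highest estimated density."""
    density = gaussian_kde(yaw_samples, bandwidth)
    return max(candidate_bearings, key=lambda name: density(candidate_bearings[name]))


# Hypothetical head yaw trajectory (degrees) around the time of a query:
# mostly road-ahead glances near 0, with a cluster of glances near 30.
yaw = [2.0, 1.0, 28.0, 31.0, 30.0, 29.0, 3.0]

# Hypothetical relative bearings (degrees) of nearby buildings.
candidates = {"cafe": 30.0, "bank": -20.0, "museum": 0.0}

best = pick_target(yaw, candidates)
print(best)
```

Using the whole window rather than the single yaw value at the start of speech is what lets the glance cluster outweigh the road-ahead samples in this toy example; the bandwidth of 5 degrees is an arbitrary choice for the illustration.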


