{"title":"一种多模式人机交互中多感官数据融合的新方法","authors":"Yong Sun, Fang Chen, Yu Shi, Yuk Ying Chung","doi":"10.1145/1228175.1228257","DOIUrl":null,"url":null,"abstract":"Multimodal User Interaction (MMUI) technology aims at building natural and intuitive interfaces allowing a user to interact with computer in a way similar to human-to-human communication, for example, through speech and gestures. As a critical component in MMUI, Multimodal Input Fusion explores ways to effectively interpret the combined semantic interpretation of user inputs through multiple modalities. This paper presents a novel approach to multi-sensory data fusion based on speech and manual deictic gesture inputs. The effectiveness of the technique has been validated through experiments, using a traffic incident management scenario where an operator interacts with a map on a large display at a distance and issues multimodal commands through speech and manual gestures. The description of the proposed approach and preliminary experiment results are presented.","PeriodicalId":164924,"journal":{"name":"Proceedings of the 18th Australia conference on Computer-Human Interaction: Design: Activities, Artefacts and Environments","volume":"105 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":"{\"title\":\"A novel method for multi-sensory data fusion in multimodal human computer interaction\",\"authors\":\"Yong Sun, Fang Chen, Yu Shi, Yuk Ying Chung\",\"doi\":\"10.1145/1228175.1228257\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multimodal User Interaction (MMUI) technology aims at building natural and intuitive interfaces allowing a user to interact with computer in a way similar to human-to-human communication, for example, through speech and gestures. As a critical component in MMUI, Multimodal Input Fusion explores ways to effectively interpret the combined semantic interpretation of user inputs through multiple modalities. This paper presents a novel approach to multi-sensory data fusion based on speech and manual deictic gesture inputs. The effectiveness of the technique has been validated through experiments, using a traffic incident management scenario where an operator interacts with a map on a large display at a distance and issues multimodal commands through speech and manual gestures. 
The description of the proposed approach and preliminary experiment results are presented.\",\"PeriodicalId\":164924,\"journal\":{\"name\":\"Proceedings of the 18th Australia conference on Computer-Human Interaction: Design: Activities, Artefacts and Environments\",\"volume\":\"105 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"18\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 18th Australia conference on Computer-Human Interaction: Design: Activities, Artefacts and Environments\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1228175.1228257\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 18th Australia conference on Computer-Human Interaction: Design: Activities, Artefacts and Environments","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1228175.1228257","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A novel method for multi-sensory data fusion in multimodal human computer interaction
Multimodal User Interaction (MMUI) technology aims to build natural and intuitive interfaces that allow a user to interact with a computer in a way similar to human-to-human communication, for example through speech and gestures. As a critical component of MMUI, multimodal input fusion explores ways to derive an effective combined semantic interpretation of user inputs issued through multiple modalities. This paper presents a novel approach to multi-sensory data fusion based on speech and manual deictic gesture inputs. The effectiveness of the technique has been validated through experiments using a traffic incident management scenario, in which an operator interacts with a map on a large display at a distance and issues multimodal commands through speech and manual gestures. A description of the proposed approach and preliminary experimental results are presented.
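The abstract does not describe the fusion mechanism itself, so the following is only a minimal illustrative sketch of the general problem, not the authors' method. A common baseline for speech-gesture fusion is to bind each spoken deictic reference ("this", "here") to the manual pointing gesture closest in time within a fixed window. All names, data structures, and the window size below are assumptions made for illustration (Python):

from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class SpeechToken:
    word: str     # recognized word, e.g. "this"
    t: float      # timestamp in seconds

@dataclass
class DeicticGesture:
    x: float      # pointed-at position on the map display
    y: float
    t: float      # timestamp in seconds

# Hypothetical deictic vocabulary and alignment window; the paper does not
# specify either value.
DEICTIC_WORDS = {"this", "that", "here", "there"}
WINDOW_S = 1.5

def fuse(tokens: List[SpeechToken],
         gestures: List[DeicticGesture]) -> List[Tuple[str, Optional[DeicticGesture]]]:
    """Bind each deictic word to the temporally closest gesture, if any."""
    bindings = []
    for tok in tokens:
        if tok.word.lower() not in DEICTIC_WORDS:
            continue
        best = min(gestures, key=lambda g: abs(g.t - tok.t), default=None)
        if best is not None and abs(best.t - tok.t) <= WINDOW_S:
            bindings.append((tok.word, best))   # reference resolved to a map location
        else:
            bindings.append((tok.word, None))   # unresolved deictic reference
    return bindings

For a spoken command such as "close this lane" accompanied by a pointing gesture near the word "this", this baseline attaches the gesture's map coordinates to the lane-closure command. The paper's actual fusion approach may differ substantially; the sketch only illustrates the class of problem the abstract describes.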