An exploration of gesture-speech multimodal patterns for touch interfaces

Prasenjit Dey, S. Madhvanath, Amit Ranjan, Suvodeep Das
{"title":"触摸界面的手势-语音多模态模式的探索","authors":"Prasenjit Dey, S. Madhvanath, Amit Ranjan, Suvodeep Das","doi":"10.1145/2407796.2407808","DOIUrl":null,"url":null,"abstract":"Multimodal interfaces that integrate multiple input modalities such as speech, gestures, gaze, and so on have shown considerable promise in terms of higher task efficiency, lower error rates and higher user satisfaction. However, the adoption of such interfaces for real-world systems has proved to be slow, and the reasons may be both technological (e.g. accuracy of recognition engines, fusion engines, authoring) as well as usability-related. In this paper, we explore a few patterns of \"command and control\" style multimodal interaction (MMI) using touch gestures and short speech utterances. We then describe a multimodal interface for a photo browsing application and a user study to understand some of the usability issues with such interfaces. Specifically, we study walk-up use of multimodal commands for photo manipulations, and compare this with unimodal multi-touch interactions. We observe that there is a learning period after which the user gets more comfortable with the multimodal commands, and the average task completions times reduce significantly. We also analyze temporal integration patterns of speech and touch gestures. We see this as the first of many studies leading to more detailed understanding of user preferences and performance for using MMI, which can help inform the judicious use of MMI in designing interactions for future interfaces.","PeriodicalId":179432,"journal":{"name":"Proceedings of the 3rd Indian Conference on Human-Computer Interaction","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"An exploration of gesture-speech multimodal patterns for touch interfaces\",\"authors\":\"Prasenjit Dey, S. Madhvanath, Amit Ranjan, Suvodeep Das\",\"doi\":\"10.1145/2407796.2407808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multimodal interfaces that integrate multiple input modalities such as speech, gestures, gaze, and so on have shown considerable promise in terms of higher task efficiency, lower error rates and higher user satisfaction. However, the adoption of such interfaces for real-world systems has proved to be slow, and the reasons may be both technological (e.g. accuracy of recognition engines, fusion engines, authoring) as well as usability-related. In this paper, we explore a few patterns of \\\"command and control\\\" style multimodal interaction (MMI) using touch gestures and short speech utterances. We then describe a multimodal interface for a photo browsing application and a user study to understand some of the usability issues with such interfaces. Specifically, we study walk-up use of multimodal commands for photo manipulations, and compare this with unimodal multi-touch interactions. We observe that there is a learning period after which the user gets more comfortable with the multimodal commands, and the average task completions times reduce significantly. We also analyze temporal integration patterns of speech and touch gestures. 
We see this as the first of many studies leading to more detailed understanding of user preferences and performance for using MMI, which can help inform the judicious use of MMI in designing interactions for future interfaces.\",\"PeriodicalId\":179432,\"journal\":{\"name\":\"Proceedings of the 3rd Indian Conference on Human-Computer Interaction\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd Indian Conference on Human-Computer Interaction\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2407796.2407808\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd Indian Conference on Human-Computer Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2407796.2407808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 7

Abstract

Multimodal interfaces that integrate multiple input modalities such as speech, gestures, gaze, and so on have shown considerable promise in terms of higher task efficiency, lower error rates and higher user satisfaction. However, the adoption of such interfaces for real-world systems has proved to be slow, and the reasons may be both technological (e.g. accuracy of recognition engines, fusion engines, authoring) and usability-related. In this paper, we explore a few patterns of "command and control" style multimodal interaction (MMI) using touch gestures and short speech utterances. We then describe a multimodal interface for a photo browsing application and a user study to understand some of the usability issues with such interfaces. Specifically, we study walk-up use of multimodal commands for photo manipulations, and compare this with unimodal multi-touch interactions. We observe that there is a learning period after which the user gets more comfortable with the multimodal commands, and the average task completion times reduce significantly. We also analyze temporal integration patterns of speech and touch gestures. We see this as the first of many studies leading to more detailed understanding of user preferences and performance for using MMI, which can help inform the judicious use of MMI in designing interactions for future interfaces.
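
The abstract does not spell out how gesture and speech are combined, but the "command and control" pattern it studies can be illustrated with a minimal sketch: a touch gesture supplies the target, a short utterance supplies the action, and the two are fused into one command only if they fall within a temporal integration window. Everything below (the event types, the `fuse` function, and the 1.5 s window) is an illustrative assumption, not the authors' implementation.

```python
# Hypothetical sketch of time-window fusion for a gesture-speech command,
# in the spirit of the "command and control" MMI pattern described above.
# Names and the 1.5 s window are assumptions, not from the paper.
from dataclasses import dataclass
from typing import Optional

@dataclass
class TouchGesture:
    target: str        # e.g. the photo touched on screen
    timestamp: float   # seconds

@dataclass
class SpeechCommand:
    action: str        # e.g. "rotate", "delete", "share"
    timestamp: float   # seconds

INTEGRATION_WINDOW = 1.5  # assumed max gap between the two modalities, seconds

def fuse(gesture: TouchGesture, speech: SpeechCommand) -> Optional[dict]:
    """Merge a gesture and an utterance into one multimodal command if they
    are close enough in time, regardless of which modality came first."""
    if abs(gesture.timestamp - speech.timestamp) <= INTEGRATION_WINDOW:
        return {"action": speech.action, "target": gesture.target}
    return None  # otherwise treat them as unrelated unimodal inputs

# Example: the user touches a photo and says "rotate" 0.4 s later.
command = fuse(TouchGesture("photo_042", 10.0), SpeechCommand("rotate", 10.4))
print(command)  # {'action': 'rotate', 'target': 'photo_042'}
```

A fixed symmetric window is only one possible policy; the paper's analysis of temporal integration patterns (e.g. whether speech tends to precede or follow the gesture) is exactly the kind of evidence that would inform how such a window should be tuned.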