开发虚拟应用的语音输入:人的因素的观点

Int. J. People Oriented Program. Pub Date : 2011-07-01 DOI:10.4018/ijpop.2011070103

A. Stedmon, D. Howard, Christin Kirchhübel

{"title":"开发虚拟应用的语音输入:人的因素的观点","authors":"A. Stedmon, D. Howard, Christin Kirchhübel","doi":"10.4018/ijpop.2011070103","DOIUrl":null,"url":null,"abstract":"This paper contextualises the position of speech input from a user-centred human factors perspective. It is presented as a position paper so that researchers and designers can consider the underlying and future factors of a people-orientated approach to speech input for virtual applications. A number of key areas are explored including: human factors for speech input; speech input for virtual applications; speech as a spare mode of interaction; user acceptance and uptake; incorporating speech in the development of virtual applications; and speech input as an interaction tool. Given the user-centred perspective of this paper, this paper does not set out to address issues associated with spoken dialogue technologies, dialogue, and dialogue management; recent work on conversational agents in virtual environments; or multimodal interaction. This paper places the focus more fundamentally within human factors by looking at the user first as a basis for developing usable virtual applications incorporating speech input rather than to review the current state of the art in interaction design. A particular point this paper makes, however, is that speech input should be designed and used as another interaction tool that users need to learn to use, rather than assuming it will offer a natural or intuitive interface. DOI: 10.4018/ijpop.2011070103 International Journal of People-Oriented Programming, 1(2), 50-65, July-December 2011 51 Copyright © 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. there is a danger of designing systems that do not support user needs or expectations, and if systems evolve along purely interaction based principles then there is a danger of not embracing or exploiting new technological capabilities to their full. To date speech recognition has yet to be implemented in an original application (that has not used another form of input device before) and, as a consequence, it always faces the immediate challenge from more conventional input devices (e.g., keyboard and mouse) that users are familiar with using. Furthermore, if user expectations of speech input are too high, there is a danger that its potential will not be realised due to user frustration leading to poor uptake. The increased availability of speech input “brings with it the need for a full understanding of the ergonomics aspects of these systems with the aim of developing general guidelines to ensure their application should be as effective as possible” (Hapeshi & Jones, 1988, p. 252). Whilst the potential to interact with machines using speech input has been possible for many decades (Ullman, 1987) and technical advances, especially with the development of computing and distributed interaction technologies, allow for new approaches to people orientated interaction, speech input still remains an elusive concept without widespread use or end user acceptance (Stedmon et al., 2011). Alongside the point that user expectations might be higher than the technology can deliver, system interaction is often not as intuitive as it could be. With a lack of consideration for user needs, user satisfaction and the adoption of new technology might be hampered by fundamentally unusable technologies. It is only when system developers and users alike, have a greater awareness of the underlying issues of development and use, that such technologies can be designed that will embrace user needs and technological ability in a balanced manner. With a focus on design needs for usercentred and usable speech interfaces, along with a framework for virtual environment (VE) development, this paper argues that there has been little recent development in understanding generic human factors issues of speech input and even less in the specific area of human factors for speech input in virtual applications. Given the user-centred perspective of this paper and the aim of presenting arguments that transcend specific technologies or trends in solutions, this paper does not set out to address issues associated with natural and spoken dialogue technologies (Cohen, Giangola, & Bagola, 2004; Leuski & Traum, 2010), dialogue and dialogue management (Lemon, 2011) or recent work on embodied conversational agents in virtual applications (Traum, 2008). Furthermore, the technologies and methods underpinning multimodal interaction (Lee & Billinghurst, 2008), tangible or mobile interfaces (Billinghurst, Kato, & Myojin, 2009) is not a primary focus of this paper either. And whilst some researchers may consider it difficult to address the nature of speech-based interaction without addressing principles of dialogue and dialogue management, this paper seeks to place the focus more fundamentally within human factors by looking at the user and then seeking to develop usable virtual applications incorporating speech to support user interaction rather than to review the current state of the art in interaction design. This paper discusses the importance of speech input across a number of key areas: 1. Human factors issues in speech input, 2. The interaction potential of speech input, 3. Acceptance issues of speech input in virtual applications, 4. Designing virtual applications with speech input in mind, 5. Developing speech input for virtual applications, 6. Speech input as an interaction tool. 2. HUMAN FACTORS ISSUES IN SPEECH INPUT A number of reviews have been conducted that provide a valuable resource for researchers and systems designers alike. From these, human factors of speech input can be seen to have 14 more pages are available in the full version of this document, which may be purchased using the \"Add to Cart\" button on the product's webpage: www.igi-global.com/article/developing-speech-input-virtualapplications/72689?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Communications and Social Science, InfoSciTechnology Adoption, Ethics, and Human Computer Interaction eJournal Collection, InfoSci-Digital Marketing, EBusiness, and E-Services eJournal Collection, InfoSciNetworking, Mobile Applications, and Web Technologies eJournal Collection. Recommend this product to your","PeriodicalId":309154,"journal":{"name":"Int. J. People Oriented Program.","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing Speech Input for Virtual Applications: A Human Factors Perspective\",\"authors\":\"A. Stedmon, D. Howard, Christin Kirchhübel\",\"doi\":\"10.4018/ijpop.2011070103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper contextualises the position of speech input from a user-centred human factors perspective. It is presented as a position paper so that researchers and designers can consider the underlying and future factors of a people-orientated approach to speech input for virtual applications. A number of key areas are explored including: human factors for speech input; speech input for virtual applications; speech as a spare mode of interaction; user acceptance and uptake; incorporating speech in the development of virtual applications; and speech input as an interaction tool. Given the user-centred perspective of this paper, this paper does not set out to address issues associated with spoken dialogue technologies, dialogue, and dialogue management; recent work on conversational agents in virtual environments; or multimodal interaction. This paper places the focus more fundamentally within human factors by looking at the user first as a basis for developing usable virtual applications incorporating speech input rather than to review the current state of the art in interaction design. A particular point this paper makes, however, is that speech input should be designed and used as another interaction tool that users need to learn to use, rather than assuming it will offer a natural or intuitive interface. DOI: 10.4018/ijpop.2011070103 International Journal of People-Oriented Programming, 1(2), 50-65, July-December 2011 51 Copyright © 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. there is a danger of designing systems that do not support user needs or expectations, and if systems evolve along purely interaction based principles then there is a danger of not embracing or exploiting new technological capabilities to their full. To date speech recognition has yet to be implemented in an original application (that has not used another form of input device before) and, as a consequence, it always faces the immediate challenge from more conventional input devices (e.g., keyboard and mouse) that users are familiar with using. Furthermore, if user expectations of speech input are too high, there is a danger that its potential will not be realised due to user frustration leading to poor uptake. The increased availability of speech input “brings with it the need for a full understanding of the ergonomics aspects of these systems with the aim of developing general guidelines to ensure their application should be as effective as possible” (Hapeshi & Jones, 1988, p. 252). Whilst the potential to interact with machines using speech input has been possible for many decades (Ullman, 1987) and technical advances, especially with the development of computing and distributed interaction technologies, allow for new approaches to people orientated interaction, speech input still remains an elusive concept without widespread use or end user acceptance (Stedmon et al., 2011). Alongside the point that user expectations might be higher than the technology can deliver, system interaction is often not as intuitive as it could be. With a lack of consideration for user needs, user satisfaction and the adoption of new technology might be hampered by fundamentally unusable technologies. It is only when system developers and users alike, have a greater awareness of the underlying issues of development and use, that such technologies can be designed that will embrace user needs and technological ability in a balanced manner. With a focus on design needs for usercentred and usable speech interfaces, along with a framework for virtual environment (VE) development, this paper argues that there has been little recent development in understanding generic human factors issues of speech input and even less in the specific area of human factors for speech input in virtual applications. Given the user-centred perspective of this paper and the aim of presenting arguments that transcend specific technologies or trends in solutions, this paper does not set out to address issues associated with natural and spoken dialogue technologies (Cohen, Giangola, & Bagola, 2004; Leuski & Traum, 2010), dialogue and dialogue management (Lemon, 2011) or recent work on embodied conversational agents in virtual applications (Traum, 2008). Furthermore, the technologies and methods underpinning multimodal interaction (Lee & Billinghurst, 2008), tangible or mobile interfaces (Billinghurst, Kato, & Myojin, 2009) is not a primary focus of this paper either. And whilst some researchers may consider it difficult to address the nature of speech-based interaction without addressing principles of dialogue and dialogue management, this paper seeks to place the focus more fundamentally within human factors by looking at the user and then seeking to develop usable virtual applications incorporating speech to support user interaction rather than to review the current state of the art in interaction design. This paper discusses the importance of speech input across a number of key areas: 1. Human factors issues in speech input, 2. The interaction potential of speech input, 3. Acceptance issues of speech input in virtual applications, 4. Designing virtual applications with speech input in mind, 5. Developing speech input for virtual applications, 6. Speech input as an interaction tool. 2. HUMAN FACTORS ISSUES IN SPEECH INPUT A number of reviews have been conducted that provide a valuable resource for researchers and systems designers alike. From these, human factors of speech input can be seen to have 14 more pages are available in the full version of this document, which may be purchased using the \\\"Add to Cart\\\" button on the product's webpage: www.igi-global.com/article/developing-speech-input-virtualapplications/72689?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Communications and Social Science, InfoSciTechnology Adoption, Ethics, and Human Computer Interaction eJournal Collection, InfoSci-Digital Marketing, EBusiness, and E-Services eJournal Collection, InfoSciNetworking, Mobile Applications, and Web Technologies eJournal Collection. Recommend this product to your\",\"PeriodicalId\":309154,\"journal\":{\"name\":\"Int. J. People Oriented Program.\",\"volume\":\"89 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. People Oriented Program.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4018/ijpop.2011070103\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. People Oriented Program.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijpop.2011070103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文从以用户为中心的人为因素角度分析语音输入的位置。它是作为一份立场文件提出的，以便研究人员和设计师可以考虑虚拟应用中以人为本的语音输入方法的潜在和未来因素。一些关键领域的探索包括:人为因素的语音输入;虚拟应用的语音输入;言语作为一种备用的互动方式;用户接受和接受;在虚拟应用程序的开发中加入语音;语音输入作为一种交互工具。鉴于本文以用户为中心的观点，本文不着手解决与口语对话技术、对话和对话管理相关的问题;最近对虚拟环境中对话代理的研究;或者多模态交互。本文将重点放在人为因素上，首先将用户视为开发包含语音输入的可用虚拟应用程序的基础，而不是回顾交互设计的当前艺术状态。然而，本文提出的一个特别观点是，语音输入应该被设计和使用为用户需要学习使用的另一种交互工具，而不是假设它将提供一个自然或直观的界面。DOI: 10.4018 / ijpop.201107010351版权所有©2011,IGI Global。未经IGI Global书面许可，禁止以印刷或电子形式复制或分发。设计不支持用户需求或期望的系统是有危险的，如果系统纯粹是基于交互原则发展的，那么就有不充分利用新技术能力的危险。到目前为止，语音识别还没有在原始应用程序中实现(之前没有使用过其他形式的输入设备)，因此，它总是面临来自用户熟悉的更传统输入设备(例如键盘和鼠标)的直接挑战。此外，如果用户对语音输入的期望过高，则存在由于用户受挫导致吸收不良而无法实现其潜力的危险。语音输入可用性的增加“带来了对这些系统的人体工程学方面的充分理解的需要，目的是制定一般指导方针，以确保它们的应用应该尽可能有效”(Hapeshi & Jones, 1988, p. 252)。虽然使用语音输入与机器交互的潜力已经存在了几十年(Ullman, 1987)，技术进步，特别是随着计算和分布式交互技术的发展，允许以人为本的交互的新方法，但语音输入仍然是一个难以捉摸的概念，没有广泛使用或最终用户接受(Stedmon等人，2011)。除此之外，用户的期望可能比技术所能提供的要高，系统交互通常不像它应该的那样直观。由于缺乏对用户需求的考虑，用户满意度和新技术的采用可能会受到根本不可用的技术的阻碍。只有当系统开发人员和用户都对开发和使用的潜在问题有更大的认识时，这样的技术才能被设计成以一种平衡的方式包含用户需求和技术能力。本文重点关注以用户为中心和可用的语音界面的设计需求，以及虚拟环境(VE)开发的框架，本文认为，在理解语音输入的一般人为因素问题方面，最近的发展很少，在虚拟应用程序中语音输入的人为因素特定领域的发展就更少了。鉴于本文以用户为中心的观点，以及提出超越特定技术或解决方案趋势的论点的目的，本文不打算解决与自然和口语对话技术相关的问题(Cohen, gi安哥拉，& Bagola, 2004;Leuski & Traum, 2010)，对话和对话管理(Lemon, 2011)，或者最近关于虚拟应用中的具体化对话代理的研究(Traum, 2008)。此外，支撑多模式交互(Lee & Billinghurst, 2008)、有形或移动界面(Billinghurst, Kato， & Myojin, 2009)的技术和方法也不是本文的主要焦点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Developing Speech Input for Virtual Applications: A Human Factors Perspective

This paper contextualises the position of speech input from a user-centred human factors perspective. It is presented as a position paper so that researchers and designers can consider the underlying and future factors of a people-orientated approach to speech input for virtual applications. A number of key areas are explored including: human factors for speech input; speech input for virtual applications; speech as a spare mode of interaction; user acceptance and uptake; incorporating speech in the development of virtual applications; and speech input as an interaction tool. Given the user-centred perspective of this paper, this paper does not set out to address issues associated with spoken dialogue technologies, dialogue, and dialogue management; recent work on conversational agents in virtual environments; or multimodal interaction. This paper places the focus more fundamentally within human factors by looking at the user first as a basis for developing usable virtual applications incorporating speech input rather than to review the current state of the art in interaction design. A particular point this paper makes, however, is that speech input should be designed and used as another interaction tool that users need to learn to use, rather than assuming it will offer a natural or intuitive interface. DOI: 10.4018/ijpop.2011070103 International Journal of People-Oriented Programming, 1(2), 50-65, July-December 2011 51 Copyright © 2011, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. there is a danger of designing systems that do not support user needs or expectations, and if systems evolve along purely interaction based principles then there is a danger of not embracing or exploiting new technological capabilities to their full. To date speech recognition has yet to be implemented in an original application (that has not used another form of input device before) and, as a consequence, it always faces the immediate challenge from more conventional input devices (e.g., keyboard and mouse) that users are familiar with using. Furthermore, if user expectations of speech input are too high, there is a danger that its potential will not be realised due to user frustration leading to poor uptake. The increased availability of speech input “brings with it the need for a full understanding of the ergonomics aspects of these systems with the aim of developing general guidelines to ensure their application should be as effective as possible” (Hapeshi & Jones, 1988, p. 252). Whilst the potential to interact with machines using speech input has been possible for many decades (Ullman, 1987) and technical advances, especially with the development of computing and distributed interaction technologies, allow for new approaches to people orientated interaction, speech input still remains an elusive concept without widespread use or end user acceptance (Stedmon et al., 2011). Alongside the point that user expectations might be higher than the technology can deliver, system interaction is often not as intuitive as it could be. With a lack of consideration for user needs, user satisfaction and the adoption of new technology might be hampered by fundamentally unusable technologies. It is only when system developers and users alike, have a greater awareness of the underlying issues of development and use, that such technologies can be designed that will embrace user needs and technological ability in a balanced manner. With a focus on design needs for usercentred and usable speech interfaces, along with a framework for virtual environment (VE) development, this paper argues that there has been little recent development in understanding generic human factors issues of speech input and even less in the specific area of human factors for speech input in virtual applications. Given the user-centred perspective of this paper and the aim of presenting arguments that transcend specific technologies or trends in solutions, this paper does not set out to address issues associated with natural and spoken dialogue technologies (Cohen, Giangola, & Bagola, 2004; Leuski & Traum, 2010), dialogue and dialogue management (Lemon, 2011) or recent work on embodied conversational agents in virtual applications (Traum, 2008). Furthermore, the technologies and methods underpinning multimodal interaction (Lee & Billinghurst, 2008), tangible or mobile interfaces (Billinghurst, Kato, & Myojin, 2009) is not a primary focus of this paper either. And whilst some researchers may consider it difficult to address the nature of speech-based interaction without addressing principles of dialogue and dialogue management, this paper seeks to place the focus more fundamentally within human factors by looking at the user and then seeking to develop usable virtual applications incorporating speech to support user interaction rather than to review the current state of the art in interaction design. This paper discusses the importance of speech input across a number of key areas: 1. Human factors issues in speech input, 2. The interaction potential of speech input, 3. Acceptance issues of speech input in virtual applications, 4. Designing virtual applications with speech input in mind, 5. Developing speech input for virtual applications, 6. Speech input as an interaction tool. 2. HUMAN FACTORS ISSUES IN SPEECH INPUT A number of reviews have been conducted that provide a valuable resource for researchers and systems designers alike. From these, human factors of speech input can be seen to have 14 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/article/developing-speech-input-virtualapplications/72689?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Communications and Social Science, InfoSciTechnology Adoption, Ethics, and Human Computer Interaction eJournal Collection, InfoSci-Digital Marketing, EBusiness, and E-Services eJournal Collection, InfoSciNetworking, Mobile Applications, and Web Technologies eJournal Collection. Recommend this product to your

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Int. J. People Oriented Program.

自引率

0.00%

发文量