Talking to machines (abstract)

C. Cowley, Dylan M. Jones
Proceedings of the INTERACT '93 and CHI '93 Conference on Human Factors in Computing Systems. Published 1993-05-01. DOI: 10.1145/169059.169512 (https://doi.org/10.1145/169059.169512)
Citations: 2

Abstract

Despite some extravagant claims made regarding the potential of machines which recognize speech input, it is unlikely that they will ever match the speech processing capabilities of humans. The truly conversational computer is still a long way from being realized, and the performance of many contemporary recognizers leaves much to be desired. A machine that may perform efficiently for the experienced user in a controlled laboratory setting can often present substantial problems for unskilled users when it is finally installed in the workplace. Building reliable and acceptable speech interfaces is a subtle and sophisticated process. There is a common assumption that speech interfaces automatically improve system performance when, in fact, the opposite is often the case, particularly if speech is simply added to an existing system rather than included from the outset of development. However, if human factors issues are addressed at the inception of projects rather than as an afterthought, and the machine's capabilities are not overloaded by overambitious design, a great deal can be achieved with devices that can reliably recognize a few selected utterances and take full advantage of the unique properties of spoken dialogue. The user is afforded greater freedom of movement and is thus released from the constraints imposed by conventional keyboard/screen interaction. Furthermore, there is the option of multi-tasking using speech and manual interfaces concurrently. The film shows how dialogue design and error correction strategies, informed by human factors research, can lead to the development of usable and profitable systems.
It starts with a simulation of a truly conversational machine to show the level of performance necessary to compete with human recognition. Template matching recognition is clearly explained so that viewers can see how most devices actually work. The film then shows the Digital Equipment Corporation's DECvoice in a number of voice input and output scenarios which highlight typical design problems and solutions. It concludes with a set of guidelines which will help designers make reasoned decisions about when and how to use speech recognition and avoid the typical problems experienced by users. The film ends with an example of a system which, having been designed with the guidelines in mind, is usable, efficient, and practical within the constraints of contemporary technology.

GUIDELINES FOR SYSTEM DESIGNERS
1. Train the machine in the place it will be used.
2. Use speech consistently for one part of a task.
3. Do not use speech too often.
4. Do not use voice input for spatial information.
5. Develop a special command vocabulary.
6. Incorporate clear error-correction strategies.
7. Provide feedback about the recognizer's activities.
8. Use multiple criteria to evaluate the system.
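
Guidelines 5 to 7 can be illustrated with a small sketch. It is not taken from the film or from DECvoice: the vocabulary, the thresholds, and the names `Response` and `handle_hypothesis` are all hypothetical, and the recognizer is assumed to return one word hypothesis together with a match score between 0 and 1.

```python
from dataclasses import dataclass

# Hypothetical command vocabulary (guideline 5): a few clearly distinct words.
VOCABULARY = {"start", "stop", "next", "cancel"}

# Illustrative thresholds; these values are assumptions, not from the source.
ACCEPT_THRESHOLD = 0.80
CONFIRM_THRESHOLD = 0.50


@dataclass
class Response:
    action: str    # "execute", "confirm", or "reprompt"
    feedback: str  # message shown or spoken to the user (guideline 7)


def handle_hypothesis(word: str, score: float) -> Response:
    """Decide what to do with one recognizer hypothesis and its match score."""
    if word not in VOCABULARY:
        # Words outside the command vocabulary are never acted on.
        return Response("reprompt", "Command not recognized. Please repeat.")
    if score >= ACCEPT_THRESHOLD:
        return Response("execute", f"Heard '{word}'.")
    if score >= CONFIRM_THRESHOLD:
        # Guideline 6: ask before acting on an uncertain match.
        return Response("confirm", f"Did you say '{word}'?")
    return Response("reprompt", "Sorry, please repeat the command.")


if __name__ == "__main__":
    samples = [("stop", 0.93), ("next", 0.61), ("start", 0.20), ("print", 0.99)]
    for word, score in samples:
        print(word, score, handle_hypothesis(word, score))
```

Every branch returns a feedback message, so the user can always tell what the recognizer did with an utterance, and uncertain matches trigger a confirmation step instead of a silent action.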