Speech recognition as a computer graphics input technique (Panel Session)

Richard Rabin, Alan R. Strass, Mark Robillard, Sue Schedler, M. Peterson
{"title":"Speech recognition as a computer graphics input technique (Panel Session)","authors":"Richard Rabin, Alan R. Strass, Mark Robillard, Sue Schedler, M. Peterson","doi":"10.1145/800064.801278","DOIUrl":null,"url":null,"abstract":"Richard Rabin Interactive graphics systems typically require intense “hands busy/eyes busy and brains busy” activity on the part of the system user/operator. Voice input by means of automatic speech recognition equipment, offers major potential for improving user/operator productivity. It is the only input technique which does not require the direct use of hands and eyes. Voice input can replace or complement keyboards, function keys, tablets and other types of input devices typically employed for entering commands, alpha and numeric data. Alan R. Strass Effective human interfaces are an essential element of plant information systems and computer integrated processes, such as graphics. In the past, data gathering choices in the factory have been generally dominated by clipboards and travelling punched card decks. Similarly, complicated keystroke sequences have often been required to evoke appropriate computer controlled functionality. Today, a number of advanced human interface techniques can be used to improve both source data capture and the selection of appropriate computer controlled operations. These techniques are becoming an integral part of many emerging on-line/real-time engineering and manufacturing applications. Speech recognition, in particular, is emerging as an important interface technology. Speech input can reduce the amount of attention the user has to spend on the mechanics of recording information of selecting functions and allows users to concentrate on their primary task. Some of the benefits include: (1) reduced user training time, (2) increased worker productivity, (3) reduced secondary key input, and (4) improved timeliness and accuracy of information made available via voice. Currently available speech recognition products have already been used to demonstrate these benefits. These advantages will only increase as the next generation of speech products deliver improved recognition performance. Mark Robillard There is an increasing awareness of the potential for improving operator productivity for graphic systems by the use of voice input. We have analyzed a number of applications in conjunction with potential users to determine how best to use this capability. Initial conclusions are that voice input can be effective, providing that the capabilities of the speech recognition equipment utilized to input voices are matched to the requirements of the applications. Key factors to be considered are vocabulary size and types, and the use of isolated words versus continuous speech utterances. Sue Schedler The addition of voice input to Calma's GBSll System provides users with a fast, accurate means by which they can execute commands. A customized Interstate Electronic VRM System has been integrated into the hardware configuration of all microelectronics products, Chips, Sticks and CARDSII. System software recognizes the input from the Harmony VRM in the same manner as a keyboard or menu button input. All three input methods may be used separately or in conjunction with each other. All command inputs are buffered, so the user need never wait for a command to complete before entering the next. A voice file contains up to 50 words, and any number of files may be uploaded or downloaded by one user. Both standard system commands or GPLII Programs can be executed by voice. On-line design productivity by menu input can be increased by 50% when the operator uses voice. Operator training is unaffected by the addition of voice to the system. Voice is trained as a separate module in the class. Its greatest impact appears once the users know operation syntax and can communicate commands quickly. Calma provides standard voice menu to be used by customers or the customer may modify the file as needed. Most commonly used commands are put into the voice file leaving less frequently used commands for the onscreen menu. By implementing the combination of voice and menu, we can completely eliminate the need for keyboard input. Operators initially are skeptical of the “bell and whistle” feature, but find after using voice input that they do not feel uncomfortable “talking” to the computer, and find it an enjoyable and productive tool. Matthew Peterson The use of voice recognition in a “mature” graphics application raises a number of issues that must be faced. “Mature” refers specifically to the maturity of the user interface in the graphics application. Commercial CAD/CAM systems provide such an application: they have been evolving for over ten years, they possess robust user interface features, and their features are subject to the test of the marketplace. Use of voice recognition in such applications raises a number of challenges to the technology: (1) How does it compare to existing features (e.g., tablet menuing) for servicing typical user interface needs? (2) Are there atypical or emerging needs for which it is particularly suitable? (3) How attractive is its price/productivity offering compared to alternative user investment strategies for increasing productivity? (4) In what directions should it be driven to better service the needs of commercial graphics applications? Material on present and future industry needs, other user interface features, and industry price trends will be presented to help assess these issues.","PeriodicalId":276450,"journal":{"name":"Proceedings of the 9th annual conference on Computer graphics and interactive techniques","volume":"134 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1982-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th annual conference on Computer graphics and interactive techniques","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/800064.801278","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Richard Rabin Interactive graphics systems typically require intense “hands busy/eyes busy and brains busy” activity on the part of the system user/operator. Voice input by means of automatic speech recognition equipment, offers major potential for improving user/operator productivity. It is the only input technique which does not require the direct use of hands and eyes. Voice input can replace or complement keyboards, function keys, tablets and other types of input devices typically employed for entering commands, alpha and numeric data. Alan R. Strass Effective human interfaces are an essential element of plant information systems and computer integrated processes, such as graphics. In the past, data gathering choices in the factory have been generally dominated by clipboards and travelling punched card decks. Similarly, complicated keystroke sequences have often been required to evoke appropriate computer controlled functionality. Today, a number of advanced human interface techniques can be used to improve both source data capture and the selection of appropriate computer controlled operations. These techniques are becoming an integral part of many emerging on-line/real-time engineering and manufacturing applications. Speech recognition, in particular, is emerging as an important interface technology. Speech input can reduce the amount of attention the user has to spend on the mechanics of recording information of selecting functions and allows users to concentrate on their primary task. Some of the benefits include: (1) reduced user training time, (2) increased worker productivity, (3) reduced secondary key input, and (4) improved timeliness and accuracy of information made available via voice. Currently available speech recognition products have already been used to demonstrate these benefits. These advantages will only increase as the next generation of speech products deliver improved recognition performance. Mark Robillard There is an increasing awareness of the potential for improving operator productivity for graphic systems by the use of voice input. We have analyzed a number of applications in conjunction with potential users to determine how best to use this capability. Initial conclusions are that voice input can be effective, providing that the capabilities of the speech recognition equipment utilized to input voices are matched to the requirements of the applications. Key factors to be considered are vocabulary size and types, and the use of isolated words versus continuous speech utterances. Sue Schedler The addition of voice input to Calma's GBSll System provides users with a fast, accurate means by which they can execute commands. A customized Interstate Electronic VRM System has been integrated into the hardware configuration of all microelectronics products, Chips, Sticks and CARDSII. System software recognizes the input from the Harmony VRM in the same manner as a keyboard or menu button input. All three input methods may be used separately or in conjunction with each other. All command inputs are buffered, so the user need never wait for a command to complete before entering the next. A voice file contains up to 50 words, and any number of files may be uploaded or downloaded by one user. Both standard system commands or GPLII Programs can be executed by voice. On-line design productivity by menu input can be increased by 50% when the operator uses voice. Operator training is unaffected by the addition of voice to the system. Voice is trained as a separate module in the class. Its greatest impact appears once the users know operation syntax and can communicate commands quickly. Calma provides standard voice menu to be used by customers or the customer may modify the file as needed. Most commonly used commands are put into the voice file leaving less frequently used commands for the onscreen menu. By implementing the combination of voice and menu, we can completely eliminate the need for keyboard input. Operators initially are skeptical of the “bell and whistle” feature, but find after using voice input that they do not feel uncomfortable “talking” to the computer, and find it an enjoyable and productive tool. Matthew Peterson The use of voice recognition in a “mature” graphics application raises a number of issues that must be faced. “Mature” refers specifically to the maturity of the user interface in the graphics application. Commercial CAD/CAM systems provide such an application: they have been evolving for over ten years, they possess robust user interface features, and their features are subject to the test of the marketplace. Use of voice recognition in such applications raises a number of challenges to the technology: (1) How does it compare to existing features (e.g., tablet menuing) for servicing typical user interface needs? (2) Are there atypical or emerging needs for which it is particularly suitable? (3) How attractive is its price/productivity offering compared to alternative user investment strategies for increasing productivity? (4) In what directions should it be driven to better service the needs of commercial graphics applications? Material on present and future industry needs, other user interface features, and industry price trends will be presented to help assess these issues.
语音识别作为计算机图形输入技术(小组讨论)
交互式图形系统通常需要系统用户/操作员进行强烈的“手忙/眼忙和脑忙”活动。通过自动语音识别设备进行语音输入,为提高用户/操作员的工作效率提供了巨大的潜力。这是唯一一种不需要直接使用手和眼睛的输入技术。语音输入可以取代或补充键盘、功能键、平板电脑和其他类型的输入设备,通常用于输入命令、alpha和数字数据。有效的人机界面是植物信息系统和计算机集成过程(如图形)的基本元素。在过去,工厂的数据收集选择通常由剪贴板和移动穿孔卡片组主导。类似地,通常需要复杂的击键序列来唤起适当的计算机控制功能。今天,许多先进的人机界面技术可用于改进源数据捕获和选择适当的计算机控制操作。这些技术正在成为许多新兴的在线/实时工程和制造应用的组成部分。尤其是语音识别,正在成为一种重要的接口技术。语音输入可以减少用户花在记录信息和选择功能上的注意力,让用户专注于他们的主要任务。一些好处包括:(1)减少用户培训时间,(2)提高工人生产率,(3)减少辅助键输入,以及(4)提高通过语音提供的信息的及时性和准确性。目前可用的语音识别产品已经被用来证明这些好处。随着下一代语音产品提供更好的识别性能,这些优势只会增加。越来越多的人意识到,使用语音输入可以提高图形系统操作员的工作效率。我们已经与潜在用户一起分析了许多应用程序,以确定如何最好地使用此功能。初步结论是,只要用于输入声音的语音识别设备的能力与应用程序的要求相匹配,语音输入可以是有效的。要考虑的关键因素是词汇量和类型,以及孤立词汇的使用与连续话语的使用。Calma的GBSll系统增加了语音输入,为用户提供了一种快速、准确的方式来执行命令。定制的州际电子VRM系统已集成到所有微电子产品,芯片,棒和卡的硬件配置中。系统软件对Harmony VRM输入的识别方式与键盘或菜单按钮输入相同。这三种输入法可以单独使用,也可以相互结合使用。所有的命令输入都是缓冲的,所以用户在输入下一个命令之前不需要等待一个命令完成。一个语音文件最多包含50个单词,一个用户可以上传或下载任意数量的文件。标准系统命令或GPLII程序都可以通过语音执行。当操作员使用语音时,通过菜单输入的在线设计生产率可提高50%。操作员培训不受系统中添加语音的影响。语音在课堂上作为一个单独的模块进行训练。一旦用户了解了操作语法并能够快速地进行命令通信,它的最大影响就会显现出来。Calma提供标准的语音菜单供客户使用,或者客户可以根据需要修改文件。最常用的命令放在语音文件中,不常用的命令留给屏幕菜单。通过实现语音和菜单的结合,我们可以完全消除键盘输入的需要。操作员最初对“铃声和哨子”功能持怀疑态度,但在使用语音输入后,他们发现与计算机“交谈”并不感到不舒服,并且发现它是一种令人愉快和高效的工具。在“成熟的”图形应用程序中使用语音识别提出了许多必须面对的问题。“成熟”特指图形应用程序中用户界面的成熟度。商用CAD/CAM系统提供了这样的应用:它们已经发展了十多年,它们拥有强大的用户界面功能,并且它们的功能受到市场的测试。在此类应用中使用语音识别对该技术提出了许多挑战:(1)它与现有功能(例如语音识别)相比如何? (平板电脑菜单),以满足典型的用户界面需求?(2)是否有特别适合的非典型或新出现的需要?(3)与提高生产率的其他用户投资策略相比,其价格/生产率提供的吸引力有多大?(4)为了更好地满足商业图形应用程序的需求,它应该朝着什么方向发展?将提供有关当前和未来行业需求、其他用户界面功能和行业价格趋势的材料,以帮助评估这些问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信