Eyes and Ears for Computers

Proceedings of the IRE Pub Date : 1962-05-01 DOI:10.1109/JRPROC.1962.288011

E. E. David, O. Selfridge

{"title":"Eyes and Ears for Computers","authors":"E. E. David, O. Selfridge","doi":"10.1109/JRPROC.1962.288011","DOIUrl":null,"url":null,"abstract":"Attempts to mechanize character reading and speech recognition have greatly accelerated in the past decade. This increased interest was prompted by the promise of computer inputs more flexible in format than punched cards or magnetic tape. Research has shown that automatic sensing can be done reliably if the task is suitably delimited. Cleverly designed marks on standard forms can be both machine and man readable. A single type font or a few fixed ones are tractable if the print quality is controlled. Handprinting can be handled for careful writers, as can meticulous handwriting. Isolated spoken words taken from a small number of talkers and a limited vocabulary can be automatically recognized. Typical error rates for these machine-sensings run between 0.5 and 25 per cent. These results imply that reading unrestricted typestyles, handwritten scrawl, or recognizing conversational speech is beyond the reach of present methods. From the engineering viewpoint, questions of values enter. Might it not be wiser to punch cards or tape while making copy rather than depend upon complex character recognition hardware? Is it useful to have voice input to a computer when a finger and typewriter are available? Answers to such questions will depend upon the specific application. Certainly, the utility of automatic sensing will depend upon what is to be done with the material after it enters the computer as well as the internal organization of the machine itself.","PeriodicalId":20574,"journal":{"name":"Proceedings of the IRE","volume":"20 1","pages":"1093-1101"},"PeriodicalIF":0.0000,"publicationDate":"1962-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the IRE","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JRPROC.1962.288011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

Attempts to mechanize character reading and speech recognition have greatly accelerated in the past decade. This increased interest was prompted by the promise of computer inputs more flexible in format than punched cards or magnetic tape. Research has shown that automatic sensing can be done reliably if the task is suitably delimited. Cleverly designed marks on standard forms can be both machine and man readable. A single type font or a few fixed ones are tractable if the print quality is controlled. Handprinting can be handled for careful writers, as can meticulous handwriting. Isolated spoken words taken from a small number of talkers and a limited vocabulary can be automatically recognized. Typical error rates for these machine-sensings run between 0.5 and 25 per cent. These results imply that reading unrestricted typestyles, handwritten scrawl, or recognizing conversational speech is beyond the reach of present methods. From the engineering viewpoint, questions of values enter. Might it not be wiser to punch cards or tape while making copy rather than depend upon complex character recognition hardware? Is it useful to have voice input to a computer when a finger and typewriter are available? Answers to such questions will depend upon the specific application. Certainly, the utility of automatic sensing will depend upon what is to be done with the material after it enters the computer as well as the internal organization of the machine itself.

查看原文本刊更多论文

电脑的眼睛和耳朵

在过去的十年里，对字符阅读和语音识别的机械化尝试大大加快了速度。计算机输入的格式比打孔卡片或磁带更灵活，这一前景促使人们对计算机的兴趣日益浓厚。研究表明，只要对任务进行适当的划分，就可以可靠地实现自动感知。在标准表格上巧妙设计的标记既可用于机器也可用于人读。如果印刷质量得到控制，单一字体或几个固定字体都是可以处理的。手印可以为细心的作家处理，就像精细的书写一样。从少数说话者和有限的词汇中提取的孤立的口语单词可以自动识别。这些机器感知的典型错误率在0.5%到25%之间。这些结果意味着，阅读不受限制的字体、手写字迹或识别会话语音，目前的方法都无法实现。从工程的观点来看，价值的问题进入了。在复制时打孔卡片或磁带是否比依赖复杂的字符识别硬件更明智?在有手指和打字机的情况下，对电脑进行语音输入有用吗?这些问题的答案将取决于具体的应用。当然，自动感应的效用将取决于材料进入计算机后如何处理，以及机器本身的内部组织。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the IRE

自引率

0.00%

发文量