{"title":"Speech recognition system for automatic telephone operator based on CSS architecture","authors":"N. Hataoka, T. Odaka, A. Amano","doi":"10.1109/IVTTA.1994.341541","DOIUrl":null,"url":null,"abstract":"The paper proposes a new speech recognition system based on CSS (client and server system) architecture and describes a telephone application which combines an existing telephone PBX system and an OA (office automation) system consisting of personal computers and fax machines, The server, which is separated from application software, is mainly for speech recognition, and the client is an application-driven processor which includes a speech input part and application software. The CSS-based speech recognition system makes it much easier to use the installed speech recognition server to various applications without any big changes of the processing algorithm and architecture. The authors also propose a new method for making acoustic models which cover speakers' speech feature variety. This method is based on an automatic generation of speech recognition units from a large speech database.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVTTA.1994.341541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The paper proposes a new speech recognition system based on CSS (client and server system) architecture and describes a telephone application which combines an existing telephone PBX system and an OA (office automation) system consisting of personal computers and fax machines, The server, which is separated from application software, is mainly for speech recognition, and the client is an application-driven processor which includes a speech input part and application software. The CSS-based speech recognition system makes it much easier to use the installed speech recognition server to various applications without any big changes of the processing algorithm and architecture. The authors also propose a new method for making acoustic models which cover speakers' speech feature variety. This method is based on an automatic generation of speech recognition units from a large speech database.<>
本文提出了一种基于CSS (client and server system)架构的新型语音识别系统,描述了一种将现有的电话PBX系统与由个人计算机和传真机组成的OA (office automation)系统相结合的电话应用程序,其中服务器端与应用软件分离,主要用于语音识别,客户端是应用驱动的处理器,包括语音输入部分和应用软件。基于css的语音识别系统使得安装好的语音识别服务器在不改变处理算法和体系结构的情况下,更容易用于各种应用。作者还提出了一种新的方法来制作涵盖说话人语音特征多样性的声学模型。该方法基于从大型语音数据库中自动生成语音识别单元。