{"title":"Word recognition based on the combination of a sequential neural network and the GPDM discriminative training algorithm","authors":"Wen-Yuan Chen, Sin-Horng Chen","doi":"10.1109/NNSP.1991.239504","DOIUrl":null,"url":null,"abstract":"The authors propose an isolated-word recognition method based on the combination of a sequential neural network and a discriminative training algorithm using the Generalized Probabilistic Descent Method (GPDM). The sequential neural network deals with the temporal variation of speech by dynamic programming, and the GPDM discriminative training algorithm is used to discriminate easily confused words by enhancing the distinguishing sounds of them during the scoring procedure. A Mandarin digit database uttered by 100 speakers was used to evaluate the performance of this method. The recognition rates are 99.1% on training data and 96.3% on testing data.<<ETX>>","PeriodicalId":354832,"journal":{"name":"Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop","volume":"87 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1991-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NNSP.1991.239504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The authors propose an isolated-word recognition method based on the combination of a sequential neural network and a discriminative training algorithm using the Generalized Probabilistic Descent Method (GPDM). The sequential neural network deals with the temporal variation of speech by dynamic programming, and the GPDM discriminative training algorithm is used to discriminate easily confused words by enhancing the distinguishing sounds of them during the scoring procedure. A Mandarin digit database uttered by 100 speakers was used to evaluate the performance of this method. The recognition rates are 99.1% on training data and 96.3% on testing data.<>