{"title":"A whole word recurrent neural network for keyword spotting","authors":"K. Li, J. Naylor, M. L. Rossen","doi":"10.1109/ICASSP.1992.226115","DOIUrl":null,"url":null,"abstract":"The authors present a neural network which is trained on word examples to perform the wordspotting task. This network has multiple recurrent connections with time delay to account for temporal dynamics. A single network may be trained to recognize one word or many words. A hybrid wordspotter is evaluated in which a conventional wordspotter (based on dynamic time warping word matching) is used to screen incoming speech for potential keywords which are then passed to the network for the final accept/reject decision. Initial tests on a standard wordspotting test corpora resulted in improved keyword recognition at false alarm rates above zero.<<ETX>>","PeriodicalId":163713,"journal":{"name":"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1992.226115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26
Abstract
The authors present a neural network which is trained on word examples to perform the wordspotting task. This network has multiple recurrent connections with time delay to account for temporal dynamics. A single network may be trained to recognize one word or many words. A hybrid wordspotter is evaluated in which a conventional wordspotter (based on dynamic time warping word matching) is used to screen incoming speech for potential keywords which are then passed to the network for the final accept/reject decision. Initial tests on a standard wordspotting test corpora resulted in improved keyword recognition at false alarm rates above zero.<>