B. Kouhi-Jelehkaran, H. Bakhshi, F. Razzazi, S. Amini
{"title":"Phone-based filter parameter optimization for robust speech recognition using likelihood maximization","authors":"B. Kouhi-Jelehkaran, H. Bakhshi, F. Razzazi, S. Amini","doi":"10.1109/ICOSP.2008.4697194","DOIUrl":null,"url":null,"abstract":"Accuracy of speech recognition systems decreases when the distance between talker and microphone increases. By the using of microphone arrays and appropriate filtering of received signals, the accuracy of recognizer can be increased. Many different methods have been proposed. These methods can be classified in two main approaches: Systems that perform in two independent stages of array processing and recognition, and systems that use the likelihood acoustic information of recognition stage to calibrate the parameters of array processing stage on an utterance-based manner. In this paper a new approach to microphone array processing is proposed in which the parameters of array processing are adjusted based on phones used in language. Optimized filter parameters are stored and used during recognition phase. Persian language is used to find any improvement in speech recognition accuracy.","PeriodicalId":445699,"journal":{"name":"2008 9th International Conference on Signal Processing","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 9th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2008.4697194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Accuracy of speech recognition systems decreases when the distance between talker and microphone increases. By the using of microphone arrays and appropriate filtering of received signals, the accuracy of recognizer can be increased. Many different methods have been proposed. These methods can be classified in two main approaches: Systems that perform in two independent stages of array processing and recognition, and systems that use the likelihood acoustic information of recognition stage to calibrate the parameters of array processing stage on an utterance-based manner. In this paper a new approach to microphone array processing is proposed in which the parameters of array processing are adjusted based on phones used in language. Optimized filter parameters are stored and used during recognition phase. Persian language is used to find any improvement in speech recognition accuracy.