B. Kouhi-Jelehkaran, H. Bakhshi, F. Razzazi, S. Amini
2008 9th International Conference on Signal Processing, 2008-12-08
DOI: https://doi.org/10.1109/ICOSP.2008.4697194
Phone-based filter parameter optimization for robust speech recognition using likelihood maximization
The accuracy of speech recognition systems decreases as the distance between the talker and the microphone increases. Using microphone arrays and appropriately filtering the received signals can improve recognizer accuracy. Many methods have been proposed, and they fall into two main approaches: systems that perform array processing and recognition as two independent stages, and systems that use the acoustic likelihood information from the recognition stage to calibrate the parameters of the array processing stage on a per-utterance basis. This paper proposes a new approach to microphone array processing in which the array processing parameters are adjusted based on the phones of the language. The optimized filter parameters are stored and used during the recognition phase. The approach is evaluated on Persian speech to determine whether it improves recognition accuracy.
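The idea of storing filter parameters per phone and reusing them at recognition time can be illustrated with a small sketch. It assumes a filter-and-sum array processor (each microphone channel passes through its own FIR filter and the outputs are summed); the abstract mentions only "filter parameters", so that structure is an assumption. The names `fir_filter`, `filter_and_sum`, `process_segment`, and `phone_filters`, the phone labels, and the tap values are all hypothetical. The likelihood-maximization step that would produce the taps is not shown, and how a phone label is obtained for a segment is not specified in the abstract.

```python
from typing import Dict, List, Sequence


def fir_filter(signal: Sequence[float], taps: Sequence[float]) -> List[float]:
    """Causal FIR filter: y[n] = sum_k taps[k] * x[n - k]."""
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, h in enumerate(taps):
            if n - k >= 0:
                acc += h * signal[n - k]
        out.append(acc)
    return out


def filter_and_sum(channels: Sequence[Sequence[float]],
                   filters: Sequence[Sequence[float]]) -> List[float]:
    """Filter each microphone channel with its own taps, then sum the outputs."""
    if len(channels) != len(filters):
        raise ValueError("need exactly one filter per microphone channel")
    output = [0.0] * len(channels[0])
    for x, h in zip(channels, filters):
        y = fir_filter(x, h)
        for n in range(len(output)):
            output[n] += y[n]
    return output


# Hypothetical store of previously optimized taps, one filter set per phone.
# The values are placeholders, not results from the paper.
phone_filters: Dict[str, List[List[float]]] = {
    "a": [[0.5], [0.5]],            # plain average of two microphones
    "s": [[0.5, 0.0], [0.0, 0.5]],  # second microphone delayed by one sample
}


def process_segment(channels: Sequence[Sequence[float]], phone: str,
                    store: Dict[str, List[List[float]]]) -> List[float]:
    """Apply the stored filter set for the given (assumed known) phone label."""
    return filter_and_sum(channels, store[phone])


if __name__ == "__main__":
    mics = [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]]
    print(process_segment(mics, "a", phone_filters))
    print(process_segment(mics, "s", phone_filters))
```

Keeping the taps in a lookup table keyed by phone reflects the abstract's statement that optimized parameters are stored and reused, rather than re-estimated for every utterance.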