{"title":"A Study on Noisy Speech Recognition","authors":"K. Saeed, Adam Szczepanski","doi":"10.1109/ICBAKE.2009.17","DOIUrl":null,"url":null,"abstract":"The paper concentrates on a speech recognition algorithm to work with speech samples of nonhomogeneous quality. The speech samples are acquired using different microphones and are of different quality. Two algorithms of feature extraction are utilized,including the use of Toeplitz matrices and distances of feature points from the Cartesian origin point (0;0) as a reference point. For classification the Nearest Neighbor approach is used. The obtained results are promising.The paper also involves the description of the process in the preparation of speech samples. The approach to estimate the frequency range which contains enough information for proper speech recognition is undertaken. The studies in this paper show that cutting frequencies above 2200 Hz have rather low influence on the proper recognition but may rather lead to increase the error rate.","PeriodicalId":137627,"journal":{"name":"2009 International Conference on Biometrics and Kansei Engineering","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Biometrics and Kansei Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBAKE.2009.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The paper concentrates on a speech recognition algorithm to work with speech samples of nonhomogeneous quality. The speech samples are acquired using different microphones and are of different quality. Two algorithms of feature extraction are utilized,including the use of Toeplitz matrices and distances of feature points from the Cartesian origin point (0;0) as a reference point. For classification the Nearest Neighbor approach is used. The obtained results are promising.The paper also involves the description of the process in the preparation of speech samples. The approach to estimate the frequency range which contains enough information for proper speech recognition is undertaken. The studies in this paper show that cutting frequencies above 2200 Hz have rather low influence on the proper recognition but may rather lead to increase the error rate.