{"title":"一种新的基于组延迟的鲁棒语音识别特性","authors":"Erfan Loweimi, S. Ahadi","doi":"10.1109/ICME.2011.6011884","DOIUrl":null,"url":null,"abstract":"In this paper we present a novel feature extraction algorithm based on group delay function for robust speech recognition. The modified group delay function (MODGDF) is the main feature extraction method based on group delay function, generally used for robust speech recognition. The recognition tests indicate this feature does not provide notably better results in the presence of additive noise in comparison with MFCC. In the presence of convolutional noise, the performance of MODGDF is considerably lower than MFCC. The method proposed in this paper is simple and makes more efficient utilization of the high resolution property of GDF. It is formed from three main parts which are signal modeling, GDF computation based on extracted model, and compression. The recognition results obtained over AURORA 2.0 task indicate its superior performance in comparison with MODGDF and MFCC.","PeriodicalId":433997,"journal":{"name":"2011 IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"A new group delay-based feature for robust speech recognition\",\"authors\":\"Erfan Loweimi, S. Ahadi\",\"doi\":\"10.1109/ICME.2011.6011884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we present a novel feature extraction algorithm based on group delay function for robust speech recognition. The modified group delay function (MODGDF) is the main feature extraction method based on group delay function, generally used for robust speech recognition. The recognition tests indicate this feature does not provide notably better results in the presence of additive noise in comparison with MFCC. In the presence of convolutional noise, the performance of MODGDF is considerably lower than MFCC. The method proposed in this paper is simple and makes more efficient utilization of the high resolution property of GDF. It is formed from three main parts which are signal modeling, GDF computation based on extracted model, and compression. The recognition results obtained over AURORA 2.0 task indicate its superior performance in comparison with MODGDF and MFCC.\",\"PeriodicalId\":433997,\"journal\":{\"name\":\"2011 IEEE International Conference on Multimedia and Expo\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Conference on Multimedia and Expo\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICME.2011.6011884\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Multimedia and Expo","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME.2011.6011884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new group delay-based feature for robust speech recognition
In this paper we present a novel feature extraction algorithm based on group delay function for robust speech recognition. The modified group delay function (MODGDF) is the main feature extraction method based on group delay function, generally used for robust speech recognition. The recognition tests indicate this feature does not provide notably better results in the presence of additive noise in comparison with MFCC. In the presence of convolutional noise, the performance of MODGDF is considerably lower than MFCC. The method proposed in this paper is simple and makes more efficient utilization of the high resolution property of GDF. It is formed from three main parts which are signal modeling, GDF computation based on extracted model, and compression. The recognition results obtained over AURORA 2.0 task indicate its superior performance in comparison with MODGDF and MFCC.