Author: M. Peters
Published in: Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis (Cat. No.98TH8380), 1998-10-06
DOI: 10.1109/TFSA.1998.721498 (https://doi.org/10.1109/TFSA.1998.721498)
Binaural Bark subband preprocessing of nonstationary signals for noise robust speech feature extraction
A two-channel approach to noise-robust feature extraction for in-car speech recognition is proposed. The coherence function is calculated within the Bark subbands of the mel-frequency cepstral transform to estimate the spectral similarity of two stochastic processes. It is illustrated how the coherence of speech in binaural signals can be used to increase robustness against incoherent noise. The proposed preprocessing of nonstationary signals from two microphones results in an additive correction term for the mel-frequency cepstral coefficients.
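The abstract does not give the formulas, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes the standard magnitude-squared coherence, averaged over frames and over the bins of each subband, and it assumes (hypothetically) that the coherence itself is used as a per-band weight on the band energy. Under that assumption the weight becomes an additive term in the log band energies, and because the DCT is linear it stays additive in the cepstral coefficients. The function names, the band edges in `BANDS`, the frame sizes, and the `floor` parameter are all illustrative choices, and Gaussian noise stands in for real microphone signals.

```python
import cmath
import math
import random


def half_spectrum(frame):
    """Naive DFT of a real frame, bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    return [
        sum(v * cmath.exp(-2j * math.pi * k * t / n) for t, v in enumerate(frame))
        for k in range(n // 2 + 1)
    ]


def band_coherence(x1, x2, frame_len, hop, band_edges):
    """Magnitude-squared coherence of two channels, one value per subband.

    band_edges holds (first_bin, last_bin_exclusive) pairs. Auto- and
    cross-spectra are summed over all frames and over the bins of each band.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / frame_len)
              for t in range(frame_len)]  # Hann window
    s11 = [0.0] * len(band_edges)
    s22 = [0.0] * len(band_edges)
    s12 = [0j] * len(band_edges)
    for start in range(0, len(x1) - frame_len + 1, hop):
        f1 = half_spectrum([w * v for w, v in zip(window, x1[start:start + frame_len])])
        f2 = half_spectrum([w * v for w, v in zip(window, x2[start:start + frame_len])])
        for b, (lo, hi) in enumerate(band_edges):
            for k in range(lo, hi):
                s11[b] += abs(f1[k]) ** 2
                s22[b] += abs(f2[k]) ** 2
                s12[b] += f1[k] * f2[k].conjugate()
    # Small constant guards against division by zero in silent bands.
    return [abs(c) ** 2 / (a * b + 1e-12) for a, b, c in zip(s11, s22, s12)]


def log_band_correction(coherence, floor=1e-3):
    """Assumed weighting: log(coh * E) = log(E) + log(coh), so log(coh) is additive."""
    return [math.log(max(c, floor)) for c in coherence]


def dct_ii(values, n_coeffs):
    """Unnormalised DCT-II, the transform commonly used to obtain cepstral coefficients."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * q * (m + 0.5) / n) for m, v in enumerate(values))
        for q in range(n_coeffs)
    ]


# Illustrative band edges for a 32-point frame; these are NOT real Bark band edges.
BANDS = [(1, 3), (3, 6), (6, 10), (10, 17)]

random.seed(0)
common = [random.gauss(0, 1) for _ in range(1024)]   # stand-in for coherent speech
noise_a = [random.gauss(0, 1) for _ in range(1024)]  # independent noise, channel 1
noise_b = [random.gauss(0, 1) for _ in range(1024)]  # independent noise, channel 2

coh_same = band_coherence(common, common, 32, 16, BANDS)
coh_noise = band_coherence(noise_a, noise_b, 32, 16, BANDS)

# Additive cepstral correction implied by the assumed coherence weighting.
cepstral_correction = dct_ii(log_band_correction(coh_noise), 3)
```

In this sketch, a signal present identically in both channels yields band coherence close to 1 (a correction near zero), while independent noise in the two channels yields low coherence and therefore a strongly negative log-domain correction for those bands. For real signals, a library routine such as an FFT would replace the naive DFT used here for self-containment.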