{"title":"基于耳蜗图语音特征的混合模型语音增强","authors":"S. R. Chiluveru, M. Tripathy","doi":"10.1109/temsmet53515.2021.9768782","DOIUrl":null,"url":null,"abstract":"The present work proposes a robust Cochleagram based speech enhancement method using a hybrid model, which is a combination of Denoising Autoencoder (DAE) and Long Short Term Memory (LSTM). The denoising autoencoder model is used for noise removal, but it lacks temporal dependency while the LSTM model shows good temporal connectivity, hence in this work, a hybrid model is proposed which helps to improve the sequential dependence as well as denoising the noisy speech signal. The performance of the proposed Hybrid model is evaluated with unknown noises, and the results of the proposed model are compared with the existing basic Denoising autoencoder and LSTM based speech enhancement algorithms. The proposed model shows improved intelligibility in both low and high signal-to-noise ratio (SNR) environments; higher intelligibility is observed in the negative SNR conditions, whereas the quality results show a slight improvement at all SNR values.","PeriodicalId":170546,"journal":{"name":"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech Enhancement Using Hybrid Model with Cochleagram Speech Feature\",\"authors\":\"S. R. Chiluveru, M. Tripathy\",\"doi\":\"10.1109/temsmet53515.2021.9768782\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The present work proposes a robust Cochleagram based speech enhancement method using a hybrid model, which is a combination of Denoising Autoencoder (DAE) and Long Short Term Memory (LSTM). The denoising autoencoder model is used for noise removal, but it lacks temporal dependency while the LSTM model shows good temporal connectivity, hence in this work, a hybrid model is proposed which helps to improve the sequential dependence as well as denoising the noisy speech signal. The performance of the proposed Hybrid model is evaluated with unknown noises, and the results of the proposed model are compared with the existing basic Denoising autoencoder and LSTM based speech enhancement algorithms. The proposed model shows improved intelligibility in both low and high signal-to-noise ratio (SNR) environments; higher intelligibility is observed in the negative SNR conditions, whereas the quality results show a slight improvement at all SNR values.\",\"PeriodicalId\":170546,\"journal\":{\"name\":\"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/temsmet53515.2021.9768782\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/temsmet53515.2021.9768782","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Enhancement Using Hybrid Model with Cochleagram Speech Feature
The present work proposes a robust Cochleagram based speech enhancement method using a hybrid model, which is a combination of Denoising Autoencoder (DAE) and Long Short Term Memory (LSTM). The denoising autoencoder model is used for noise removal, but it lacks temporal dependency while the LSTM model shows good temporal connectivity, hence in this work, a hybrid model is proposed which helps to improve the sequential dependence as well as denoising the noisy speech signal. The performance of the proposed Hybrid model is evaluated with unknown noises, and the results of the proposed model are compared with the existing basic Denoising autoencoder and LSTM based speech enhancement algorithms. The proposed model shows improved intelligibility in both low and high signal-to-noise ratio (SNR) environments; higher intelligibility is observed in the negative SNR conditions, whereas the quality results show a slight improvement at all SNR values.