Kaibei Peng, Xiaoming Sun, Haowei Chen, Zhen He, Jianrong Wang
2021 3rd International Conference on Industrial Artificial Intelligence (IAI), published 2021-11-08. DOI: 10.1109/IAI53119.2021.9619422 (https://doi.org/10.1109/IAI53119.2021.9619422)
A Speech Enhancement Method Using Attention Mechanism and Gated Recurrent Unit
Noise severely degrades speech, so speech enhancement plays a vital role in speech signal processing. To further improve enhancement performance, this paper proposes a speech enhancement method based on a gated recurrent unit with an attention mechanism (AGRU). First, the attention mechanism extracts the important features of the speech signal. Then the gated recurrent unit (GRU) learns the complex mapping from noisy speech to clean speech. Speech recorded with different emotions is used for the simulations. The results show that the proposed method removes noise from speech and performs better than the other methods compared. The method can serve as a reference for applying deep learning to speech enhancement.
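The two-stage idea in the abstract (attention over the input features, then a GRU mapping noisy speech to clean speech) can be illustrated with a minimal sketch. This is not the authors' AGRU: the abstract does not give the attention form, the feature type, the layer sizes, or the training procedure, so everything below is an assumption. The sketch uses dot-product attention against a hypothetical `query` vector, the standard GRU gate equations without biases, random untrained weights, and random numbers standing in for noisy feature frames. The names `attention_weights`, `gru_step`, `enhance`, and `init_params` are illustrative only.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(mat, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in mat]


def rand_matrix(rows, cols, rng):
    """Small random weights; a trained model would learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def attention_weights(frames, query):
    """One weight per frame: softmax over dot products with a query vector."""
    return softmax([sum(q * x for q, x in zip(query, f)) for f in frames])


def gru_step(x, h, p):
    """One step of a standard GRU cell (biases omitted for brevity)."""
    z = [sigmoid(a + b) for a, b in zip(matvec(p["Wz"], x), matvec(p["Uz"], h))]
    r = [sigmoid(a + b) for a, b in zip(matvec(p["Wr"], x), matvec(p["Ur"], h))]
    rh = [ri * hi for ri, hi in zip(r, h)]  # reset gate applied to the old state
    cand = [math.tanh(a + b) for a, b in zip(matvec(p["Wh"], x), matvec(p["Uh"], rh))]
    # Update gate blends the old state with the candidate state.
    return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]


def enhance(frames, params):
    """Attention-reweighted frames -> GRU -> linear output per frame."""
    weights = attention_weights(frames, params["query"])
    n = len(frames)
    h = [0.0] * len(params["Uz"])
    outputs = []
    for w, frame in zip(weights, frames):
        # Scale by n so that uniform attention leaves the frame unchanged.
        x = [w * n * v for v in frame]
        h = gru_step(x, h, params)
        outputs.append(matvec(params["Wo"], h))
    return outputs, weights


def init_params(feat_dim, hidden_dim, seed=0):
    """Hypothetical untrained parameters for the sketch."""
    rng = random.Random(seed)
    p = {k: rand_matrix(hidden_dim, feat_dim, rng) for k in ("Wz", "Wr", "Wh")}
    p.update({k: rand_matrix(hidden_dim, hidden_dim, rng) for k in ("Uz", "Ur", "Uh")})
    p["Wo"] = rand_matrix(feat_dim, hidden_dim, rng)
    p["query"] = [rng.uniform(-0.5, 0.5) for _ in range(feat_dim)]
    return p


# Six random 4-dimensional frames stand in for noisy speech features.
params = init_params(feat_dim=4, hidden_dim=8)
rng = random.Random(1)
noisy = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(6)]
estimate, attn = enhance(noisy, params)
print(len(estimate), len(estimate[0]), round(sum(attn), 6))
```

With untrained weights the output is not an enhanced signal; the sketch only shows the data flow. In practice the weights would be fitted on pairs of noisy and clean speech features, typically with a deep learning framework rather than hand-written loops.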