{"title":"一种基于深度神经网络的带宽扩展和语音增强联合方法","authors":"Taieba Taher, Nursadul Mamun, Md.Azad Hossain","doi":"10.1109/ECCE57851.2023.10101546","DOIUrl":null,"url":null,"abstract":"Recently, joint bandwidth expansion and speech enhancement has been a topic of interest in the field of speech processing. The main challenge in this task is to increase the bandwidth of speech signals while enhancing their quality, simultaneously. Deep neural networks (DNNs) have shown great promise in addressing this challenge, as they can learn complex relationships between the input and output signals. In this study, a joint bandwidth expansion and speech enhancement approach using DNNs have been proposed, which is designed to simultaneously increase the bandwidth of speech signals and reduce noise, while preserving speech quality and intelligibility. This approach leverages the capability of DNNs to simultaneously estimate the missing speech components and the noise profile in the degraded speech signal. The estimated speech components and the noise profile are then used to synthesize a full-band speech signal from a noisy signal with limited bandwidth with improved quality. The network employs three different phases such as oracle, imaged, and noisy phase along with the magnitude spectra to recover high band components. The joint approach demonstrates that the DNN-based bandwidth extension and speech enhancement can be effectively combined to produce high-quality speech signals, outperforms traditional speech enhancement methods, and offers promising solutions for various applications, including speech communication, speech recognition, and speech synthesis.","PeriodicalId":131537,"journal":{"name":"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Joint Bandwidth Expansion and Speech Enhancement Approach Using Deep Neural Network\",\"authors\":\"Taieba Taher, Nursadul Mamun, Md.Azad Hossain\",\"doi\":\"10.1109/ECCE57851.2023.10101546\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, joint bandwidth expansion and speech enhancement has been a topic of interest in the field of speech processing. The main challenge in this task is to increase the bandwidth of speech signals while enhancing their quality, simultaneously. Deep neural networks (DNNs) have shown great promise in addressing this challenge, as they can learn complex relationships between the input and output signals. In this study, a joint bandwidth expansion and speech enhancement approach using DNNs have been proposed, which is designed to simultaneously increase the bandwidth of speech signals and reduce noise, while preserving speech quality and intelligibility. This approach leverages the capability of DNNs to simultaneously estimate the missing speech components and the noise profile in the degraded speech signal. The estimated speech components and the noise profile are then used to synthesize a full-band speech signal from a noisy signal with limited bandwidth with improved quality. The network employs three different phases such as oracle, imaged, and noisy phase along with the magnitude spectra to recover high band components. The joint approach demonstrates that the DNN-based bandwidth extension and speech enhancement can be effectively combined to produce high-quality speech signals, outperforms traditional speech enhancement methods, and offers promising solutions for various applications, including speech communication, speech recognition, and speech synthesis.\",\"PeriodicalId\":131537,\"journal\":{\"name\":\"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ECCE57851.2023.10101546\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECCE57851.2023.10101546","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Joint Bandwidth Expansion and Speech Enhancement Approach Using Deep Neural Network
Recently, joint bandwidth expansion and speech enhancement has been a topic of interest in the field of speech processing. The main challenge in this task is to increase the bandwidth of speech signals while enhancing their quality, simultaneously. Deep neural networks (DNNs) have shown great promise in addressing this challenge, as they can learn complex relationships between the input and output signals. In this study, a joint bandwidth expansion and speech enhancement approach using DNNs have been proposed, which is designed to simultaneously increase the bandwidth of speech signals and reduce noise, while preserving speech quality and intelligibility. This approach leverages the capability of DNNs to simultaneously estimate the missing speech components and the noise profile in the degraded speech signal. The estimated speech components and the noise profile are then used to synthesize a full-band speech signal from a noisy signal with limited bandwidth with improved quality. The network employs three different phases such as oracle, imaged, and noisy phase along with the magnitude spectra to recover high band components. The joint approach demonstrates that the DNN-based bandwidth extension and speech enhancement can be effectively combined to produce high-quality speech signals, outperforms traditional speech enhancement methods, and offers promising solutions for various applications, including speech communication, speech recognition, and speech synthesis.