Yonglin Wu, Jun Zhang, Yue Wu, Geng-xin Ning, Cui Yang
Speech Enhancement Based on Multi-Objective Ensemble Learning
2022 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), published 2022-10-25
DOI: 10.1109/ICSPCC55723.2022.9984412 (https://doi.org/10.1109/ICSPCC55723.2022.9984412)
Citations: 0
Abstract
The performance of traditional speech enhancement methods based on deep neural networks is limited by their reliance on a single training objective and a single network structure. In this paper, we propose a speech enhancement method based on multi-objective ensemble learning. First, the traditional multi-objective learning network structure is modified to reduce the training conflict caused by excessive parameter sharing. Then, a multi-objective ensemble learning based speech enhancement method is built from the modified multi-objective deep neural network (DNN), convolutional neural network (CNN), and gated recurrent unit (GRU) network, which overcomes the homogeneity of base models in traditional ensemble learning based speech enhancement networks. Experimental results show that the proposed method outperforms traditional multi-objective learning and ensemble learning based speech enhancement methods in terms of perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI) scores.
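The fusion idea described above can be illustrated with a minimal sketch. The abstract does not state which training objectives are used or how the base models' outputs are combined, so everything here is an assumption made for illustration: each base model is assumed to output a directly mapped clean magnitude spectrum and a ratio mask, and the ensemble is assumed to take a plain average. The function name `fuse_estimates` and the toy values are hypothetical and are not taken from the paper.

```python
import numpy as np


def fuse_estimates(noisy_mag, predictions):
    """Average clean-magnitude estimates from several base models.

    ASSUMPTION: each base model (e.g. a DNN, CNN, or GRU) returns two
    outputs, a directly mapped clean magnitude and a ratio mask. The
    paper's actual objectives and fusion rule may differ.

    noisy_mag:   array (frames, bins), magnitude spectrum of the noisy input.
    predictions: list of (direct_mag, mask) pairs, one pair per base model.
    """
    estimates = []
    for direct_mag, mask in predictions:
        # Objective 1 (assumed): spectral mapping, used as-is.
        estimates.append(direct_mag)
        # Objective 2 (assumed): masking, applied to the noisy spectrum.
        estimates.append(np.clip(mask, 0.0, 1.0) * noisy_mag)
    # Simple average over all objectives and all base models.
    return np.mean(estimates, axis=0)


# Toy input: 2 frames, 3 frequency bins, three hypothetical base models.
noisy = np.full((2, 3), 2.0)
preds = [
    (np.full((2, 3), 1.0), np.full((2, 3), 0.5)),
    (np.full((2, 3), 0.8), np.full((2, 3), 0.6)),
    (np.full((2, 3), 1.2), np.full((2, 3), 0.4)),
]
enhanced = fuse_estimates(noisy, preds)
```

In this toy case the six estimates (two per model) average to 1.0 in every bin. A weighted average or a learned fusion layer would be an equally plausible choice; the uniform mean is used only because it is the simplest baseline.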