{"title":"基于威布尔先验的迭代后验NMF单通道语音增强","authors":"S. Vanambathina, Vaishnavi Anumola, Ponnapalli Tejasree, Nandeesh Kumar, Rama Prakash Reddy Ch","doi":"10.1109/AISP53593.2022.9760648","DOIUrl":null,"url":null,"abstract":"This paper proposes a speech enhancement method for non-stationary Gaussian noise based on regularized non-negative matrix factorization (NMF). The magnitudes of speech and noise are implemented by a model based in iterative posterior NMF which are applied using prior distributions in transform domain. This is used since the sample distributions of the above are well suited to Weibull and Rayleigh densities well. For the accomplishment in time-varying noise environments, both the speech and noise bases of NMF are adapted simultaneously. With the usage of estimated speech presence probability, this paper proposes to adaptively estimate the statistics of these distributions. The method in this paper gives the best results for perceptual evaluation of speech quality (PESQ) and the signal-to-distortion ratio (SDR).","PeriodicalId":6793,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP)","volume":"17 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Weibull Prior based Single Channel Speech Enhancement using Iterative Posterior NMF\",\"authors\":\"S. Vanambathina, Vaishnavi Anumola, Ponnapalli Tejasree, Nandeesh Kumar, Rama Prakash Reddy Ch\",\"doi\":\"10.1109/AISP53593.2022.9760648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a speech enhancement method for non-stationary Gaussian noise based on regularized non-negative matrix factorization (NMF). The magnitudes of speech and noise are implemented by a model based in iterative posterior NMF which are applied using prior distributions in transform domain. This is used since the sample distributions of the above are well suited to Weibull and Rayleigh densities well. For the accomplishment in time-varying noise environments, both the speech and noise bases of NMF are adapted simultaneously. With the usage of estimated speech presence probability, this paper proposes to adaptively estimate the statistics of these distributions. The method in this paper gives the best results for perceptual evaluation of speech quality (PESQ) and the signal-to-distortion ratio (SDR).\",\"PeriodicalId\":6793,\"journal\":{\"name\":\"2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP)\",\"volume\":\"17 1\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AISP53593.2022.9760648\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP53593.2022.9760648","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Weibull Prior based Single Channel Speech Enhancement using Iterative Posterior NMF
This paper proposes a speech enhancement method for non-stationary Gaussian noise based on regularized non-negative matrix factorization (NMF). The magnitudes of speech and noise are implemented by a model based in iterative posterior NMF which are applied using prior distributions in transform domain. This is used since the sample distributions of the above are well suited to Weibull and Rayleigh densities well. For the accomplishment in time-varying noise environments, both the speech and noise bases of NMF are adapted simultaneously. With the usage of estimated speech presence probability, this paper proposes to adaptively estimate the statistics of these distributions. The method in this paper gives the best results for perceptual evaluation of speech quality (PESQ) and the signal-to-distortion ratio (SDR).