{"title":"基于局部信噪比估计的神经网络频谱掩模估计","authors":"A. Hadjahmadi, M. Homayounpour, S. Ahadi","doi":"10.1109/ISTEL.2008.4651373","DOIUrl":null,"url":null,"abstract":"In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.","PeriodicalId":133602,"journal":{"name":"2008 International Symposium on Telecommunications","volume":"191 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Neural Network based local SNR estimation for estimating spectral masks\",\"authors\":\"A. Hadjahmadi, M. Homayounpour, S. Ahadi\",\"doi\":\"10.1109/ISTEL.2008.4651373\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.\",\"PeriodicalId\":133602,\"journal\":{\"name\":\"2008 International Symposium on Telecommunications\",\"volume\":\"191 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Symposium on Telecommunications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISTEL.2008.4651373\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposium on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTEL.2008.4651373","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Neural Network based local SNR estimation for estimating spectral masks
In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.