基于局部信噪比估计的神经网络频谱掩模估计

2008 International Symposium on Telecommunications Pub Date : 2008-10-14 DOI:10.1109/ISTEL.2008.4651373

A. Hadjahmadi, M. Homayounpour, S. Ahadi

{"title":"基于局部信噪比估计的神经网络频谱掩模估计","authors":"A. Hadjahmadi, M. Homayounpour, S. Ahadi","doi":"10.1109/ISTEL.2008.4651373","DOIUrl":null,"url":null,"abstract":"In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.","PeriodicalId":133602,"journal":{"name":"2008 International Symposium on Telecommunications","volume":"191 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Neural Network based local SNR estimation for estimating spectral masks\",\"authors\":\"A. Hadjahmadi, M. Homayounpour, S. Ahadi\",\"doi\":\"10.1109/ISTEL.2008.4651373\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.\",\"PeriodicalId\":133602,\"journal\":{\"name\":\"2008 International Symposium on Telecommunications\",\"volume\":\"191 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Symposium on Telecommunications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISTEL.2008.4651373\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposium on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTEL.2008.4651373","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在这项工作中，我们提出了一种新的掩模估计技术，该技术使用神经网络分类器来确定光谱元素的可靠性。此外，本文还比较了几种不考虑噪声干扰，而是利用语音信号的频谱特征进行分类的方法。在不同噪声条件下，利用高斯混合模型(GMM)对该方法在文本无关说话人识别任务中的性能进行了实验评估。使用TFarsdat语料库获得说话人识别结果。通过添加NOISEX-92数据库中的噪声源来模拟有噪声的语音。实验结果表明，基于神经网络的掩模估计方法对噪声条件下的说话人识别是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Neural Network based local SNR estimation for estimating spectral masks

In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 International Symposium on Telecommunications

自引率

0.00%

发文量