基于局部信噪比估计的神经网络频谱掩模估计

A. Hadjahmadi, M. Homayounpour, S. Ahadi
{"title":"基于局部信噪比估计的神经网络频谱掩模估计","authors":"A. Hadjahmadi, M. Homayounpour, S. Ahadi","doi":"10.1109/ISTEL.2008.4651373","DOIUrl":null,"url":null,"abstract":"In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.","PeriodicalId":133602,"journal":{"name":"2008 International Symposium on Telecommunications","volume":"191 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Neural Network based local SNR estimation for estimating spectral masks\",\"authors\":\"A. Hadjahmadi, M. Homayounpour, S. Ahadi\",\"doi\":\"10.1109/ISTEL.2008.4651373\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.\",\"PeriodicalId\":133602,\"journal\":{\"name\":\"2008 International Symposium on Telecommunications\",\"volume\":\"191 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Symposium on Telecommunications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISTEL.2008.4651373\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Symposium on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTEL.2008.4651373","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在这项工作中,我们提出了一种新的掩模估计技术,该技术使用神经网络分类器来确定光谱元素的可靠性。此外,本文还比较了几种不考虑噪声干扰,而是利用语音信号的频谱特征进行分类的方法。在不同噪声条件下,利用高斯混合模型(GMM)对该方法在文本无关说话人识别任务中的性能进行了实验评估。使用TFarsdat语料库获得说话人识别结果。通过添加NOISEX-92数据库中的噪声源来模拟有噪声的语音。实验结果表明,基于神经网络的掩模估计方法对噪声条件下的说话人识别是有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Neural Network based local SNR estimation for estimating spectral masks
In this work, we present a new mask estimation technique that uses a neural network classifier to determine the reliability of spectrographic elements. In addition some different kinds of features used for classification were compared that make no assumptions about the corrupting noise signal, but rather exploit spectrographic characteristics of the speech signal. The performance of the proposed method is experimentally evaluated in text independent speaker recognition task using the Gaussian mixture model (GMM) under various noise conditions. The speaker recognition results were achieved using the TFarsdat corpus. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new neural network based mask estimation method is effective for speaker recognition under noisy conditions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信