基于希尔伯特变换的感知语音增强

2006 IEEE International Symposium on Industrial Electronics Pub Date : 2006-07-09 DOI:10.1109/ISIE.2006.295543

N. Derakhshan, M. Savoji

{"title":"基于希尔伯特变换的感知语音增强","authors":"N. Derakhshan, M. Savoji","doi":"10.1109/ISIE.2006.295543","DOIUrl":null,"url":null,"abstract":"A new speech enhancement algorithm using a Hubert transform (HT) based time-frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing critical bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility","PeriodicalId":296467,"journal":{"name":"2006 IEEE International Symposium on Industrial Electronics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Perceptual Speech Enhancement Using Hilbert Transform\",\"authors\":\"N. Derakhshan, M. Savoji\",\"doi\":\"10.1109/ISIE.2006.295543\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new speech enhancement algorithm using a Hubert transform (HT) based time-frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing critical bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility\",\"PeriodicalId\":296467,\"journal\":{\"name\":\"2006 IEEE International Symposium on Industrial Electronics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE International Symposium on Industrial Electronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISIE.2006.295543\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Symposium on Industrial Electronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISIE.2006.295543","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

提出了一种基于Hubert变换的语音信号时频增强算法。TF表示是通过对语音信号在听力临界带(CB)的解析分解来实现的，其中使用了解析信号的包络和相位分量。在传统的谱减法的基础上，利用考虑听觉阈值的时变增益函数对每个CB中的包络线进行了改进。该阈值是在使用感知模型的所有波段掩蔽效应的基础上计算的。在关键频带中，利用改进后的包络和噪声信号的原始相位重构信号。实验结果表明，采用包含时间掩蔽的听觉阈值可以有效地消除音乐噪声，而不会显著降低可理解性

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Perceptual Speech Enhancement Using Hilbert Transform

A new speech enhancement algorithm using a Hubert transform (HT) based time-frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing critical bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2006 IEEE International Symposium on Industrial Electronics

自引率

0.00%

发文量