{"title":"基于希尔伯特变换的感知语音增强","authors":"N. Derakhshan, M. Savoji","doi":"10.1109/ISIE.2006.295543","DOIUrl":null,"url":null,"abstract":"A new speech enhancement algorithm using a Hubert transform (HT) based time-frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing critical bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility","PeriodicalId":296467,"journal":{"name":"2006 IEEE International Symposium on Industrial Electronics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Perceptual Speech Enhancement Using Hilbert Transform\",\"authors\":\"N. Derakhshan, M. Savoji\",\"doi\":\"10.1109/ISIE.2006.295543\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new speech enhancement algorithm using a Hubert transform (HT) based time-frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing critical bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility\",\"PeriodicalId\":296467,\"journal\":{\"name\":\"2006 IEEE International Symposium on Industrial Electronics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE International Symposium on Industrial Electronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISIE.2006.295543\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Symposium on Industrial Electronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISIE.2006.295543","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Perceptual Speech Enhancement Using Hilbert Transform
A new speech enhancement algorithm using a Hubert transform (HT) based time-frequency (TF) representation of speech signal with respect to human perception is proposed. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing critical bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement the envelope in each CB is modified, based on the conventional spectral subtraction method, using a time varying gain function which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects of all bands using a perception model. Signal is reconstructed from the modified envelopes and the original phases of noisy signal in critical bands. Experimental results show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility