利用空气和身体传导信号之间的对应关系半监督增强和抑制自产生语音

2020 28th European Signal Processing Conference (EUSIPCO) Pub Date : 2021-01-24 DOI:10.23919/Eusipco47968.2020.9287512

Moe Takada, Shogo Seki, Patrick Lumban Tobing, T. Toda

{"title":"利用空气和身体传导信号之间的对应关系半监督增强和抑制自产生语音","authors":"Moe Takada, Shogo Seki, Patrick Lumban Tobing, T. Toda","doi":"10.23919/Eusipco47968.2020.9287512","DOIUrl":null,"url":null,"abstract":"We propose a semi-supervised method for enhancing and suppressing self-produced speech recorded with wearable air- and body-conductive microphones. Body-conducted signals are robust against external noise and predominantly contain self-produced speech. As a result, these signals provide informative acoustical clues when estimating a linear filter to separate a mixed signal into self-produced speech and background noise. In a previous study, we proposed a blind source separation method for handling air- and body-conducted signals as a multi-channel signal. While our previously proposed method demonstrated the superior performance that can be achieved by using air- and body-conducted signals in comparison to using only air-conducted signals, the enhanced and suppressed air-conducted signals tended to be contaminated with the acoustical characteristics of the body-conducted signals due to the nonlinear relationship between these signals. To address this issue, in this paper, we introduce a new source model which takes into consideration the correspondence between these signals and incorporates them within a semi-supervised framework. Our experimental results reveal that this new method alleviates the negative effects of using the acoustical characteristics of the body-conducted signals, outperforming our previously proposed method, as well as conventional methods, under a semi-supervised condition.","PeriodicalId":6705,"journal":{"name":"2020 28th European Signal Processing Conference (EUSIPCO)","volume":"1 1","pages":"456-460"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals\",\"authors\":\"Moe Takada, Shogo Seki, Patrick Lumban Tobing, T. Toda\",\"doi\":\"10.23919/Eusipco47968.2020.9287512\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a semi-supervised method for enhancing and suppressing self-produced speech recorded with wearable air- and body-conductive microphones. Body-conducted signals are robust against external noise and predominantly contain self-produced speech. As a result, these signals provide informative acoustical clues when estimating a linear filter to separate a mixed signal into self-produced speech and background noise. In a previous study, we proposed a blind source separation method for handling air- and body-conducted signals as a multi-channel signal. While our previously proposed method demonstrated the superior performance that can be achieved by using air- and body-conducted signals in comparison to using only air-conducted signals, the enhanced and suppressed air-conducted signals tended to be contaminated with the acoustical characteristics of the body-conducted signals due to the nonlinear relationship between these signals. To address this issue, in this paper, we introduce a new source model which takes into consideration the correspondence between these signals and incorporates them within a semi-supervised framework. Our experimental results reveal that this new method alleviates the negative effects of using the acoustical characteristics of the body-conducted signals, outperforming our previously proposed method, as well as conventional methods, under a semi-supervised condition.\",\"PeriodicalId\":6705,\"journal\":{\"name\":\"2020 28th European Signal Processing Conference (EUSIPCO)\",\"volume\":\"1 1\",\"pages\":\"456-460\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 28th European Signal Processing Conference (EUSIPCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/Eusipco47968.2020.9287512\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 28th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/Eusipco47968.2020.9287512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

我们提出了一种半监督的方法来增强和抑制可穿戴式空气传声器和身体传声器记录的自生语音。体传导信号对外部噪声具有鲁棒性，并且主要包含自产生的语音。因此，当估计线性滤波器将混合信号分离为自产生的语音和背景噪声时，这些信号提供了信息丰富的声学线索。在之前的研究中，我们提出了一种盲源分离方法，将空气和身体传导的信号作为多通道信号处理。虽然我们之前提出的方法表明，与仅使用空气传导信号相比，使用空气传导信号和身体传导信号可以实现优越的性能，但由于这些信号之间的非线性关系，增强和抑制的空气传导信号往往会受到身体传导信号的声学特性的污染。为了解决这个问题，在本文中，我们引入了一个新的源模型，该模型考虑了这些信号之间的对应关系，并将它们合并到一个半监督框架中。我们的实验结果表明，在半监督条件下，这种新方法减轻了使用身体传导信号的声学特性的负面影响，优于我们之前提出的方法以及传统方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals

We propose a semi-supervised method for enhancing and suppressing self-produced speech recorded with wearable air- and body-conductive microphones. Body-conducted signals are robust against external noise and predominantly contain self-produced speech. As a result, these signals provide informative acoustical clues when estimating a linear filter to separate a mixed signal into self-produced speech and background noise. In a previous study, we proposed a blind source separation method for handling air- and body-conducted signals as a multi-channel signal. While our previously proposed method demonstrated the superior performance that can be achieved by using air- and body-conducted signals in comparison to using only air-conducted signals, the enhanced and suppressed air-conducted signals tended to be contaminated with the acoustical characteristics of the body-conducted signals due to the nonlinear relationship between these signals. To address this issue, in this paper, we introduce a new source model which takes into consideration the correspondence between these signals and incorporates them within a semi-supervised framework. Our experimental results reveal that this new method alleviates the negative effects of using the acoustical characteristics of the body-conducted signals, outperforming our previously proposed method, as well as conventional methods, under a semi-supervised condition.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 28th European Signal Processing Conference (EUSIPCO)

自引率

0.00%

发文量