从说话和非说话背景中分离语音的联合系统,以及去混响:在真实录音中的应用

Belhedi Wiem, M. B. Messaoud, A. Bouzid
{"title":"从说话和非说话背景中分离语音的联合系统,以及去混响:在真实录音中的应用","authors":"Belhedi Wiem, M. B. Messaoud, A. Bouzid","doi":"10.1109/ICFSP.2017.8097055","DOIUrl":null,"url":null,"abstract":"In real-life environment, the speech of interest is often correlated with different kinds of perturbation. Perturbation can be caused by speaking or non-speaking noise, or even by reverberation. This could make the speech signal auditable but not intelligible. In this case, speech cannot be exploited by other automated applications such as voice-command or speech/speaker identification and identification. Extracting a meaningful signal of good quality is a bigger challenge in monaural case. In this paper, we propose an extensible full joint system that deals with real-environment perturbations that include speaking and non-speaking background as well as reverberation. After introducing the input signal, a decision is taken on the process to opt for. The main chain blocks of the proposed system are speech denoising, speech separation and speech de-reverberation. The system operates in single channel case in a fully unsupervised manner. Furthermore, it requires minimal information about the reference signal. As the system is targeting real-time enhancement, results evaluation is conduct in terms of non-intrusive metrics in addition to intrusive metrics. The evaluation results prove the effectiveness of the proposed system in cancelling difficult noise types, in extracting desired speaker from speaking background, and in enhancing reverberated speech.","PeriodicalId":382413,"journal":{"name":"2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Joint system for speech separation from speaking and non-speaking background, and de-reverberation: Application on real-world recordings\",\"authors\":\"Belhedi Wiem, M. B. Messaoud, A. Bouzid\",\"doi\":\"10.1109/ICFSP.2017.8097055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In real-life environment, the speech of interest is often correlated with different kinds of perturbation. Perturbation can be caused by speaking or non-speaking noise, or even by reverberation. This could make the speech signal auditable but not intelligible. In this case, speech cannot be exploited by other automated applications such as voice-command or speech/speaker identification and identification. Extracting a meaningful signal of good quality is a bigger challenge in monaural case. In this paper, we propose an extensible full joint system that deals with real-environment perturbations that include speaking and non-speaking background as well as reverberation. After introducing the input signal, a decision is taken on the process to opt for. The main chain blocks of the proposed system are speech denoising, speech separation and speech de-reverberation. The system operates in single channel case in a fully unsupervised manner. Furthermore, it requires minimal information about the reference signal. As the system is targeting real-time enhancement, results evaluation is conduct in terms of non-intrusive metrics in addition to intrusive metrics. The evaluation results prove the effectiveness of the proposed system in cancelling difficult noise types, in extracting desired speaker from speaking background, and in enhancing reverberated speech.\",\"PeriodicalId\":382413,\"journal\":{\"name\":\"2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFSP.2017.8097055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFSP.2017.8097055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

在现实生活环境中,感兴趣的言语往往与不同类型的扰动相关。干扰可以由说话或非说话噪声引起,甚至由混响引起。这可能使语音信号可听,但无法理解。在这种情况下,语音不能被其他自动化应用程序(如语音命令或语音/说话人识别和识别)利用。在单音频情况下,提取有意义的高质量信号是一个更大的挑战。在本文中,我们提出了一个可扩展的全联合系统,该系统处理包括说话和非说话背景以及混响的真实环境摄动。在引入输入信号后,对要选择的过程做出决定。该系统的主要链块是语音去噪、语音分离和语音去混响。该系统在单通道情况下完全无监督运行。此外,它对参考信号的信息要求最少。由于系统的目标是增强实时性,因此除了采用侵入性指标外,还采用非侵入性指标进行结果评估。评价结果证明了该系统在消除困难噪声类型、从说话背景中提取需要的说话人以及增强混响语音方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Joint system for speech separation from speaking and non-speaking background, and de-reverberation: Application on real-world recordings
In real-life environment, the speech of interest is often correlated with different kinds of perturbation. Perturbation can be caused by speaking or non-speaking noise, or even by reverberation. This could make the speech signal auditable but not intelligible. In this case, speech cannot be exploited by other automated applications such as voice-command or speech/speaker identification and identification. Extracting a meaningful signal of good quality is a bigger challenge in monaural case. In this paper, we propose an extensible full joint system that deals with real-environment perturbations that include speaking and non-speaking background as well as reverberation. After introducing the input signal, a decision is taken on the process to opt for. The main chain blocks of the proposed system are speech denoising, speech separation and speech de-reverberation. The system operates in single channel case in a fully unsupervised manner. Furthermore, it requires minimal information about the reference signal. As the system is targeting real-time enhancement, results evaluation is conduct in terms of non-intrusive metrics in addition to intrusive metrics. The evaluation results prove the effectiveness of the proposed system in cancelling difficult noise types, in extracting desired speaker from speaking background, and in enhancing reverberated speech.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信