Adaptive multi-band filter structure-based far-end speech enhancement

P. ActlinJeevaM, Tushar Nagarajan, P. Vijayalakshmi
{"title":"Adaptive multi-band filter structure-based far-end speech enhancement","authors":"P. ActlinJeevaM, Tushar Nagarajan, P. Vijayalakshmi","doi":"10.1049/iet-spr.2019.0226","DOIUrl":null,"url":null,"abstract":"Speech signals degraded by noise tend to lose their quality and intelligibility. Therefore, the goal of speech enhancement algorithms is to restore these attributes of speech. The current work proposes a dynamic filter structure, dynamic over every utterance, that would vary simultaneously based on (i) the class of sound units in a given noisy/degraded signal, to improve intelligibility, and (ii) the noise components present, to improve quality. This filter structure is employed in the temporal-domain filtering-based multi-band speech enhancement algorithm, proposed by Jeeva et al.. The performance of the algorithm is evaluated subjectively and objectively, in terms of quality and intelligibility, and the algorithm is observed to successfully improve both attributes of degraded speech. Since the improvement in intelligibility depends on the effective restoration of sound units in the utterance, this process is language-specific. In this regard, an analysis is performed to study the influence of phone class distribution on the intelligibility improvement achieved for four Indian languages, namely Tamil, Hindi, Telugu, and Malayalam, and Indian English.","PeriodicalId":272888,"journal":{"name":"IET Signal Process.","volume":"107 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Signal Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/iet-spr.2019.0226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Speech signals degraded by noise tend to lose their quality and intelligibility. Therefore, the goal of speech enhancement algorithms is to restore these attributes of speech. The current work proposes a dynamic filter structure, dynamic over every utterance, that would vary simultaneously based on (i) the class of sound units in a given noisy/degraded signal, to improve intelligibility, and (ii) the noise components present, to improve quality. This filter structure is employed in the temporal-domain filtering-based multi-band speech enhancement algorithm, proposed by Jeeva et al.. The performance of the algorithm is evaluated subjectively and objectively, in terms of quality and intelligibility, and the algorithm is observed to successfully improve both attributes of degraded speech. Since the improvement in intelligibility depends on the effective restoration of sound units in the utterance, this process is language-specific. In this regard, an analysis is performed to study the influence of phone class distribution on the intelligibility improvement achieved for four Indian languages, namely Tamil, Hindi, Telugu, and Malayalam, and Indian English.
基于自适应多频带滤波器结构的远端语音增强
由于噪声的影响,语音信号往往会失去其质量和可理解性。因此,语音增强算法的目标就是恢复语音的这些属性。目前的工作提出了一种动态滤波器结构,在每个话语上都是动态的,它将根据(i)给定噪声/退化信号中的声音单元类别同时变化,以提高可理解性,以及(ii)存在的噪声成分,以提高质量。该滤波器结构被用于Jeeva等人提出的基于时域滤波的多频带语音增强算法。从质量和可理解性两个方面对算法的性能进行了主观和客观的评价,并观察到该算法成功地改善了退化语音的两个属性。由于可理解性的提高取决于话语中声音单位的有效恢复,因此这一过程是特定于语言的。在这方面,进行了一项分析,研究电话类别分布对四种印度语言(即泰米尔语、印地语、泰卢固语和马拉雅拉姆语)和印度英语的可理解性改善的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信