Role of noise elimination algorithms in speech processing applications: A comprehensive research and some experimental results

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-05-29 DOI:10.1016/j.engappai.2025.111116

Nagaraja B.G. , Thimmaraja Yadava G. , Raghudathesh G.P. , Jayanna H.S.

{"title":"Role of noise elimination algorithms in speech processing applications: A comprehensive research and some experimental results","authors":"Nagaraja B.G. , Thimmaraja Yadava G. , Raghudathesh G.P. , Jayanna H.S.","doi":"10.1016/j.engappai.2025.111116","DOIUrl":null,"url":null,"abstract":"<div><div>The performance of speech-based systems is severely degraded due to the presence of background noise in real-world environments. Effective noise elimination algorithms are essential for enhancing speech quality and improving the performance of speech processing applications, such as voice activity detection (VAD) and speech encoding. Various speech enhancement techniques have been proposed to tackle this, and in this context, choosing an appropriate enhancement technique for improving speech quality and intelligibility is an important consideration. This paper presents a concise experimental review of different noise elimination techniques using objective and subjective metrics. The experiments are conducted on the noisy speech corpus (NOIZEUS) across different noise types and signal-to-noise ratio (SNR) levels. Comparative results indicate that the soft mask estimator with a <em>priori</em> SNR uncertainty (SMPR) is considerably more useful in enhancing speech quality. Furthermore, we analyze the SMPR performance in enhancing speech quality under various noise conditions, specifically focusing on their impact on speech encoding and VAD applications. Our results reveal that integrating the SMPR enhancement module into linear predictive coding (LPC)-based speech encoding system significantly improves speech quality. Additionally, the application of SMPR in VAD systems demonstrates notable improvements, enhancing the accuracy of speech detection.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"156 ","pages":"Article 111116"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625011170","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The performance of speech-based systems is severely degraded due to the presence of background noise in real-world environments. Effective noise elimination algorithms are essential for enhancing speech quality and improving the performance of speech processing applications, such as voice activity detection (VAD) and speech encoding. Various speech enhancement techniques have been proposed to tackle this, and in this context, choosing an appropriate enhancement technique for improving speech quality and intelligibility is an important consideration. This paper presents a concise experimental review of different noise elimination techniques using objective and subjective metrics. The experiments are conducted on the noisy speech corpus (NOIZEUS) across different noise types and signal-to-noise ratio (SNR) levels. Comparative results indicate that the soft mask estimator with a priori SNR uncertainty (SMPR) is considerably more useful in enhancing speech quality. Furthermore, we analyze the SMPR performance in enhancing speech quality under various noise conditions, specifically focusing on their impact on speech encoding and VAD applications. Our results reveal that integrating the SMPR enhancement module into linear predictive coding (LPC)-based speech encoding system significantly improves speech quality. Additionally, the application of SMPR in VAD systems demonstrates notable improvements, enhancing the accuracy of speech detection.

查看原文本刊更多论文

噪声消除算法在语音处理应用中的作用：综合研究和一些实验结果

在现实环境中，由于背景噪声的存在，语音系统的性能严重下降。有效的噪声消除算法对于提高语音质量和改善语音处理应用（如语音活动检测（VAD）和语音编码）的性能至关重要。为了解决这个问题，人们提出了各种语音增强技术，在这种情况下，选择合适的增强技术来提高语音质量和可理解性是一个重要的考虑因素。本文简要介绍了使用客观和主观度量的不同噪声消除技术的实验综述。在不同噪声类型和信噪比（SNR）水平的噪声语音语料库（NOIZEUS）上进行了实验。对比结果表明，具有先验信噪比不确定性的软掩码估计器在提高语音质量方面更为有效。此外，我们分析了SMPR在各种噪声条件下提高语音质量的性能，特别关注了它们对语音编码和VAD应用的影响。结果表明，将SMPR增强模块集成到基于线性预测编码（LPC）的语音编码系统中，可以显著提高语音质量。此外，SMPR在VAD系统中的应用也得到了显著改善，提高了语音检测的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.