Noise robust automatic speaker verification systems: review and analysis

IF 1.7 4区 计算机科学 Q3 TELECOMMUNICATIONS
Sanil Joshi, Mohit Dua
{"title":"Noise robust automatic speaker verification systems: review and analysis","authors":"Sanil Joshi, Mohit Dua","doi":"10.1007/s11235-024-01212-8","DOIUrl":null,"url":null,"abstract":"<p>Like any other biometric systems, Automatic Speaker Verification (ASV) systems are also vulnerable to the spoofing attacks. Hence, it is important to develop the countermeasures in order to handle these attacks. In spoofing mainly two types of attacks are considered, logical access attacks and presentation attacks. In the last few decades, several systems have been proposed by various researchers for handling these kinds of attacks. However, noise handling capability of ASV systems is of major concern, as the presence of noise may make an ASV system to falsely evaluate the original human voice as the spoofed audio. Hence, the main objective of this paper is to review and analyze the various noise robust ASV systems proposed by different researchers in recent years. The paper discusses the various front end and back-end approaches that have been used to develop these systems with putting emphasis on the noise handling techniques. Various kinds of noises such as babble, white, background noises, pop noise, channel noises etc. affect the development of an ASV system. This survey starts with discussion about the various components of ASV system. Then, the paper classifies and discusses various enhanced front end feature extraction techniques like phase based, deep learning based, magnitude-based feature extraction techniques etc., which have been proven to be robust in handling noise. Secondly, the survey highlights the various deep learning and other baseline models that are used in backend, for classification of the audio correctly. Finally, it highlights the challenges and issues that still exist in noise handling and detection, while developing noise robust ASV systems. Therefore, on the basis of the proposed survey it can be interpreted that the noise robustness of ASV system is the challenging issue. Hence the researchers should consider the robustness of ASV against noise along with spoofing attacks.</p>","PeriodicalId":51194,"journal":{"name":"Telecommunication Systems","volume":"12 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Telecommunication Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11235-024-01212-8","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Like any other biometric systems, Automatic Speaker Verification (ASV) systems are also vulnerable to the spoofing attacks. Hence, it is important to develop the countermeasures in order to handle these attacks. In spoofing mainly two types of attacks are considered, logical access attacks and presentation attacks. In the last few decades, several systems have been proposed by various researchers for handling these kinds of attacks. However, noise handling capability of ASV systems is of major concern, as the presence of noise may make an ASV system to falsely evaluate the original human voice as the spoofed audio. Hence, the main objective of this paper is to review and analyze the various noise robust ASV systems proposed by different researchers in recent years. The paper discusses the various front end and back-end approaches that have been used to develop these systems with putting emphasis on the noise handling techniques. Various kinds of noises such as babble, white, background noises, pop noise, channel noises etc. affect the development of an ASV system. This survey starts with discussion about the various components of ASV system. Then, the paper classifies and discusses various enhanced front end feature extraction techniques like phase based, deep learning based, magnitude-based feature extraction techniques etc., which have been proven to be robust in handling noise. Secondly, the survey highlights the various deep learning and other baseline models that are used in backend, for classification of the audio correctly. Finally, it highlights the challenges and issues that still exist in noise handling and detection, while developing noise robust ASV systems. Therefore, on the basis of the proposed survey it can be interpreted that the noise robustness of ASV system is the challenging issue. Hence the researchers should consider the robustness of ASV against noise along with spoofing attacks.

Abstract Image

噪声稳健型自动语音验证系统:回顾与分析
与其他生物识别系统一样,自动语音验证(ASV)系统也容易受到欺骗攻击。因此,开发应对这些攻击的措施非常重要。在欺骗攻击中,主要有两类攻击,即逻辑访问攻击和呈现攻击。在过去的几十年里,不同的研究人员已经提出了几种系统来处理这类攻击。然而,ASV 系统的噪声处理能力是一个主要问题,因为噪声的存在可能会使 ASV 系统错误地将原始人声评估为欺骗音频。因此,本文的主要目的是回顾和分析近年来不同研究人员提出的各种抗噪声 ASV 系统。本文讨论了用于开发这些系统的各种前端和后端方法,并重点讨论了噪声处理技术。各种噪声,如嗡嗡声、白噪声、背景噪声、流行噪声、信道噪声等,都会影响 ASV 系统的开发。本研究首先讨论了 ASV 系统的各个组成部分。然后,本文对各种增强型前端特征提取技术进行了分类和讨论,如基于相位的特征提取技术、基于深度学习的特征提取技术、基于幅度的特征提取技术等,这些技术已被证明在处理噪声方面具有鲁棒性。其次,调查重点介绍了后端使用的各种深度学习和其他基线模型,以便对音频进行正确分类。最后,它强调了在开发噪声稳健型 ASV 系统时,噪声处理和检测方面仍然存在的挑战和问题。因此,根据所提议的调查,可以认为 ASV 系统的噪声鲁棒性是一个具有挑战性的问题。因此,研究人员应考虑 ASV 对噪声和欺骗攻击的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Telecommunication Systems
Telecommunication Systems 工程技术-电信学
CiteScore
5.40
自引率
8.00%
发文量
105
审稿时长
6.0 months
期刊介绍: Telecommunication Systems is a journal covering all aspects of modeling, analysis, design and management of telecommunication systems. The journal publishes high quality articles dealing with the use of analytic and quantitative tools for the modeling, analysis, design and management of telecommunication systems covering: Performance Evaluation of Wide Area and Local Networks; Network Interconnection; Wire, wireless, Adhoc, mobile networks; Impact of New Services (economic and organizational impact); Fiberoptics and photonic switching; DSL, ADSL, cable TV and their impact; Design and Analysis Issues in Metropolitan Area Networks; Networking Protocols; Dynamics and Capacity Expansion of Telecommunication Systems; Multimedia Based Systems, Their Design Configuration and Impact; Configuration of Distributed Systems; Pricing for Networking and Telecommunication Services; Performance Analysis of Local Area Networks; Distributed Group Decision Support Systems; Configuring Telecommunication Systems with Reliability and Availability; Cost Benefit Analysis and Economic Impact of Telecommunication Systems; Standardization and Regulatory Issues; Security, Privacy and Encryption in Telecommunication Systems; Cellular, Mobile and Satellite Based Systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信