Audio Replay Spoof Attack Detection Using A GMM-RFPNN Model as Back-end Classifier

2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE) Pub Date : 2021-11-01 DOI:10.1109/ICAICE54393.2021.00089

Kaikai Qi, Wei Huang, Dan Wang, Honghao Zhang

{"title":"Audio Replay Spoof Attack Detection Using A GMM-RFPNN Model as Back-end Classifier","authors":"Kaikai Qi, Wei Huang, Dan Wang, Honghao Zhang","doi":"10.1109/ICAICE54393.2021.00089","DOIUrl":null,"url":null,"abstract":"Research on automatic speaker verification (ASV) techniques has received academic attention in recent years and has begun to be applied to authentication, but research on the security performance of ASV is just beginning. In this paper, we will focus on speech replay spoofing attack detection in speaker authentication techniques. Voice is a biological behavioral feature with high inter-class variability and susceptibility to environmental and temporal influences. In this paper, classical constant Q cepstral coefficient features (CQCC) and Gaussian super-vectors are used as front-end feature extractors and fuzzy polynomial neural network (FPNN) models with regularization processing are used as back-end classifiers for true and false speech detection. Compared with other traditional machine learning models and deep learning models, this model shows stronger robustness and generalization ability on acoustic environment and time variation, and good detection results can be obtained using a small number of samples for training. Tested on the ASV spoof 2017 version 2.0 database, the detection performance is improved by about 39% compared to the original baseline system.","PeriodicalId":388444,"journal":{"name":"2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICE54393.2021.00089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Research on automatic speaker verification (ASV) techniques has received academic attention in recent years and has begun to be applied to authentication, but research on the security performance of ASV is just beginning. In this paper, we will focus on speech replay spoofing attack detection in speaker authentication techniques. Voice is a biological behavioral feature with high inter-class variability and susceptibility to environmental and temporal influences. In this paper, classical constant Q cepstral coefficient features (CQCC) and Gaussian super-vectors are used as front-end feature extractors and fuzzy polynomial neural network (FPNN) models with regularization processing are used as back-end classifiers for true and false speech detection. Compared with other traditional machine learning models and deep learning models, this model shows stronger robustness and generalization ability on acoustic environment and time variation, and good detection results can be obtained using a small number of samples for training. Tested on the ASV spoof 2017 version 2.0 database, the detection performance is improved by about 39% compared to the original baseline system.

查看原文本刊更多论文

基于GMM-RFPNN模型的音频重放欺骗攻击检测

自动说话人验证(ASV)技术的研究近年来受到学术界的关注，并开始应用于身份验证，但对ASV安全性能的研究才刚刚起步。在本文中，我们将重点研究说话人认证技术中的语音重放欺骗攻击检测。声音是一种生物行为特征，具有高度的阶层间变异性，易受环境和时间影响。本文采用经典的常Q倒谱系数特征(CQCC)和高斯超向量作为前端特征提取器，采用经过正则化处理的模糊多项式神经网络(FPNN)模型作为后端分类器进行语音真假检测。与其他传统机器学习模型和深度学习模型相比，该模型对声环境和时间变化具有更强的鲁棒性和泛化能力，使用少量样本进行训练即可获得较好的检测结果。在ASV spoof 2017 2.0数据库上进行测试，检测性能比原始基准系统提高了约39%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)

自引率

0.00%

发文量