{"title":"Audio Replay Spoof Attack Detection Using A GMM-RFPNN Model as Back-end Classifier","authors":"Kaikai Qi, Wei Huang, Dan Wang, Honghao Zhang","doi":"10.1109/ICAICE54393.2021.00089","DOIUrl":null,"url":null,"abstract":"Research on automatic speaker verification (ASV) techniques has received academic attention in recent years and has begun to be applied to authentication, but research on the security performance of ASV is just beginning. In this paper, we will focus on speech replay spoofing attack detection in speaker authentication techniques. Voice is a biological behavioral feature with high inter-class variability and susceptibility to environmental and temporal influences. In this paper, classical constant Q cepstral coefficient features (CQCC) and Gaussian super-vectors are used as front-end feature extractors and fuzzy polynomial neural network (FPNN) models with regularization processing are used as back-end classifiers for true and false speech detection. Compared with other traditional machine learning models and deep learning models, this model shows stronger robustness and generalization ability on acoustic environment and time variation, and good detection results can be obtained using a small number of samples for training. Tested on the ASV spoof 2017 version 2.0 database, the detection performance is improved by about 39% compared to the original baseline system.","PeriodicalId":388444,"journal":{"name":"2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICE54393.2021.00089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Research on automatic speaker verification (ASV) techniques has received academic attention in recent years and has begun to be applied to authentication, but research on the security performance of ASV is just beginning. In this paper, we will focus on speech replay spoofing attack detection in speaker authentication techniques. Voice is a biological behavioral feature with high inter-class variability and susceptibility to environmental and temporal influences. In this paper, classical constant Q cepstral coefficient features (CQCC) and Gaussian super-vectors are used as front-end feature extractors and fuzzy polynomial neural network (FPNN) models with regularization processing are used as back-end classifiers for true and false speech detection. Compared with other traditional machine learning models and deep learning models, this model shows stronger robustness and generalization ability on acoustic environment and time variation, and good detection results can be obtained using a small number of samples for training. Tested on the ASV spoof 2017 version 2.0 database, the detection performance is improved by about 39% compared to the original baseline system.