{"title":"Multi-band Spectral Entropy Information for Detection of Replay Attacks","authors":"Yitong Liu, Rohan Kumar Das, Haizhou Li","doi":"10.1109/APSIPAASC47483.2019.9023062","DOIUrl":null,"url":null,"abstract":"Replay attacks have been proven to be a potential threat to practical automatic speaker verification systems. In this work, we explore a novel feature based on spectral entropy for the detection of replay attacks. The spectral entropy is a measure to capture spectral distortions and flatness. It is found that the replay speech carries artifacts in the process of recording and playback. We hypothesize that spectral entropy can be a useful information to capture such artifacts. In this regard, we explore multi-band spectral entropy feature for replay attack detection. The studies are conducted on ASVspoof 2017 Version 2.0 database that deals with replay speech attacks. A baseline system with popular constant-Q cepstral coefficient (CQCC) feature is also developed. Finally, a combined system is proposed with multi-band spectral entropy and CQCC features that outperforms the baseline. The experiments validate the idea of multi-band spectral entropy feature.","PeriodicalId":145222,"journal":{"name":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPAASC47483.2019.9023062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Replay attacks have been proven to be a potential threat to practical automatic speaker verification systems. In this work, we explore a novel feature based on spectral entropy for the detection of replay attacks. The spectral entropy is a measure to capture spectral distortions and flatness. It is found that the replay speech carries artifacts in the process of recording and playback. We hypothesize that spectral entropy can be a useful information to capture such artifacts. In this regard, we explore multi-band spectral entropy feature for replay attack detection. The studies are conducted on ASVspoof 2017 Version 2.0 database that deals with replay speech attacks. A baseline system with popular constant-Q cepstral coefficient (CQCC) feature is also developed. Finally, a combined system is proposed with multi-band spectral entropy and CQCC features that outperforms the baseline. The experiments validate the idea of multi-band spectral entropy feature.
重播攻击已被证明是对实际自动说话人验证系统的潜在威胁。在这项工作中,我们探索了一种基于谱熵的新特征,用于检测重放攻击。光谱熵是捕获光谱失真和平坦度的一种度量。研究发现,重放语音在录音和重放过程中存在伪影。我们假设谱熵可以成为捕获此类伪影的有用信息。在这方面,我们探索了用于重放攻击检测的多波段频谱熵特征。本研究在ASVspoof 2017 Version 2.0数据库上进行,该数据库处理语音重放攻击。开发了一种具有常q倒谱系数(CQCC)特征的基线系统。最后,结合多波段谱熵和CQCC特征,提出了一种优于基线的组合系统。实验验证了多波段谱熵特征的思想。