Study of approaches to the synthesis and detection of cloned voices (DeepFake)

Digital Technology Security Pub Date : 2022-09-29 DOI:10.17212/2782-2230-2022-3-62-80

Andrey Ivanov, Stepan A. Primak, Viktor A. Mazurenko

{"title":"Study of approaches to the synthesis and detection of cloned voices (DeepFake)","authors":"Andrey Ivanov, Stepan A. Primak, Viktor A. Mazurenko","doi":"10.17212/2782-2230-2022-3-62-80","DOIUrl":null,"url":null,"abstract":"Modern methods of protecting personal information often uses the voice biometric data of the owner of the information to identify the user. When the owner of the information voices the passphrase, he confirms his identity. However, attackers take advantage of the imperfection of such systems and develop methods for voice cloning, to create a twinkly voice for a cyberattack on personal data protection systems. Within the framework of this article, an attempt is made to explore existing methods for detecting cloned voices in order to protect information and counteract cyberattacks. Also, to achieve results, detection systems will be tested on a sample of Russian-language voice recordings taken from open sources. A comparative assessment of existing approaches is carried out in terms of their practical applicability. In particular, the requirements for the occupied memory of a computing device, computational complexity, complexity in implementation and data collection for training were taken into account. In addition, an analysis of the existing prerequisites and trends for the use of voice synthesis and substitution systems was carried out, potential risks were described, and examples of possible damage from the theft of biometric data were given. An attempt was also made to describe the experimental procedure for evaluating the performance of the considered methods with specifying and clarifying conditions. The criteria for verification and validation of the results are set, which allow drawing conclusions about the efficiency of the systems.","PeriodicalId":207311,"journal":{"name":"Digital Technology Security","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Technology Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17212/2782-2230-2022-3-62-80","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Modern methods of protecting personal information often uses the voice biometric data of the owner of the information to identify the user. When the owner of the information voices the passphrase, he confirms his identity. However, attackers take advantage of the imperfection of such systems and develop methods for voice cloning, to create a twinkly voice for a cyberattack on personal data protection systems. Within the framework of this article, an attempt is made to explore existing methods for detecting cloned voices in order to protect information and counteract cyberattacks. Also, to achieve results, detection systems will be tested on a sample of Russian-language voice recordings taken from open sources. A comparative assessment of existing approaches is carried out in terms of their practical applicability. In particular, the requirements for the occupied memory of a computing device, computational complexity, complexity in implementation and data collection for training were taken into account. In addition, an analysis of the existing prerequisites and trends for the use of voice synthesis and substitution systems was carried out, potential risks were described, and examples of possible damage from the theft of biometric data were given. An attempt was also made to describe the experimental procedure for evaluating the performance of the considered methods with specifying and clarifying conditions. The criteria for verification and validation of the results are set, which allow drawing conclusions about the efficiency of the systems.

查看原文本刊更多论文

克隆语音合成与检测方法研究(DeepFake)

现代保护个人信息的方法通常使用信息所有者的语音生物识别数据来识别用户。当信息的所有者发出密码短语时，他就确认了自己的身份。然而，攻击者利用这些系统的不完善，开发了语音克隆的方法，为个人数据保护系统的网络攻击创造了一个闪烁的声音。在本文的框架内，试图探索现有的检测克隆声音的方法，以保护信息和抵御网络攻击。此外，为了取得成果，检测系统将在公开来源的俄语录音样本上进行测试。对现有方法的实际适用性进行了比较评估。特别是考虑到对计算设备占用内存的要求、计算复杂性、执行的复杂性和用于训练的数据收集。此外，还分析了使用语音合成和替代系统的现有先决条件和趋势，描述了潜在风险，并给出了盗窃生物特征数据可能造成的损害的示例。还试图描述实验程序，以确定和澄清条件来评估所考虑的方法的性能。确定了结果验证和确认的标准，从而可以得出有关系统效率的结论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Digital Technology Security

自引率

0.00%

发文量