{"title":"Defending Low-Bandwidth Talking Head Videoconferencing Systems From Real-Time Puppeteering Attacks","authors":"Danial Samadi Vahdati, T. D. Nguyen, M. Stamm","doi":"10.1109/CVPRW59228.2023.00105","DOIUrl":null,"url":null,"abstract":"Talking head videos have gained significant attention in recent years due to advances in AI that allow for the synthesis of realistic videos from only a single image of the speaker. Recently, researchers have proposed low bandwidth talking head video systems for use in applications such as videoconferencing and video calls. However, these systems are vulnerable to puppeteering attacks, where an attacker can control a synthetic version of a different target speaker in real-time. This can be potentially used spread misinformation or committing fraud. Because the receiver always creates a synthetic video of the speaker, deepfake detectors cannot protect against these attacks. As a result, there are currently no defenses against puppeteering in these systems. In this paper, we propose a new defense against puppeteering attacks in low-bandwidth talking head video systems by utilizing the biometric information inherent in the facial expression and pose data transmitted to the receiver. Our proposed system requires no modifications to the video transmission system and operates with low computational cost. We present experimental evidence to demonstrate the effectiveness of our proposed defense and provide a new dataset for benchmarking defenses against puppeteering attacks.","PeriodicalId":355438,"journal":{"name":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW59228.2023.00105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Talking head videos have gained significant attention in recent years due to advances in AI that allow realistic videos to be synthesized from only a single image of the speaker. Recently, researchers have proposed low-bandwidth talking head video systems for use in applications such as videoconferencing and video calls. However, these systems are vulnerable to puppeteering attacks, in which an attacker controls a synthetic version of a different target speaker in real time. This can potentially be used to spread misinformation or commit fraud. Because the receiver always creates a synthetic video of the speaker, deepfake detectors cannot protect against these attacks. As a result, there are currently no defenses against puppeteering in these systems. In this paper, we propose a new defense against puppeteering attacks in low-bandwidth talking head video systems that utilizes the biometric information inherent in the facial expression and pose data transmitted to the receiver. Our proposed system requires no modifications to the video transmission system and operates with low computational cost. We present experimental evidence demonstrating the effectiveness of our proposed defense and provide a new dataset for benchmarking defenses against puppeteering attacks.
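To make the core idea concrete, the sketch below illustrates one way a defense of this kind could operate. It is not the paper's actual method: it assumes the low-bandwidth codec transmits per-frame facial expression/pose vectors to the receiver, and it substitutes simple hand-crafted motion-dynamics statistics for a learned biometric embedding. The function names (`dynamics_features`, `is_puppeteered`), the feature dimensionality, and the decision threshold are all hypothetical choices for illustration.

```python
# Illustrative sketch only, not the authors' method. Assumes the receiver
# already gets a (T, D) stream of facial expression / head-pose vectors per
# call, and that puppeteering can be flagged by comparing the dynamics of
# that stream against a profile enrolled from the genuine speaker.
import numpy as np


def dynamics_features(frames: np.ndarray) -> np.ndarray:
    """Summarize a (T, D) expression/pose stream as a fixed-length vector.

    Concatenates the per-dimension mean and standard deviation of the
    frame-to-frame deltas -- a crude stand-in for a learned biometric
    embedding of the speaker's motion style.
    """
    deltas = np.diff(frames, axis=0)  # (T-1, D) frame-to-frame motion
    return np.concatenate([deltas.mean(axis=0), deltas.std(axis=0)])


def is_puppeteered(stream: np.ndarray,
                   enrolled_profile: np.ndarray,
                   threshold: float = 1.0) -> bool:
    """Flag the call if the stream's motion dynamics drift too far from
    the enrolled profile of the claimed speaker (hypothetical threshold)."""
    distance = np.linalg.norm(dynamics_features(stream) - enrolled_profile)
    return float(distance) > threshold


# Toy usage: enroll on one clip of the genuine speaker, test two streams.
rng = np.random.default_rng(0)
genuine = rng.normal(0.0, 0.1, size=(300, 68))   # e.g., 68 keypoint values/frame
profile = dynamics_features(genuine)

genuine2 = rng.normal(0.0, 0.1, size=(300, 68))  # same motion statistics
attack = rng.normal(0.0, 0.4, size=(300, 68))    # attacker-driven motion
print(is_puppeteered(genuine2, profile))          # False: dynamics match
print(is_puppeteered(attack, profile))            # True: dynamics differ
```

The property this sketch mirrors from the abstract is that verification runs entirely on the expression and pose data the system already transmits, so the defense needs no change to the video transmission pipeline and no pixel-level analysis of the synthesized video.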