基于模型参数分析的深度神经网络后门攻击检测

2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI) Pub Date : 2022-10-01 DOI:10.1109/ICTAI56018.2022.00098

Mingyuan Ma, Hu Li, Xiaohui Kuang

{"title":"基于模型参数分析的深度神经网络后门攻击检测","authors":"Mingyuan Ma, Hu Li, Xiaohui Kuang","doi":"10.1109/ICTAI56018.2022.00098","DOIUrl":null,"url":null,"abstract":"With the introduction of the backdoor in deep neural networks (DNNs), much research focuses on backdoor attacks and defenses against DNNs. Since many DNN models are developed based on public datasets and pre-trained models often published by untrusted third parties, backdoors can be easily injected. The defender usually cannot access training data and does not know the target class or the triggers of the backdoor injected by the attacker. All these make it challenging to guarantee the security of decision guidance and support systems. In this paper, we proposed to detect backdoor attacks on DNNs based on model parameters analysis (MPA). We extracted and selected parameters related to the backdoor in the model's hidden layer and decision layer and trained the MPA classifier based on these parameters. We evaluated the effectiveness of the MPA classifier on various target models. The results show that the area under the receiver operating characteristic curve of the MPA classifier reaches 0.96 and 0.86 on the CIFAR10 and Troj target models, respectively. The MPA classifier improved the detection rate of backdoor attacks by 2%-6% compared with other advanced methods, with less prior knowledge and more relaxed constraints.","PeriodicalId":354314,"journal":{"name":"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting Backdoor Attacks on Deep Neural Networks Based on Model Parameters Analysis\",\"authors\":\"Mingyuan Ma, Hu Li, Xiaohui Kuang\",\"doi\":\"10.1109/ICTAI56018.2022.00098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the introduction of the backdoor in deep neural networks (DNNs), much research focuses on backdoor attacks and defenses against DNNs. Since many DNN models are developed based on public datasets and pre-trained models often published by untrusted third parties, backdoors can be easily injected. The defender usually cannot access training data and does not know the target class or the triggers of the backdoor injected by the attacker. All these make it challenging to guarantee the security of decision guidance and support systems. In this paper, we proposed to detect backdoor attacks on DNNs based on model parameters analysis (MPA). We extracted and selected parameters related to the backdoor in the model's hidden layer and decision layer and trained the MPA classifier based on these parameters. We evaluated the effectiveness of the MPA classifier on various target models. The results show that the area under the receiver operating characteristic curve of the MPA classifier reaches 0.96 and 0.86 on the CIFAR10 and Troj target models, respectively. The MPA classifier improved the detection rate of backdoor attacks by 2%-6% compared with other advanced methods, with less prior knowledge and more relaxed constraints.\",\"PeriodicalId\":354314,\"journal\":{\"name\":\"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTAI56018.2022.00098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI56018.2022.00098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着后门在深度神经网络(dnn)中的引入，对后门攻击和防御的研究越来越多。由于许多DNN模型是基于公共数据集和预训练模型开发的，通常由不受信任的第三方发布，因此很容易注入后门。防御者通常无法访问训练数据，也不知道攻击者注入的目标类或后门的触发器。这些都给决策指导和支持系统的安全性保障带来了挑战。本文提出了基于模型参数分析(MPA)的深度神经网络后门攻击检测方法。我们在模型的隐藏层和决策层中提取和选择与后门相关的参数，并基于这些参数训练MPA分类器。我们评估了MPA分类器在不同目标模型上的有效性。结果表明，在CIFAR10和Troj目标模型上，MPA分类器的受机工作特性曲线下面积分别达到0.96和0.86。与其他先进的方法相比，MPA分类器的后门攻击检出率提高了2%-6%，先验知识更少，约束更宽松。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Detecting Backdoor Attacks on Deep Neural Networks Based on Model Parameters Analysis

With the introduction of the backdoor in deep neural networks (DNNs), much research focuses on backdoor attacks and defenses against DNNs. Since many DNN models are developed based on public datasets and pre-trained models often published by untrusted third parties, backdoors can be easily injected. The defender usually cannot access training data and does not know the target class or the triggers of the backdoor injected by the attacker. All these make it challenging to guarantee the security of decision guidance and support systems. In this paper, we proposed to detect backdoor attacks on DNNs based on model parameters analysis (MPA). We extracted and selected parameters related to the backdoor in the model's hidden layer and decision layer and trained the MPA classifier based on these parameters. We evaluated the effectiveness of the MPA classifier on various target models. The results show that the area under the receiver operating characteristic curve of the MPA classifier reaches 0.96 and 0.86 on the CIFAR10 and Troj target models, respectively. The MPA classifier improved the detection rate of backdoor attacks by 2%-6% compared with other advanced methods, with less prior knowledge and more relaxed constraints.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)

自引率

0.00%

发文量