{"title":"基于模型参数分析的深度神经网络后门攻击检测","authors":"Mingyuan Ma, Hu Li, Xiaohui Kuang","doi":"10.1109/ICTAI56018.2022.00098","DOIUrl":null,"url":null,"abstract":"With the introduction of the backdoor in deep neural networks (DNNs), much research focuses on backdoor attacks and defenses against DNNs. Since many DNN models are developed based on public datasets and pre-trained models often published by untrusted third parties, backdoors can be easily injected. The defender usually cannot access training data and does not know the target class or the triggers of the backdoor injected by the attacker. All these make it challenging to guarantee the security of decision guidance and support systems. In this paper, we proposed to detect backdoor attacks on DNNs based on model parameters analysis (MPA). We extracted and selected parameters related to the backdoor in the model's hidden layer and decision layer and trained the MPA classifier based on these parameters. We evaluated the effectiveness of the MPA classifier on various target models. The results show that the area under the receiver operating characteristic curve of the MPA classifier reaches 0.96 and 0.86 on the CIFAR10 and Troj target models, respectively. The MPA classifier improved the detection rate of backdoor attacks by 2%-6% compared with other advanced methods, with less prior knowledge and more relaxed constraints.","PeriodicalId":354314,"journal":{"name":"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting Backdoor Attacks on Deep Neural Networks Based on Model Parameters Analysis\",\"authors\":\"Mingyuan Ma, Hu Li, Xiaohui Kuang\",\"doi\":\"10.1109/ICTAI56018.2022.00098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the introduction of the backdoor in deep neural networks (DNNs), much research focuses on backdoor attacks and defenses against DNNs. Since many DNN models are developed based on public datasets and pre-trained models often published by untrusted third parties, backdoors can be easily injected. The defender usually cannot access training data and does not know the target class or the triggers of the backdoor injected by the attacker. All these make it challenging to guarantee the security of decision guidance and support systems. In this paper, we proposed to detect backdoor attacks on DNNs based on model parameters analysis (MPA). We extracted and selected parameters related to the backdoor in the model's hidden layer and decision layer and trained the MPA classifier based on these parameters. We evaluated the effectiveness of the MPA classifier on various target models. The results show that the area under the receiver operating characteristic curve of the MPA classifier reaches 0.96 and 0.86 on the CIFAR10 and Troj target models, respectively. The MPA classifier improved the detection rate of backdoor attacks by 2%-6% compared with other advanced methods, with less prior knowledge and more relaxed constraints.\",\"PeriodicalId\":354314,\"journal\":{\"name\":\"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTAI56018.2022.00098\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI56018.2022.00098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detecting Backdoor Attacks on Deep Neural Networks Based on Model Parameters Analysis
With the introduction of the backdoor in deep neural networks (DNNs), much research focuses on backdoor attacks and defenses against DNNs. Since many DNN models are developed based on public datasets and pre-trained models often published by untrusted third parties, backdoors can be easily injected. The defender usually cannot access training data and does not know the target class or the triggers of the backdoor injected by the attacker. All these make it challenging to guarantee the security of decision guidance and support systems. In this paper, we proposed to detect backdoor attacks on DNNs based on model parameters analysis (MPA). We extracted and selected parameters related to the backdoor in the model's hidden layer and decision layer and trained the MPA classifier based on these parameters. We evaluated the effectiveness of the MPA classifier on various target models. The results show that the area under the receiver operating characteristic curve of the MPA classifier reaches 0.96 and 0.86 on the CIFAR10 and Troj target models, respectively. The MPA classifier improved the detection rate of backdoor attacks by 2%-6% compared with other advanced methods, with less prior knowledge and more relaxed constraints.