DeepWise Cyber Teen Guardian: Protecting Internet Environment via a Novel Automatic Adversarial Samples Detection System Based on Revised Neural Network

Yi-Hsien Lin
{"title":"DeepWise Cyber Teen Guardian: Protecting Internet Environment via a Novel Automatic Adversarial Samples Detection System Based on Revised Neural Network","authors":"Yi-Hsien Lin","doi":"10.1109/EPCE58798.2023.00022","DOIUrl":null,"url":null,"abstract":"In order to assist teenagers to surf the internet healthily, there are image filters based on image recognition neural networks removing inappropriate content, but image filters are vulnerable to attack by adversarial samples, which are generated by adding well-crafted noise to an image, making the filter consider an inappropriate image as appropriate. In order to defend against adversarial attacks, various adversarial detection algorithms are designed. The state-of-the-art detector enjoys a promising detection performance, but it suffers from high computational overhead. In this work, DeepWise is proposed, which records the mean and covariance for each class of images at each recording layer of the neural network during training. The input image’s mean and covariance at each layer and the network’s classification of the image are also recorded. Then, the Mahalanobis distance is calculated between the distributions of input image and the training images at each layer based on the network’s prediction. A linear regressor takes these distances as input, and determines whether the input image is an adversarial sample or not. DeepWise achieves a computational saving ranging from 39.77% to 87.98% for datasets SVHN, CIFAR- 10, and CIFAR-100 on the image recognition model ResNet-34 and DenseNet-3 while preserving a comparable AUROC to the existing state-of-the-art method.","PeriodicalId":355442,"journal":{"name":"2023 2nd Asia Conference on Electrical, Power and Computer Engineering (EPCE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd Asia Conference on Electrical, Power and Computer Engineering (EPCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EPCE58798.2023.00022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In order to assist teenagers to surf the internet healthily, there are image filters based on image recognition neural networks removing inappropriate content, but image filters are vulnerable to attack by adversarial samples, which are generated by adding well-crafted noise to an image, making the filter consider an inappropriate image as appropriate. In order to defend against adversarial attacks, various adversarial detection algorithms are designed. The state-of-the-art detector enjoys a promising detection performance, but it suffers from high computational overhead. In this work, DeepWise is proposed, which records the mean and covariance for each class of images at each recording layer of the neural network during training. The input image’s mean and covariance at each layer and the network’s classification of the image are also recorded. Then, the Mahalanobis distance is calculated between the distributions of input image and the training images at each layer based on the network’s prediction. A linear regressor takes these distances as input, and determines whether the input image is an adversarial sample or not. DeepWise achieves a computational saving ranging from 39.77% to 87.98% for datasets SVHN, CIFAR- 10, and CIFAR-100 on the image recognition model ResNet-34 and DenseNet-3 while preserving a comparable AUROC to the existing state-of-the-art method.
DeepWise网络青少年守护者:基于修正神经网络的新型自动对抗样本检测系统保护网络环境
为了帮助青少年健康上网,有基于图像识别神经网络的图像过滤器可以去除不合适的内容,但图像过滤器容易受到对抗性样本的攻击,对抗性样本是通过在图像中添加精心制作的噪声产生的,使过滤器将不合适的图像视为合适的。为了防御对抗性攻击,设计了各种对抗性检测算法。最先进的检测器具有良好的检测性能,但其计算开销较大。在这项工作中,提出了DeepWise,它在训练过程中在神经网络的每个记录层记录每一类图像的均值和协方差。记录输入图像在每一层的均值和协方差以及网络对图像的分类。然后,基于网络的预测,计算输入图像与训练图像在每一层的分布之间的马氏距离。线性回归器将这些距离作为输入,并确定输入图像是否是对抗性样本。DeepWise在图像识别模型ResNet-34和DenseNet-3上对SVHN、CIFAR-10和CIFAR-100数据集实现了39.77%至87.98%的计算节省,同时保持了与现有最先进方法相当的AUROC。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信