Attacking Deep Learning AI Hardware with Universal Adversarial Perturbation

IF 2.4 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information (Switzerland) Pub Date : 2023-09-19 DOI:10.3390/info14090516

Mehdi Sadi, B. M. S. Bahar Talukder, Kaniz Mishty, Tauhidur Rahman

引用次数: 0

Abstract

Universal adversarial perturbations are image-agnostic and model-independent noise that, when added to any image, can mislead the trained deep convolutional neural networks into the wrong prediction. Since these universal adversarial perturbations can seriously jeopardize the security and integrity of practical deep learning applications, the existing techniques use additional neural networks to detect the existence of these noises at the input image source. In this paper, we demonstrate an attack strategy that, when activated by rogue means (e.g., malware, trojan), can bypass these existing countermeasures by augmenting the adversarial noise at the AI hardware accelerator stage. We demonstrate the accelerator-level universal adversarial noise attack on several deep learning models using co-simulation of the software kernel of the Conv2D function and the Verilog RTL model of the hardware under the FuseSoC environment.

查看原文本刊更多论文

用通用对抗性扰动攻击深度学习AI硬件

通用对抗性扰动是图像不可知和模型无关的噪声，当添加到任何图像中时，可能会误导训练有素的深度卷积神经网络进行错误的预测。由于这些普遍的对抗性扰动会严重危及实际深度学习应用的安全性和完整性，现有的技术使用额外的神经网络来检测输入图像源处这些噪声的存在。在本文中，我们展示了一种攻击策略，当被流氓手段(例如，恶意软件，特洛伊木马)激活时，可以通过增加人工智能硬件加速器阶段的对抗性噪声来绕过这些现有的对策。在FuseSoC环境下，利用Conv2D函数的软件内核和硬件的Verilog RTL模型的联合仿真，我们演示了加速器级通用对抗性噪声攻击在几个深度学习模型上的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊