Adversarial Perturbation Attacks on ML-based CAD

ACM Transactions on Design Automation of Electronic Systems (TODAES) Pub Date : 2020-08-21 DOI:10.1145/3408288

Kang Liu, Haoyu Yang, Yuzhe Ma, Benjamin Tan, Bei Yu, Evangeline F. Y. Young, R. Karri, S. Garg

{"title":"Adversarial Perturbation Attacks on ML-based CAD","authors":"Kang Liu, Haoyu Yang, Yuzhe Ma, Benjamin Tan, Bei Yu, Evangeline F. Y. Young, R. Karri, S. Garg","doi":"10.1145/3408288","DOIUrl":null,"url":null,"abstract":"There is substantial interest in the use of machine learning (ML)-based techniques throughout the electronic computer-aided design (CAD) flow, particularly those based on deep learning. However, while deep learning methods have surpassed state-of-the-art performance in several applications, they have exhibited intrinsic susceptibility to adversarial perturbations—small but deliberate alterations to the input of a neural network, precipitating incorrect predictions. In this article, we seek to investigate whether adversarial perturbations pose risks to ML-based CAD tools, and if so, how these risks can be mitigated. To this end, we use a motivating case study of lithographic hotspot detection, for which convolutional neural networks (CNN) have shown great promise. In this context, we show the first adversarial perturbation attacks on state-of-the-art CNN-based hotspot detectors; specifically, we show that small (on average 0.5% modified area), functionality preserving, and design-constraint-satisfying changes to a layout can nonetheless trick a CNN-based hotspot detector into predicting the modified layout as hotspot free (with up to 99.7% success in finding perturbations that flip a detector’s output prediction, based on a given set of attack constraints). We propose an adversarial retraining strategy to improve the robustness of CNN-based hotspot detection and show that this strategy significantly improves robustness (by a factor of ~3) against adversarial attacks without compromising classification accuracy.","PeriodicalId":6933,"journal":{"name":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","volume":"41 1","pages":"1 - 31"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Design Automation of Electronic Systems (TODAES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3408288","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

Abstract

There is substantial interest in the use of machine learning (ML)-based techniques throughout the electronic computer-aided design (CAD) flow, particularly those based on deep learning. However, while deep learning methods have surpassed state-of-the-art performance in several applications, they have exhibited intrinsic susceptibility to adversarial perturbations—small but deliberate alterations to the input of a neural network, precipitating incorrect predictions. In this article, we seek to investigate whether adversarial perturbations pose risks to ML-based CAD tools, and if so, how these risks can be mitigated. To this end, we use a motivating case study of lithographic hotspot detection, for which convolutional neural networks (CNN) have shown great promise. In this context, we show the first adversarial perturbation attacks on state-of-the-art CNN-based hotspot detectors; specifically, we show that small (on average 0.5% modified area), functionality preserving, and design-constraint-satisfying changes to a layout can nonetheless trick a CNN-based hotspot detector into predicting the modified layout as hotspot free (with up to 99.7% success in finding perturbations that flip a detector’s output prediction, based on a given set of attack constraints). We propose an adversarial retraining strategy to improve the robustness of CNN-based hotspot detection and show that this strategy significantly improves robustness (by a factor of ~3) against adversarial attacks without compromising classification accuracy.

查看原文本刊更多论文

基于ml的CAD对抗摄动攻击

在整个电子计算机辅助设计(CAD)流程中，特别是基于深度学习的技术，对使用基于机器学习(ML)的技术有很大的兴趣。然而，虽然深度学习方法在一些应用中已经超越了最先进的性能，但它们表现出对对抗性扰动的内在敏感性——对神经网络输入的微小但故意的改变，导致不正确的预测。在本文中，我们试图调查对抗性扰动是否对基于ml的CAD工具构成风险，如果是，如何减轻这些风险。为此，我们使用了光刻热点检测的激励案例研究，其中卷积神经网络(CNN)显示出巨大的前景。在这种情况下，我们展示了对最先进的基于cnn的热点探测器的第一次对抗性摄动攻击;具体来说，我们表明，对布局进行小的(平均0.5%的修改面积)，功能保留和设计约束满足的更改仍然可以欺骗基于cnn的热点检测器预测修改后的布局为无热点(基于给定的攻击约束集，发现推翻检测器输出预测的扰动的成功率高达99.7%)。我们提出了一种对抗性再训练策略来提高基于cnn的热点检测的鲁棒性，并表明该策略在不影响分类精度的情况下显著提高了对对抗性攻击的鲁棒性(提高了约3倍)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Design Automation of Electronic Systems (TODAES)

自引率

0.00%

发文量