Verifying Integrity of Deep Ensemble Models by Lossless Black-box Watermarking with Sensitive Samples

2022 10th International Symposium on Digital Forensics and Security (ISDFS) Pub Date : 2022-05-09 DOI:10.1109/ISDFS55398.2022.9800818

Li-Chiun Lin, Hanzhou Wu

{"title":"Verifying Integrity of Deep Ensemble Models by Lossless Black-box Watermarking with Sensitive Samples","authors":"Li-Chiun Lin, Hanzhou Wu","doi":"10.1109/ISDFS55398.2022.9800818","DOIUrl":null,"url":null,"abstract":"With the widespread use of deep neural networks (DNNs) in many areas, more and more studies focus on protecting DNN models from intellectual property (IP) infringement. Many existing methods apply digital watermarking to protect the DNN models. The majority of them either embed a watermark directly into the internal network structure/parameters or insert a zero-bit watermark by fine-tuning a model to be protected with a set of so-called trigger samples. Though these methods work very well, they were designed for individual DNN models, which cannot be directly applied to deep ensemble models (DEMs) that combine multiple DNN models to make the final decision. It motivates us to propose a novel black-box watermarking method in this paper for DEMs, which can be used for verifying the integrity of DEMs. In the proposed method, a certain number of sensitive samples are carefully selected through mimicking real-world DEM attacks and analyzing the prediction results of the sub-models of the non-attacked DEM and the attacked DEM on the carefully crafted dataset. By analyzing the prediction results of the target DEM on these carefully crafted sensitive samples, we are able to verify the integrity of the target DEM. Different from many previous methods, the proposed method does not modify the original DEM to be protected, which indicates that the proposed method is lossless. Experimental results have shown that the DEM integrity can be reliably verified even if only one sub-model was attacked, which has good potential in practice.","PeriodicalId":114335,"journal":{"name":"2022 10th International Symposium on Digital Forensics and Security (ISDFS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Symposium on Digital Forensics and Security (ISDFS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDFS55398.2022.9800818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

With the widespread use of deep neural networks (DNNs) in many areas, more and more studies focus on protecting DNN models from intellectual property (IP) infringement. Many existing methods apply digital watermarking to protect the DNN models. The majority of them either embed a watermark directly into the internal network structure/parameters or insert a zero-bit watermark by fine-tuning a model to be protected with a set of so-called trigger samples. Though these methods work very well, they were designed for individual DNN models, which cannot be directly applied to deep ensemble models (DEMs) that combine multiple DNN models to make the final decision. It motivates us to propose a novel black-box watermarking method in this paper for DEMs, which can be used for verifying the integrity of DEMs. In the proposed method, a certain number of sensitive samples are carefully selected through mimicking real-world DEM attacks and analyzing the prediction results of the sub-models of the non-attacked DEM and the attacked DEM on the carefully crafted dataset. By analyzing the prediction results of the target DEM on these carefully crafted sensitive samples, we are able to verify the integrity of the target DEM. Different from many previous methods, the proposed method does not modify the original DEM to be protected, which indicates that the proposed method is lossless. Experimental results have shown that the DEM integrity can be reliably verified even if only one sub-model was attacked, which has good potential in practice.

查看原文本刊更多论文

利用敏感样本无损黑盒水印验证深度集成模型的完整性

随着深度神经网络(DNN)在许多领域的广泛应用，保护深度神经网络模型免受知识产权(IP)侵犯的研究越来越受到关注。现有的许多方法都采用数字水印来保护深度神经网络模型。它们中的大多数要么直接将水印嵌入到内部网络结构/参数中，要么通过一组所谓的触发样本微调被保护的模型来插入零比特水印。虽然这些方法效果很好，但它们是为单个DNN模型设计的，不能直接应用于深度集成模型(dem)，后者需要组合多个DNN模型来做出最终决定。这促使我们提出了一种新的dem黑盒水印方法，可以用来验证dem的完整性。该方法通过模拟真实的DEM攻击，分析未受攻击DEM和受攻击DEM的子模型在精心制作的数据集上的预测结果，精心选择一定数量的敏感样本。通过分析目标DEM在这些精心制作的敏感样本上的预测结果，我们能够验证目标DEM的完整性。与以往许多方法不同的是，该方法不修改待保护的原始DEM，具有无损性。实验结果表明，即使只有一个子模型受到攻击，也能可靠地验证DEM的完整性，具有良好的应用潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 10th International Symposium on Digital Forensics and Security (ISDFS)

自引率

0.00%

发文量