黑盒深度学习模型的漂移检测

IF 2.6 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

IT Professional Pub Date : 2024-05-01 DOI:10.1109/mitp.2023.3338007

Luca Piano, Fabio Garcea, Andrea Cavallone, Ignacio Aparicio Vazquez, Lia Morra, Fabrizio Lamberti

{"title":"黑盒深度学习模型的漂移检测","authors":"Luca Piano, Fabio Garcea, Andrea Cavallone, Ignacio Aparicio Vazquez, Lia Morra, Fabrizio Lamberti","doi":"10.1109/mitp.2023.3338007","DOIUrl":null,"url":null,"abstract":"Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this article, we propose a new approach for the detection of data drift in black-box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The article analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.","PeriodicalId":49045,"journal":{"name":"IT Professional","volume":"1 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Drift Detection for Black-Box Deep Learning Models\",\"authors\":\"Luca Piano, Fabio Garcea, Andrea Cavallone, Ignacio Aparicio Vazquez, Lia Morra, Fabrizio Lamberti\",\"doi\":\"10.1109/mitp.2023.3338007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this article, we propose a new approach for the detection of data drift in black-box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The article analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.\",\"PeriodicalId\":49045,\"journal\":{\"name\":\"IT Professional\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IT Professional\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1109/mitp.2023.3338007\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IT Professional","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/mitp.2023.3338007","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

数据集漂移是机器学习中的一个常见挑战，尤其是对于在非结构化数据（如图像）上训练的模型而言。在本文中，我们提出了一种检测黑盒模型数据漂移的新方法，它基于海灵格距离和特征提取方法。所提出的方法旨在检测数据漂移，而无需知道要监控的模型架构、训练模型的数据集或两者。文章分析了三种不同的用例，以评估所提方法的有效性，其中包括文档分割、分类和手写识别等多种任务。漂移考虑的用例包括对抗性攻击、领域转移和数据集偏差。实验结果表明，我们的漂移检测方法能在各种训练设置下有效识别分布变化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Drift Detection for Black-Box Deep Learning Models

Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this article, we propose a new approach for the detection of data drift in black-box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The article analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IT Professional COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

5.00

自引率

0.00%

发文量

111

审稿时长

>12 weeks

期刊介绍： IT Professional is a technical magazine of the IEEE Computer Society. It publishes peer-reviewed articles, columns and departments written for and by IT practitioners and researchers covering: practical aspects of emerging and leading-edge digital technologies, original ideas and guidance for IT applications, and novel IT solutions for the enterprise. IT Professional’s goal is to inform the broad spectrum of IT executives, IT project managers, IT researchers, and IT application developers from industry, government, and academia.