Towards few-call model stealing via active self-paced knowledge distillation and diffusion-based image generation

IF 10.7 | Region 2 (Computer Science) | Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Vlad Hondru, Radu Tudor Ionescu
{"title":"Towards few-call model stealing via active self-paced knowledge distillation and diffusion-based image generation","authors":"Vlad Hondru,&nbsp;Radu Tudor Ionescu","doi":"10.1007/s10462-025-11184-z","DOIUrl":null,"url":null,"abstract":"<div><p>Diffusion models showcase strong capabilities in image synthesis, being used in many computer vision tasks with great success. To this end, we propose to explore a new use case, namely to copy black-box classification models without having access to the original training data, the architecture, and the weights of the model, i.e. the model is only exposed through an inference API. More specifically, we can only observe the (soft or hard) labels for some image samples passed as input to the model. Furthermore, we consider an additional constraint limiting the number of model calls, mostly focusing our research on few-call model stealing. In order to solve the model extraction task given the applied restrictions, we propose the following framework. As training data, we create a synthetic data set (called proxy data set) by leveraging the ability of diffusion models to generate realistic and diverse images. Given a maximum number of allowed API calls, we pass the respective number of samples through the black-box model to collect labels. Finally, we distill the knowledge of the black-box teacher (attacked model) into a student model (copy of the attacked model), harnessing both labeled and unlabeled data generated by the diffusion model. We employ a novel active self-paced learning framework to make the most of the proxy data during distillation. Our empirical results on three data sets confirm the superiority of our framework over four state-of-the-art methods in the few-call model extraction scenario. We release our code for free non-commercial use at https://github.com/vladhondru25/model-stealing.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 8","pages":""},"PeriodicalIF":10.7000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11184-z.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11184-z","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0

Abstract

Diffusion models showcase strong capabilities in image synthesis and are used with great success in many computer vision tasks. Building on these capabilities, we explore a new use case, namely copying black-box classification models without access to the original training data, the architecture, or the weights of the model, i.e. the model is exposed only through an inference API. More specifically, we can only observe the (soft or hard) labels for image samples passed as input to the model. Furthermore, we consider an additional constraint limiting the number of model calls, mostly focusing our research on few-call model stealing. To solve the model extraction task under these restrictions, we propose the following framework. As training data, we create a synthetic data set (called the proxy data set) by leveraging the ability of diffusion models to generate realistic and diverse images. Given a maximum number of allowed API calls, we pass the respective number of samples through the black-box model to collect labels. Finally, we distill the knowledge of the black-box teacher (the attacked model) into a student model (the copy of the attacked model), harnessing both the labeled and the unlabeled data generated by the diffusion model. We employ a novel active self-paced learning framework to make the most of the proxy data during distillation. Our empirical results on three data sets confirm the superiority of our framework over four state-of-the-art methods in the few-call model extraction scenario. We release our code for free non-commercial use at https://github.com/vladhondru25/model-stealing.
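To make the framework above concrete, below is a minimal PyTorch sketch of one distillation step, assuming the proxy images have already been generated by a diffusion model and that the allowed API calls have been spent collecting soft labels for a subset of them. The function name (distillation_step), the fixed confidence_threshold, and the simple confidence-based pseudo-labeling are illustrative assumptions, not the authors' implementation; the paper's active self-paced learning selects and schedules the unlabeled samples in a more elaborate way.

```python
# Minimal sketch of one knowledge-distillation step for few-call model stealing.
# Assumptions (not from the paper's code): a generic "student" classifier,
# soft labels already collected from the black-box API for "images_labeled",
# and a fixed confidence threshold in place of the active self-paced schedule.
import torch
import torch.nn.functional as F


def distillation_step(student, optimizer, images_labeled, soft_labels,
                      images_unlabeled, confidence_threshold=0.9):
    student.train()
    optimizer.zero_grad()

    # Supervised part: match the teacher's soft labels obtained via the API.
    log_probs = F.log_softmax(student(images_labeled), dim=1)
    loss_labeled = F.kl_div(log_probs, soft_labels, reduction="batchmean")

    # Self-paced part: pseudo-label the unlabeled proxy images and keep only
    # the samples the student is already confident about (easy examples first;
    # lowering the threshold over epochs mimics an easy-to-hard curriculum).
    with torch.no_grad():
        probs = F.softmax(student(images_unlabeled), dim=1)
        confidence, pseudo_labels = probs.max(dim=1)
        keep = confidence >= confidence_threshold

    loss_unlabeled = torch.zeros((), device=images_labeled.device)
    if keep.any():
        logits_kept = student(images_unlabeled[keep])
        loss_unlabeled = F.cross_entropy(logits_kept, pseudo_labels[keep])

    loss = loss_labeled + loss_unlabeled
    loss.backward()
    optimizer.step()
    return loss.item()
```

Under a budget of B API calls, only B proxy images ever reach the black-box teacher; every other diffusion-generated image would contribute to training only through the pseudo-labeling branch of such a step.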

Source journal
Artificial Intelligence Review (Engineering & Technology - Computer Science: Artificial Intelligence)
CiteScore: 22.00
Self-citation rate: 3.30%
Annual publications: 194
Review time: 5.3 months
Journal description: Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.