Core unlearning: A multi-modal gradient-efficient architecture for exact and approximate model rewriting

IF 6.9 1区管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Processing & Management Pub Date : 2025-10-01 DOI:10.1016/j.ipm.2025.104417

Saeed Iqbal , Xiaopin Zhong , Muhammad Attique Khan , Zongze Wu , Nouf Abdullah Almujally , Weixiang Liu , Amir Hussain

{"title":"Core unlearning: A multi-modal gradient-efficient architecture for exact and approximate model rewriting","authors":"Saeed Iqbal , Xiaopin Zhong , Muhammad Attique Khan , Zongze Wu , Nouf Abdullah Almujally , Weixiang Liu , Amir Hussain","doi":"10.1016/j.ipm.2025.104417","DOIUrl":null,"url":null,"abstract":"<div><div>Machine unlearning is important for data security, user confidence, and regulatory compliance in AI systems. Despite the significant achievement, existing techniques have limited generalizability across a broad set of forgetting scenarios — feature, class, task, stream, or catastrophic forgetting, and are devoid of a theoretical base, scalability, or computational efficiency. The proposed Core Unlearning (CU) framework bypasses these limitations by integrating state-of-the-art methods like latent space loss optimization, gradient ascent-augmented updates, Adapter Partition and Aggregation (APA), and Projection-Based Residual Adjustment (PBRA) into a unified structure that supports both Exact Unlearning (EU) and Approximate Unlearning (AU). In EU, Negative Preference Optimization (NPO) is employed, a strategy that treats target data as negative samples to actively suppress their influence during unlearning by penalizing correct predictions on forgotten data. Evaluating across multi-modal datasets like CIFAR-10, CIFAR, 100, IMDB4K, CORA, FEMNIST, and MVTec AD, CU achieves improved performance in forgetting fidelity, model utility, and privacy preservation. The GA+APA+NPO achieves up to 2.3% decreased accuracy loss, with 95.2% retraining equivalence, proving high-fidelity unlearning. In AU mode, our approach gets 92.3% forgetting accuracy, 85.7% utility score, and 90.2% unlearning efficiency, enabling a scalable solution for time-critical applications. With a seamless combination of EU and AU into a single paradigm, CU enables versatile management of the precision-speed trade-off, with support for strong application-specific unlearning. The work in this paper demonstrates an early step toward useful, mathematically robust, and privacy-preserving machine unlearning. Code available at: <span><span>CoreUnlearning</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 2","pages":"Article 104417"},"PeriodicalIF":6.9000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325003589","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Machine unlearning is important for data security, user confidence, and regulatory compliance in AI systems. Despite the significant achievement, existing techniques have limited generalizability across a broad set of forgetting scenarios — feature, class, task, stream, or catastrophic forgetting, and are devoid of a theoretical base, scalability, or computational efficiency. The proposed Core Unlearning (CU) framework bypasses these limitations by integrating state-of-the-art methods like latent space loss optimization, gradient ascent-augmented updates, Adapter Partition and Aggregation (APA), and Projection-Based Residual Adjustment (PBRA) into a unified structure that supports both Exact Unlearning (EU) and Approximate Unlearning (AU). In EU, Negative Preference Optimization (NPO) is employed, a strategy that treats target data as negative samples to actively suppress their influence during unlearning by penalizing correct predictions on forgotten data. Evaluating across multi-modal datasets like CIFAR-10, CIFAR, 100, IMDB4K, CORA, FEMNIST, and MVTec AD, CU achieves improved performance in forgetting fidelity, model utility, and privacy preservation. The GA+APA+NPO achieves up to 2.3% decreased accuracy loss, with 95.2% retraining equivalence, proving high-fidelity unlearning. In AU mode, our approach gets 92.3% forgetting accuracy, 85.7% utility score, and 90.2% unlearning efficiency, enabling a scalable solution for time-critical applications. With a seamless combination of EU and AU into a single paradigm, CU enables versatile management of the precision-speed trade-off, with support for strong application-specific unlearning. The work in this paper demonstrates an early step toward useful, mathematically robust, and privacy-preserving machine unlearning. Code available at: CoreUnlearning.

查看原文本刊更多论文

核心学习：用于精确和近似模型重写的多模态梯度高效架构

机器学习对于人工智能系统中的数据安全、用户信心和法规遵从性非常重要。尽管取得了重大成就，但现有技术在广泛的遗忘场景（特征、类、任务、流或灾难性遗忘）中的推广能力有限，并且缺乏理论基础、可扩展性或计算效率。提出的核心学习（CU）框架通过将潜在空间损失优化、梯度上升增强更新、适配器划分和聚合（APA）以及基于投影的残差调整（PBRA）等最先进的方法集成到一个支持精确学习（EU）和近似学习（AU）的统一结构中，从而绕过了这些限制。在EU中，采用了负偏好优化（NPO）策略，该策略将目标数据视为负样本，通过惩罚对被遗忘数据的正确预测来主动抑制其在遗忘过程中的影响。在CIFAR-10、CIFAR、100、IMDB4K、CORA、FEMNIST和MVTec AD等多模态数据集上进行评估，CU在遗忘保真度、模型效用和隐私保护方面取得了更好的性能。GA+APA+NPO的准确率损失降低了2.3%，再训练等效性达到95.2%，证明了高保真的去学习。在AU模式下，我们的方法获得了92.3%的遗忘准确率、85.7%的效用得分和90.2%的遗忘效率，为时间紧迫的应用提供了可扩展的解决方案。通过将EU和AU无缝地结合到一个范例中，CU可以实现对精度速度权衡的通用管理，并支持强大的特定于应用程序的学习。本文的工作向有用的、数学上健壮的、保护隐私的机器学习迈出了早期的一步。代码可在：CoreUnlearning。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Processing & Management 工程技术-计算机：信息系统

CiteScore

17.00

自引率

11.60%

发文量

276

审稿时长

39 days

期刊介绍： Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.