BEFM: A balanced and efficient fine-tuning model in class-incremental learning

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-03-10 DOI:10.1016/j.knosys.2025.113298

Lize Liu, Jian Ji, Lei Zhao

{"title":"BEFM: A balanced and efficient fine-tuning model in class-incremental learning","authors":"Lize Liu, Jian Ji, Lei Zhao","doi":"10.1016/j.knosys.2025.113298","DOIUrl":null,"url":null,"abstract":"<div><div>The ultimate objective of class-incremental learning (CIL) is to solve the stability–plasticity dilemma by continuously learning new courses without losing what has been learnt from the previous ones. Typical CIL methods tend to favor sample replay and parameter regularization, but recent studies have shown that exploiting and adapting historical models can effectively improve model performance. We present A Balanced and Efficient Fine-tuning Model (BEFM) to improve memory consumption efficiency and prevent catastrophic forgetting in CIL by utilizing historical models. In order to fully utilize various normalization techniques and preserve the model’s high plasticity and stability throughout training, we first create a multi-normalization module to replace the original single batch normalization procedure. On the other hand, we construct a model expansion method with knowledge distillation and offer a logit fine-tuning strategy to increase memory and computational efficiency. The model expansion strategy effectively solves the memory problem caused by feature aggregation by expanding only the deeper models with greater influence when learning a new task, while the knowledge distillation strategy encourages the model to retain the memory of the old task and improves the stability of the model. As a way to address the issue of class imbalance, the logit fine-tuning technique optimizes the standard softmax cross entropy, improving classifier design without adding computational burden. We test our method on the CIFAR10, CIFAR100, and miniImageNet100 datasets under various conditions. According to experimental results, our strategy outperforms current approaches with notable gains in performance. The code is available at <span><span>https://github.com/Lize-Liu/BEFM-main</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"315 ","pages":"Article 113298"},"PeriodicalIF":7.2000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125003454","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The ultimate objective of class-incremental learning (CIL) is to solve the stability–plasticity dilemma by continuously learning new courses without losing what has been learnt from the previous ones. Typical CIL methods tend to favor sample replay and parameter regularization, but recent studies have shown that exploiting and adapting historical models can effectively improve model performance. We present A Balanced and Efficient Fine-tuning Model (BEFM) to improve memory consumption efficiency and prevent catastrophic forgetting in CIL by utilizing historical models. In order to fully utilize various normalization techniques and preserve the model’s high plasticity and stability throughout training, we first create a multi-normalization module to replace the original single batch normalization procedure. On the other hand, we construct a model expansion method with knowledge distillation and offer a logit fine-tuning strategy to increase memory and computational efficiency. The model expansion strategy effectively solves the memory problem caused by feature aggregation by expanding only the deeper models with greater influence when learning a new task, while the knowledge distillation strategy encourages the model to retain the memory of the old task and improves the stability of the model. As a way to address the issue of class imbalance, the logit fine-tuning technique optimizes the standard softmax cross entropy, improving classifier design without adding computational burden. We test our method on the CIFAR10, CIFAR100, and miniImageNet100 datasets under various conditions. According to experimental results, our strategy outperforms current approaches with notable gains in performance. The code is available at https://github.com/Lize-Liu/BEFM-main.

查看原文本刊更多论文

类递增学习（CIL）的最终目标是通过不断学习新课程而不丢失从以前课程中学到的知识，从而解决稳定性和可塑性的两难问题。典型的 CIL 方法倾向于样本重放和参数正则化，但最近的研究表明，利用和调整历史模型可以有效提高模型性能。我们提出了平衡高效微调模型（BEFM），通过利用历史模型来提高 CIL 的内存消耗效率并防止灾难性遗忘。为了充分利用各种归一化技术，并在整个训练过程中保持模型的高可塑性和稳定性，我们首先创建了一个多归一化模块，以取代原来的单批次归一化程序。另一方面，我们构建了一种具有知识蒸馏功能的模型扩展方法，并提供了一种 logit 微调策略，以提高内存和计算效率。模型扩展策略有效解决了特征聚合带来的记忆问题，在学习新任务时只扩展影响较大的深层模型，而知识蒸馏策略则鼓励模型保留对旧任务的记忆，提高了模型的稳定性。作为解决类不平衡问题的一种方法，logit 微调技术优化了标准 softmax 交叉熵，在不增加计算负担的情况下改进了分类器的设计。我们在不同条件下对 CIFAR10、CIFAR100 和 miniImageNet100 数据集测试了我们的方法。实验结果表明，我们的策略优于现有方法，性能显著提高。代码见 https://github.com/Lize-Liu/BEFM-main。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.