{"title":"A soft-masking continual pre-training method based on domain knowledge relevance","authors":"Lazhi Zhao , Jianzhou Feng , Huaxiao Qiu , Chenghan Gu , Haonan Qin","doi":"10.1016/j.ipm.2025.104224","DOIUrl":null,"url":null,"abstract":"<div><div>Most existing continual learning (CL) methods primarily focus on reducing catastrophic forgetting. Although some approaches have achieved CF-free learning, they often treat parameter optimization across tasks as a conflicting process. This assumption not only inhibits the learning of new tasks but also limits knowledge transfer between tasks. To address this, we propose a soft-masking continual pre-training method based on domain knowledge relevance (DKR) within the framework of continual domain-adaptive pre-training. Unlike traditional methods, DKR does not fully regard parameter optimization as adversarial. Instead, it dynamically applied soft masks to model parameters based on the degree of relevance between domain knowledge. Moreover, we introduce an efficient importance calculation method based on dual interactions between input and weights, which accurately assesses the importance of parameters. Our experimental results on six public datasets demonstrate that our method significantly outperforms existing methods in terms of mitigating forgetting. Specifically, DKR achieves negative forgetting with improvements of 1.95% in macro-F1 and 1.52% in accuracy over Naive CL(NCL).</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 6","pages":"Article 104224"},"PeriodicalIF":6.9000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325001657","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Most existing continual learning (CL) methods primarily focus on reducing catastrophic forgetting. Although some approaches have achieved CF-free learning, they often treat parameter optimization across tasks as a conflicting process. This assumption not only inhibits the learning of new tasks but also limits knowledge transfer between tasks. To address this, we propose a soft-masking continual pre-training method based on domain knowledge relevance (DKR) within the framework of continual domain-adaptive pre-training. Unlike traditional methods, DKR does not fully regard parameter optimization as adversarial. Instead, it dynamically applied soft masks to model parameters based on the degree of relevance between domain knowledge. Moreover, we introduce an efficient importance calculation method based on dual interactions between input and weights, which accurately assesses the importance of parameters. Our experimental results on six public datasets demonstrate that our method significantly outperforms existing methods in terms of mitigating forgetting. Specifically, DKR achieves negative forgetting with improvements of 1.95% in macro-F1 and 1.52% in accuracy over Naive CL(NCL).
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.