{"title":"Law LLM unlearning via interfere prompt, review output and update parameter: new challenges, method and baseline","authors":"Rui Shao, Yiping Tang, Lingyan Yang, Fang Wang","doi":"10.1016/j.eswa.2025.128612","DOIUrl":null,"url":null,"abstract":"<div><div>Law large language models (Law LLMs) often generate hallucinations, one of the reasons is due to memorizing sensitive, inaccurate, or outdated information, and unlearning such information has important research value, yet the legal domain lacks publicly available datasets and effective methods for LLM unlearning tasks and evaluations. This work innovatively proposes three unlearning tasks in legal field, providing new datasets for unlearning tasks. In addition, proposes a Law LLM unlearning method via loss adjustment with only need forgotten sequence (UNFS), providing a new baseline and unlearning method for the unlearning tasks. Further, after UNFS unlearning, proposing an inference method for Law LLMs that combines interfering input and reviewing output, reinforcing that Law LLMs avoid including erroneous information in the output. Designing a new metric for law LLM unlearning, the legal data memory evaluation method (LawME), LawME automatically judges the output quality of Law LLMs by comparing the content output by Law LLMs with the ground truth. Real-world dataset experiments and analyses validate UNFS’s effectiveness: on the three proposed legal unlearning datasets, UNFS’s accuracy decreases by 16.53 %, perplexity increased by 3.94, and AUC decreased by 16.09 %. On the retained datasets, UNFS’s accuracy only decreased by 0.02 %-0.26 %, and on the generalized task MMLU by only 0.07 %-0.15 %. These results demonstrate that UNFS has excellent unlearning performance and does not harm the performance on other data that do not participate in unlearning. Other experiments and analyses verified the validity of the proposed inference approach, LawME metrics.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"292 ","pages":"Article 128612"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425022316","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0
Abstract
Law large language models (Law LLMs) often generate hallucinations, in part because they memorize sensitive, inaccurate, or outdated information. Unlearning such information therefore has important research value, yet the legal domain lacks publicly available datasets and effective methods for LLM unlearning tasks and their evaluation. This work proposes three unlearning tasks in the legal field and provides new datasets for them. It also proposes a Law LLM unlearning method via loss adjustment that needs only the forgotten sequence (UNFS), providing a new baseline and unlearning method for these tasks. Further, for inference after UNFS unlearning, it proposes a method for Law LLMs that combines interfering with the input and reviewing the output, reinforcing that Law LLMs avoid including erroneous information in their output. It also designs a new metric for Law LLM unlearning, the legal data memory evaluation method (LawME), which automatically judges the output quality of Law LLMs by comparing their output with the ground truth. Experiments and analyses on real-world datasets validate UNFS's effectiveness: on the three proposed legal unlearning datasets, UNFS's accuracy decreases by 16.53%, perplexity increases by 3.94, and AUC decreases by 16.09%. On the retained datasets, UNFS's accuracy decreases by only 0.02%-0.26%, and on the generalized MMLU task by only 0.07%-0.15%. These results demonstrate that UNFS has excellent unlearning performance and does not harm performance on data that do not participate in unlearning. Additional experiments and analyses verify the validity of the proposed inference approach and the LawME metric.
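To make the "loss adjustment with only the forgotten sequence" idea concrete, the sketch below shows a generic unlearning step of this family: the causal-LM loss on the forget sequence is negated so that the parameter update increases the model's loss on that sequence (gradient ascent). This is only an illustrative sketch, not the paper's UNFS algorithm; the model name, learning rate, clipping threshold, and forget text are assumptions for demonstration.

```python
# Minimal sketch of a loss-adjustment unlearning step that uses only the
# forget sequence. NOT the paper's UNFS method; all names and values below
# (model, learning rate, forget text) are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; the paper targets a Law LLM
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.train()

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

# A hypothetical piece of sensitive/outdated legal text to be forgotten.
forget_text = "Example sensitive legal clause that the model should unlearn."
batch = tokenizer(forget_text, return_tensors="pt")

# Standard next-token prediction loss on the forget sequence ...
outputs = model(**batch, labels=batch["input_ids"])
# ... with the sign flipped, so the optimizer step *increases* the loss on
# this sequence and thereby degrades the model's memory of it.
unlearn_loss = -outputs.loss

optimizer.zero_grad()
unlearn_loss.backward()
torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)  # keep updates bounded
optimizer.step()
```

In practice such a step would be repeated over the forget set with a small learning rate and some safeguard (e.g. gradient clipping or early stopping) so that performance on retained data, like the 0.02%-0.26% accuracy drop reported above, stays essentially unchanged.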
Journal description:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.