Enhancing charge prediction through the collaboration of large and small models

IF 3.2 3区 社会学 Q1 LAW
Bin Wei , Yaoyao Yu , Jiawen Zhang , Yiquan Wu
{"title":"Enhancing charge prediction through the collaboration of large and small models","authors":"Bin Wei ,&nbsp;Yaoyao Yu ,&nbsp;Jiawen Zhang ,&nbsp;Yiquan Wu","doi":"10.1016/j.clsr.2025.106168","DOIUrl":null,"url":null,"abstract":"<div><div>Charge prediction is a fundamental task in AI&amp;Law, where the goal is to predict charges based on fact descriptions. Although various methods have been introduced to enhance performance, challenges remain. Specifically, small models (SMs)-based methods such as BERT struggle with hard cases involving low-frequency or confusing charges due to their limited capacity, whereas large language models (LLMs)-based approaches like GPT-4 exhibit difficulties in handling diverse charges owing to insufficient legal knowledge. To overcome these limitations, we propose a hybrid framework that collaborates both large and small models to improve charge prediction performance, based on the idea that combining the strengths of each can overcome their limitations. Initially, SMs provide an initial prediction along with a predicted probability distribution. If the maximum predicted probability falls below a threshold, LLMs step in to reflect and re-predict as needed. Additionally, we construct a confusing charges dictionary and design a two-stage legal inference prompt, which helps LLMs make the secondary prediction for the hard cases. Extensive experiments on two datasets from China and Italy demonstrate the effectiveness of this approach, yielding average F1 improvements of 7.94% and 11.46% respectively. Moreover, a fine-grained analysis demonstrates that our proposed framework is effective in identifying low-frequency and confusing charges.</div></div>","PeriodicalId":51516,"journal":{"name":"Computer Law & Security Review","volume":"58 ","pages":"Article 106168"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Law & Security Review","FirstCategoryId":"90","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212473X25000410","RegionNum":3,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"LAW","Score":null,"Total":0}
引用次数: 0

Abstract

Charge prediction is a fundamental task in AI&Law, where the goal is to predict charges based on fact descriptions. Although various methods have been introduced to enhance performance, challenges remain. Specifically, small models (SMs)-based methods such as BERT struggle with hard cases involving low-frequency or confusing charges due to their limited capacity, whereas large language models (LLMs)-based approaches like GPT-4 exhibit difficulties in handling diverse charges owing to insufficient legal knowledge. To overcome these limitations, we propose a hybrid framework that collaborates both large and small models to improve charge prediction performance, based on the idea that combining the strengths of each can overcome their limitations. Initially, SMs provide an initial prediction along with a predicted probability distribution. If the maximum predicted probability falls below a threshold, LLMs step in to reflect and re-predict as needed. Additionally, we construct a confusing charges dictionary and design a two-stage legal inference prompt, which helps LLMs make the secondary prediction for the hard cases. Extensive experiments on two datasets from China and Italy demonstrate the effectiveness of this approach, yielding average F1 improvements of 7.94% and 11.46% respectively. Moreover, a fine-grained analysis demonstrates that our proposed framework is effective in identifying low-frequency and confusing charges.
通过大型和小型模型的协作增强电荷预测
电荷预测是人工智能法中的一项基本任务,其目标是根据事实描述预测电荷。尽管已经引入了各种方法来提高性能,但挑战仍然存在。具体来说,基于小模型(SMs)的方法,如BERT,由于其有限的容量,难以处理涉及低频或混淆收费的疑难案件,而基于大型语言模型(LLMs)的方法,如GPT-4,由于缺乏法律知识,在处理各种收费方面表现出困难。为了克服这些限制,我们提出了一个混合框架,结合大型和小型模型来提高电荷预测性能,基于结合每个模型的优势可以克服它们的局限性的想法。最初,SMs提供了一个初始预测以及预测的概率分布。如果最大预测概率低于阈值,llm会介入以反映并根据需要重新预测。此外,我们构建了一个混淆收费词典,并设计了一个两阶段的法律推理提示,这有助于法学硕士对困难案例进行二次预测。在中国和意大利的两个数据集上进行的大量实验证明了该方法的有效性,平均F1分别提高了7.94%和11.46%。此外,细粒度分析表明,我们提出的框架是有效的识别低频和混淆收费。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.60
自引率
10.30%
发文量
81
审稿时长
67 days
期刊介绍: CLSR publishes refereed academic and practitioner papers on topics such as Web 2.0, IT security, Identity management, ID cards, RFID, interference with privacy, Internet law, telecoms regulation, online broadcasting, intellectual property, software law, e-commerce, outsourcing, data protection, EU policy, freedom of information, computer security and many other topics. In addition it provides a regular update on European Union developments, national news from more than 20 jurisdictions in both Europe and the Pacific Rim. It is looking for papers within the subject area that display good quality legal analysis and new lines of legal thought or policy development that go beyond mere description of the subject area, however accurate that may be.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信