Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Impact Factor: 3.0 · CAS Tier 1 (Sociology) · JCR Q1 (Law)
Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E Ho
{"title":"大型法律虚构:剖析大型语言模型中的法律幻觉","authors":"Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E Ho","doi":"10.1093/jla/laae003","DOIUrl":null,"url":null,"abstract":"Do large language models (LLMs) know the law? LLMs are increasingly being used to augment legal practice, education, and research, yet their revolutionary potential is threatened by the presence of “hallucinations”—textual output that is not consistent with legal facts. We present the first systematic evidence of these hallucinations in public-facing LLMs, documenting trends across jurisdictions, courts, time periods, and cases. Using OpenAI’s ChatGPT 4 and other public models, we show that LLMs hallucinate at least 58% of the time, struggle to predict their own hallucinations, and often uncritically accept users’ incorrect legal assumptions. We conclude by cautioning against the rapid and unsupervised integration of popular LLMs into legal tasks, and we develop a typology of legal hallucinations to guide future research in this area.","PeriodicalId":45189,"journal":{"name":"Journal of Legal Analysis","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models\",\"authors\":\"Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E Ho\",\"doi\":\"10.1093/jla/laae003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Do large language models (LLMs) know the law? LLMs are increasingly being used to augment legal practice, education, and research, yet their revolutionary potential is threatened by the presence of “hallucinations”—textual output that is not consistent with legal facts. We present the first systematic evidence of these hallucinations in public-facing LLMs, documenting trends across jurisdictions, courts, time periods, and cases. Using OpenAI’s ChatGPT 4 and other public models, we show that LLMs hallucinate at least 58% of the time, struggle to predict their own hallucinations, and often uncritically accept users’ incorrect legal assumptions. We conclude by cautioning against the rapid and unsupervised integration of popular LLMs into legal tasks, and we develop a typology of legal hallucinations to guide future research in this area.\",\"PeriodicalId\":45189,\"journal\":{\"name\":\"Journal of Legal Analysis\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Legal Analysis\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1093/jla/laae003\",\"RegionNum\":1,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"LAW\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Legal Analysis","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1093/jla/laae003","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"LAW","Score":null,"Total":0}
Citations: 0

Abstract

Do large language models (LLMs) know the law? LLMs are increasingly being used to augment legal practice, education, and research, yet their revolutionary potential is threatened by the presence of “hallucinations”—textual output that is not consistent with legal facts. We present the first systematic evidence of these hallucinations in public-facing LLMs, documenting trends across jurisdictions, courts, time periods, and cases. Using OpenAI’s ChatGPT 4 and other public models, we show that LLMs hallucinate at least 58% of the time, struggle to predict their own hallucinations, and often uncritically accept users’ incorrect legal assumptions. We conclude by cautioning against the rapid and unsupervised integration of popular LLMs into legal tasks, and we develop a typology of legal hallucinations to guide future research in this area.
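The study's headline numbers come from querying public models about legal facts and scoring the answers against reference data. The snippet below is a minimal sketch of that kind of check, assuming the OpenAI Python client, a hypothetical two-item reference set, and an illustrative prompt and scoring rule; it is not the paper's actual benchmark, prompt design, or query pipeline.

```python
# Hypothetical sketch: probing a public LLM for "case existence" hallucinations.
# The reference set, prompt wording, and yes/no scoring rule are illustrative
# assumptions, not the paper's benchmark data or methodology.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Illustrative ground truth: one real citation, one deliberately fabricated one.
REFERENCE = {
    "Brown v. Board of Education, 347 U.S. 483 (1954)": True,    # real case
    "Thompson v. United Data Corp., 512 U.S. 999 (1994)": False, # invented citation
}

def model_says_real(citation: str, model: str = "gpt-4") -> bool:
    """Ask the model whether a citation refers to a real decided case."""
    resp = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": f"Is '{citation}' a real, decided U.S. court case? Answer yes or no.",
        }],
        temperature=0,
    )
    return resp.choices[0].message.content.strip().lower().startswith("yes")

# Count errors: the model affirming a fabricated case or denying a real one.
errors = sum(model_says_real(c) != is_real for c, is_real in REFERENCE.items())
print(f"hallucination rate on this toy set: {errors / len(REFERENCE):.0%}")
```

A real evaluation would replace the toy reference set with verified case metadata and repeat the loop across models, jurisdictions, and prompt formulations.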
Source journal metrics: CiteScore 4.10 · Self-citation rate 0.00% · Articles published 3 · Review time 16 weeks