Finetuning Language Models to Emit Linguistic Expressions of Uncertainty

Arslan Chaudhry, Sridhar Thiagarajan, Dilan Gorur
arXiv - CS - Computation and Language, 2024-09-18. DOI: arxiv-2409.12180 (https://doi.org/arxiv-2409.12180)
Citations: 0

Abstract

Large language models (LLMs) are increasingly employed in information-seeking and decision-making tasks. Despite their broad utility, LLMs tend to generate information that conflicts with real-world facts, and their persuasive style can make these inaccuracies appear confident and convincing. As a result, end-users struggle to consistently align the confidence expressed by LLMs with the accuracy of their predictions, often leading to either blind trust in all outputs or a complete disregard for their reliability. In this work, we explore supervised finetuning on uncertainty-augmented predictions as a method to develop models that produce linguistic expressions of uncertainty. Specifically, we measure the calibration of pre-trained models and then fine-tune language models to generate calibrated linguistic expressions of uncertainty. Through experiments on various question-answering datasets, we demonstrate that LLMs are well-calibrated in assessing their predictions, and supervised finetuning based on the model's own confidence leads to well-calibrated expressions of uncertainty, particularly for single-claim answers.
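
The abstract describes the recipe only at a high level. As a rough illustration (not the authors' implementation), the Python sketch below shows one plausible way to build uncertainty-augmented finetuning targets and to measure calibration: each QA prediction's confidence score is mapped to a verbal hedge that prefixes the target answer, and calibration is checked with expected calibration error (ECE). The confidence thresholds, hedge phrases, and function names are illustrative assumptions, not taken from the paper.

```python
# Illustrative sketch only: the thresholds, phrases, and helper names
# below are assumptions for exposition, not the paper's actual recipe.

# Confidence -> hedging phrase, checked from most to least confident.
HEDGES = [
    (0.90, "I'm almost certain that"),
    (0.70, "I'm fairly confident that"),
    (0.50, "I believe that"),
    (0.30, "I'm unsure, but possibly"),
    (0.00, "I'm only guessing, but perhaps"),
]

def hedge_for(confidence: float) -> str:
    """Return the verbal uncertainty phrase for a confidence in [0, 1]."""
    for threshold, phrase in HEDGES:
        if confidence >= threshold:
            return phrase
    return HEDGES[-1][1]  # unreachable given the 0.00 floor; kept for safety

def augment(question: str, answer: str, confidence: float) -> dict:
    """Build one supervised finetuning example whose target answer is
    prefixed with a hedge matching the model's own confidence."""
    return {"prompt": question, "target": f"{hedge_for(confidence)} {answer}"}

def expected_calibration_error(confidences, correct, n_bins: int = 10) -> float:
    """Standard ECE: bin predictions by confidence, then average
    |accuracy - mean confidence| per bin, weighted by bin size."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)  # clamp conf == 1.0
        bins[idx].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for bucket in bins:
        if not bucket:
            continue
        mean_conf = sum(c for c, _ in bucket) / len(bucket)
        accuracy = sum(ok for _, ok in bucket) / len(bucket)
        ece += (len(bucket) / total) * abs(accuracy - mean_conf)
    return ece

if __name__ == "__main__":
    example = augment("What is the capital of Australia?", "Canberra.", 0.93)
    print(example["target"])  # -> I'm almost certain that Canberra.
    print(expected_calibration_error([0.9, 0.6, 0.8], [True, False, True]))
```

Under a scheme like this, a confidence estimate that is already well calibrated (per the abstract, derived from the model's own confidence in its predictions) translates directly into a hedging phrase whose implied certainty tracks empirical accuracy, which is exactly the property ECE quantifies.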