Large language models (LLMs) as agents for augmented democracy.

IF 4.3 | Zone 3, multidisciplinary journals | JCR Q1, Multidisciplinary Sciences
Jairo F Gudiño, Umberto Grandi, César Hidalgo
Journal: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 382(2285), 20240100
Published: 2024-12-16 (Epub 2024-11-13)
Publication type: Journal Article
DOI: 10.1098/rsta.2024.0100 (https://doi.org/10.1098/rsta.2024.0100)
Citations: 0

Abstract

We explore an augmented democracy system built on off-the-shelf large language models (LLMs) fine-tuned to augment data on citizens' preferences elicited over policies extracted from the government programmes of the two main candidates of Brazil's 2022 presidential election. We use a train-test cross-validation set-up to estimate the accuracy with which the LLMs predict both a subject's individual political choices and the aggregate preferences of the full sample of participants. At the individual level, we find that LLMs predict out-of-sample preferences more accurately than a 'bundle rule', which would assume that citizens always vote for the proposals of the candidate aligned with their self-reported political orientation. At the population level, we show that a probabilistic sample augmented by an LLM provides a more accurate estimate of the aggregate preferences of a population than the non-augmented probabilistic sample alone. Together, these results indicate that policy preference data augmented using LLMs can capture nuances that transcend party lines and represent a promising avenue of research for data augmentation. This article is part of the theme issue 'Co-creating the future: participatory cities and digital governance'.
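The 'bundle rule' baseline and the idea of augmenting a sample can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy proposals, the `Voter` structure, and the function names are hypothetical, and the abstract does not describe how the authors implemented either step. In particular, filling unanswered questions with predictions is only one plausible reading of "augmented", and the bundle rule stands in for the predictor where the paper uses a fine-tuned LLM.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical toy data: the candidate whose programme each proposal came from.
PROPOSAL_SOURCE: Dict[str, str] = {"p1": "A", "p2": "B", "p3": "A"}


@dataclass
class Voter:
    orientation: str                    # self-reported alignment: "A" or "B"
    answers: Dict[str, Optional[int]]   # 1 = approve, 0 = reject, None = not asked


def bundle_rule(orientation: str, proposal: str) -> int:
    """Predict approval only for proposals of the voter's aligned candidate."""
    return 1 if PROPOSAL_SOURCE[proposal] == orientation else 0


def accuracy(pairs: List[Tuple[int, int]]) -> float:
    """Fraction of (predicted, observed) pairs that agree."""
    return sum(p == o for p, o in pairs) / len(pairs)


def support_share(
    voters: List[Voter],
    proposal: str,
    predictor: Optional[Callable[[str, str], int]] = None,
) -> float:
    """Share of voters approving a proposal.

    Missing answers are skipped unless a predictor is given, in which case
    the predicted answer is used (the assumed 'augmentation' step).
    """
    votes = []
    for v in voters:
        answer = v.answers.get(proposal)
        if answer is None and predictor is not None:
            answer = predictor(v.orientation, proposal)
        if answer is not None:
            votes.append(answer)
    return sum(votes) / len(votes)


# Made-up responses, not data from the study.
VOTERS = [
    Voter("A", {"p1": 1, "p2": 1, "p3": None}),
    Voter("B", {"p1": 0, "p2": 1, "p3": 0}),
    Voter("A", {"p1": 1, "p2": 0, "p3": None}),
]

# Score the bundle rule against every answer that was actually observed.
observed_pairs = [
    (bundle_rule(v.orientation, p), a)
    for v in VOTERS
    for p, a in v.answers.items()
    if a is not None
]
bundle_accuracy = accuracy(observed_pairs)

raw_share = support_share(VOTERS, "p3")                      # observed answers only
augmented_share = support_share(VOTERS, "p3", bundle_rule)   # gaps filled by predictor
```

In this toy data the first voter approves a proposal from the non-aligned candidate, which is exactly the kind of cross-party preference the bundle rule cannot capture and that the abstract reports LLMs predicting more accurately.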

Source journal
CiteScore: 9.30
Self-citation rate: 2.00%
Articles published: 367
Review time: 3 months
About the journal: Continuing its long history of influential scientific publishing, Philosophical Transactions A publishes high-quality theme issues on topics of current importance and general interest within the physical, mathematical and engineering sciences, guest-edited by leading authorities and comprising new research, reviews and opinions from prominent researchers.