大型语言模型（LLM）作为增强民主的代理。

IF 4.3 3区综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences Pub Date : 2024-12-16 Epub Date: 2024-11-13 DOI:10.1098/rsta.2024.0100

Jairo F Gudiño, Umberto Grandi, César Hidalgo

{"title":"大型语言模型（LLM）作为增强民主的代理。","authors":"Jairo F Gudiño, Umberto Grandi, César Hidalgo","doi":"10.1098/rsta.2024.0100","DOIUrl":null,"url":null,"abstract":"We explore an augmented democracy system built on off-the-shelf large language models (LLMs) fine-tuned to augment data on citizens' preferences elicited over policies extracted from the government programmes of the two main candidates of Brazil's 2022 presidential election. We use a train-test cross-validation set-up to estimate the accuracy with which the LLMs predict both: a subject's individual political choices and the aggregate preferences of the full sample of participants. At the individual level, we find that LLMs predict out of sample preferences more accurately than a 'bundle rule', which would assume that citizens always vote for the proposals of the candidate aligned with their self-reported political orientation. At the population level, we show that a probabilistic sample augmented by an LLM provides a more accurate estimate of the aggregate preferences of a population than the non-augmented probabilistic sample alone. Together, these results indicate that policy preference data augmented using LLMs can capture nuances that transcend party lines and represents a promising avenue of research for data augmentation. This article is part of the theme issue 'Co-creating the future: participatory cities and digital governance'.","PeriodicalId":19879,"journal":{"name":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","volume":"382 2285","pages":"20240100"},"PeriodicalIF":4.3000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large language models (LLMs) as agents for augmented democracy.\",\"authors\":\"Jairo F Gudiño, Umberto Grandi, César Hidalgo\",\"doi\":\"10.1098/rsta.2024.0100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We explore an augmented democracy system built on off-the-shelf large language models (LLMs) fine-tuned to augment data on citizens' preferences elicited over policies extracted from the government programmes of the two main candidates of Brazil's 2022 presidential election. We use a train-test cross-validation set-up to estimate the accuracy with which the LLMs predict both: a subject's individual political choices and the aggregate preferences of the full sample of participants. At the individual level, we find that LLMs predict out of sample preferences more accurately than a 'bundle rule', which would assume that citizens always vote for the proposals of the candidate aligned with their self-reported political orientation. At the population level, we show that a probabilistic sample augmented by an LLM provides a more accurate estimate of the aggregate preferences of a population than the non-augmented probabilistic sample alone. Together, these results indicate that policy preference data augmented using LLMs can capture nuances that transcend party lines and represents a promising avenue of research for data augmentation. This article is part of the theme issue 'Co-creating the future: participatory cities and digital governance'.\",\"PeriodicalId\":19879,\"journal\":{\"name\":\"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences\",\"volume\":\"382 2285\",\"pages\":\"20240100\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1098/rsta.2024.0100\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/13 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsta.2024.0100","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

我们探索了一种增强民主系统，该系统建立在现成的大型语言模型（LLMs）基础上，对其进行了微调，以增强从巴西 2022 年总统大选两位主要候选人的政府计划中提取的公民政策偏好数据。我们使用训练-测试交叉验证设置来估算 LLM 预测以下两方面的准确性：受试者的个人政治选择和所有参与者样本的总体偏好。在个人层面，我们发现 LLM 预测样本外偏好的准确度高于 "捆绑规则"，后者假定公民总是投票支持与其自我报告的政治倾向一致的候选人的提案。在人口层面，我们表明，与单独的非增量概率样本相比，使用 LLM 的增量概率样本能更准确地估计人口的总体偏好。总之，这些结果表明，使用 LLM 增强的政策偏好数据可以捕捉到超越党派界限的细微差别，是数据增强的一个有前途的研究方向。本文是 "共创未来：参与式城市与数字治理 "主题期刊的一部分。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Large language models (LLMs) as agents for augmented democracy.

We explore an augmented democracy system built on off-the-shelf large language models (LLMs) fine-tuned to augment data on citizens' preferences elicited over policies extracted from the government programmes of the two main candidates of Brazil's 2022 presidential election. We use a train-test cross-validation set-up to estimate the accuracy with which the LLMs predict both: a subject's individual political choices and the aggregate preferences of the full sample of participants. At the individual level, we find that LLMs predict out of sample preferences more accurately than a 'bundle rule', which would assume that citizens always vote for the proposals of the candidate aligned with their self-reported political orientation. At the population level, we show that a probabilistic sample augmented by an LLM provides a more accurate estimate of the aggregate preferences of a population than the non-augmented probabilistic sample alone. Together, these results indicate that policy preference data augmented using LLMs can capture nuances that transcend party lines and represents a promising avenue of research for data augmentation. This article is part of the theme issue 'Co-creating the future: participatory cities and digital governance'.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 综合性期刊-综合性期刊

CiteScore

9.30

自引率

2.00%

发文量

367

审稿时长

3 months

期刊介绍： Continuing its long history of influential scientific publishing, Philosophical Transactions A publishes high-quality theme issues on topics of current importance and general interest within the physical, mathematical and engineering sciences, guest-edited by leading authorities and comprising new research, reviews and opinions from prominent researchers.