Sudeep Bhatia, Simon T. van Baal, Feiyi Wang, Lukasz Walasek
{"title":"100k选择困境的计算分析:决策属性、权衡结构和基于模型的预测","authors":"Sudeep Bhatia, Simon T. van Baal, Feiyi Wang, Lukasz Walasek","doi":"10.1073/pnas.2406489122","DOIUrl":null,"url":null,"abstract":"We present a dataset of over 100 K textual descriptions of real-life choice dilemmas, obtained from social media posts and large-scale survey data. Using large language models (LLMs), we extract hundreds of choice attributes at play in these dilemmas and map them onto a common representational space. This representation allows us to quantify the broader themes and specific trade-offs inherent in life choices and analyze how they vary across different contexts. We also present our dilemmas to human participants and find that our LLM pipeline, when combined with established decision models, accurately predicts people’s choices, outperforming models based on unstructured textual content, demographics, and personality. In this way, our research provides insights into the attributes, outcomes, and goals that underpin life choices, and shows how large-scale LLM-based structure extraction can be used in combination with existing scientific theory to study complex real-world human behavior.","PeriodicalId":20548,"journal":{"name":"Proceedings of the National Academy of Sciences of the United States of America","volume":"64 1","pages":""},"PeriodicalIF":9.4000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Computational analysis of 100 K choice dilemmas: Decision attributes, trade-off structures, and model-based prediction\",\"authors\":\"Sudeep Bhatia, Simon T. van Baal, Feiyi Wang, Lukasz Walasek\",\"doi\":\"10.1073/pnas.2406489122\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a dataset of over 100 K textual descriptions of real-life choice dilemmas, obtained from social media posts and large-scale survey data. Using large language models (LLMs), we extract hundreds of choice attributes at play in these dilemmas and map them onto a common representational space. This representation allows us to quantify the broader themes and specific trade-offs inherent in life choices and analyze how they vary across different contexts. We also present our dilemmas to human participants and find that our LLM pipeline, when combined with established decision models, accurately predicts people’s choices, outperforming models based on unstructured textual content, demographics, and personality. In this way, our research provides insights into the attributes, outcomes, and goals that underpin life choices, and shows how large-scale LLM-based structure extraction can be used in combination with existing scientific theory to study complex real-world human behavior.\",\"PeriodicalId\":20548,\"journal\":{\"name\":\"Proceedings of the National Academy of Sciences of the United States of America\",\"volume\":\"64 1\",\"pages\":\"\"},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2025-04-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the National Academy of Sciences of the United States of America\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1073/pnas.2406489122\",\"RegionNum\":1,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences of the United States of America","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1073/pnas.2406489122","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Computational analysis of 100 K choice dilemmas: Decision attributes, trade-off structures, and model-based prediction
We present a dataset of over 100 K textual descriptions of real-life choice dilemmas, obtained from social media posts and large-scale survey data. Using large language models (LLMs), we extract hundreds of choice attributes at play in these dilemmas and map them onto a common representational space. This representation allows us to quantify the broader themes and specific trade-offs inherent in life choices and analyze how they vary across different contexts. We also present our dilemmas to human participants and find that our LLM pipeline, when combined with established decision models, accurately predicts people’s choices, outperforming models based on unstructured textual content, demographics, and personality. In this way, our research provides insights into the attributes, outcomes, and goals that underpin life choices, and shows how large-scale LLM-based structure extraction can be used in combination with existing scientific theory to study complex real-world human behavior.
期刊介绍:
The Proceedings of the National Academy of Sciences (PNAS), a peer-reviewed journal of the National Academy of Sciences (NAS), serves as an authoritative source for high-impact, original research across the biological, physical, and social sciences. With a global scope, the journal welcomes submissions from researchers worldwide, making it an inclusive platform for advancing scientific knowledge.