LLMs generate structurally realistic social networks but overestimate political homophily

arXiv - CS - Social and Information Networks Pub Date : 2024-08-29 DOI:arxiv-2408.16629

Serina Chang, Alicja Chaszczewicz, Emma Wang, Maya Josifovska, Emma Pierson, Jure Leskovec

{"title":"LLMs generate structurally realistic social networks but overestimate political homophily","authors":"Serina Chang, Alicja Chaszczewicz, Emma Wang, Maya Josifovska, Emma Pierson, Jure Leskovec","doi":"arxiv-2408.16629","DOIUrl":null,"url":null,"abstract":"Generating social networks is essential for many applications, such as\nepidemic modeling and social simulations. Prior approaches either involve deep\nlearning models, which require many observed networks for training, or stylized\nmodels, which are limited in their realism and flexibility. In contrast, LLMs\noffer the potential for zero-shot and flexible network generation. However, two\nkey questions are: (1) are LLM's generated networks realistic, and (2) what are\nrisks of bias, given the importance of demographics in forming social ties? To\nanswer these questions, we develop three prompting methods for network\ngeneration and compare the generated networks to real social networks. We find\nthat more realistic networks are generated with \"local\" methods, where the LLM\nconstructs relations for one persona at a time, compared to \"global\" methods\nthat construct the entire network at once. We also find that the generated\nnetworks match real networks on many characteristics, including density,\nclustering, community structure, and degree. However, we find that LLMs\nemphasize political homophily over all other types of homophily and\noverestimate political homophily relative to real-world measures.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"44 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.16629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Generating social networks is essential for many applications, such as epidemic modeling and social simulations. Prior approaches either involve deep learning models, which require many observed networks for training, or stylized models, which are limited in their realism and flexibility. In contrast, LLMs offer the potential for zero-shot and flexible network generation. However, two key questions are: (1) are LLM's generated networks realistic, and (2) what are risks of bias, given the importance of demographics in forming social ties? To answer these questions, we develop three prompting methods for network generation and compare the generated networks to real social networks. We find that more realistic networks are generated with "local" methods, where the LLM constructs relations for one persona at a time, compared to "global" methods that construct the entire network at once. We also find that the generated networks match real networks on many characteristics, including density, clustering, community structure, and degree. However, we find that LLMs emphasize political homophily over all other types of homophily and overestimate political homophily relative to real-world measures.

查看原文本刊更多论文

LLM 生成结构真实的社会网络，但高估了政治同质性

生成社交网络对流行病建模和社会模拟等许多应用都至关重要。先前的方法要么涉及深度学习模型，需要许多观察到的网络进行训练，要么涉及风格化模型，在真实性和灵活性方面受到限制。相比之下，LLM 提供了生成零镜头和灵活网络的潜力。然而，两个关键问题是(1) LLM 生成的网络是否真实；(2) 鉴于人口统计学在形成社会联系方面的重要性，存在哪些偏差风险？为了回答这些问题，我们开发了三种网络生成的提示方法，并将生成的网络与真实的社交网络进行了比较。我们发现，与一次性构建整个网络的 "全局 "方法相比，使用 "局部 "方法生成的网络更加真实。我们还发现，生成的网络在许多特性上都与真实网络相匹配，包括密度、聚类、社群结构和程度。然而，我们发现 LLM 更强调政治亲缘性，而不是所有其他类型的亲缘性，而且相对于真实世界的衡量标准，LLM 高估了政治亲缘性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Social and Information Networks

自引率

0.00%

发文量