Serina Chang, Alicja Chaszczewicz, Emma Wang, Maya Josifovska, Emma Pierson, Jure Leskovec
{"title":"LLM 生成结构真实的社会网络,但高估了政治同质性","authors":"Serina Chang, Alicja Chaszczewicz, Emma Wang, Maya Josifovska, Emma Pierson, Jure Leskovec","doi":"arxiv-2408.16629","DOIUrl":null,"url":null,"abstract":"Generating social networks is essential for many applications, such as\nepidemic modeling and social simulations. Prior approaches either involve deep\nlearning models, which require many observed networks for training, or stylized\nmodels, which are limited in their realism and flexibility. In contrast, LLMs\noffer the potential for zero-shot and flexible network generation. However, two\nkey questions are: (1) are LLM's generated networks realistic, and (2) what are\nrisks of bias, given the importance of demographics in forming social ties? To\nanswer these questions, we develop three prompting methods for network\ngeneration and compare the generated networks to real social networks. We find\nthat more realistic networks are generated with \"local\" methods, where the LLM\nconstructs relations for one persona at a time, compared to \"global\" methods\nthat construct the entire network at once. We also find that the generated\nnetworks match real networks on many characteristics, including density,\nclustering, community structure, and degree. However, we find that LLMs\nemphasize political homophily over all other types of homophily and\noverestimate political homophily relative to real-world measures.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LLMs generate structurally realistic social networks but overestimate political homophily\",\"authors\":\"Serina Chang, Alicja Chaszczewicz, Emma Wang, Maya Josifovska, Emma Pierson, Jure Leskovec\",\"doi\":\"arxiv-2408.16629\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generating social networks is essential for many applications, such as\\nepidemic modeling and social simulations. Prior approaches either involve deep\\nlearning models, which require many observed networks for training, or stylized\\nmodels, which are limited in their realism and flexibility. In contrast, LLMs\\noffer the potential for zero-shot and flexible network generation. However, two\\nkey questions are: (1) are LLM's generated networks realistic, and (2) what are\\nrisks of bias, given the importance of demographics in forming social ties? To\\nanswer these questions, we develop three prompting methods for network\\ngeneration and compare the generated networks to real social networks. We find\\nthat more realistic networks are generated with \\\"local\\\" methods, where the LLM\\nconstructs relations for one persona at a time, compared to \\\"global\\\" methods\\nthat construct the entire network at once. We also find that the generated\\nnetworks match real networks on many characteristics, including density,\\nclustering, community structure, and degree. However, we find that LLMs\\nemphasize political homophily over all other types of homophily and\\noverestimate political homophily relative to real-world measures.\",\"PeriodicalId\":501032,\"journal\":{\"name\":\"arXiv - CS - Social and Information Networks\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Social and Information Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.16629\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.16629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
LLMs generate structurally realistic social networks but overestimate political homophily
Generating social networks is essential for many applications, such as
epidemic modeling and social simulations. Prior approaches either involve deep
learning models, which require many observed networks for training, or stylized
models, which are limited in their realism and flexibility. In contrast, LLMs
offer the potential for zero-shot and flexible network generation. However, two
key questions are: (1) are LLM's generated networks realistic, and (2) what are
risks of bias, given the importance of demographics in forming social ties? To
answer these questions, we develop three prompting methods for network
generation and compare the generated networks to real social networks. We find
that more realistic networks are generated with "local" methods, where the LLM
constructs relations for one persona at a time, compared to "global" methods
that construct the entire network at once. We also find that the generated
networks match real networks on many characteristics, including density,
clustering, community structure, and degree. However, we find that LLMs
emphasize political homophily over all other types of homophily and
overestimate political homophily relative to real-world measures.