从社交媒体上的部分联系记录预测网络成员:机器学习方法

IF 2.9 2区 社会学 Q1 ANTHROPOLOGY
Shu-Mei Lai , Tso-Jung Yen , Ming-Yi Chang , Yang-chih Fu , Wei-Chung Liu
{"title":"从社交媒体上的部分联系记录预测网络成员:机器学习方法","authors":"Shu-Mei Lai ,&nbsp;Tso-Jung Yen ,&nbsp;Ming-Yi Chang ,&nbsp;Yang-chih Fu ,&nbsp;Wei-Chung Liu","doi":"10.1016/j.socnet.2024.08.004","DOIUrl":null,"url":null,"abstract":"<div><p>Surveys conducted on social groups often generate incomplete information due to imperfect response rates. Drawing on Facebook data from a nationally representative sample of graduating college students in Taiwan, we examined the extent to which partial contact records predict which Facebook users belong to a specific class. We first used data from classes with low to middle response rates to train a model for classmate prediction. Based on data from classes with high or perfect response rates, we simulated data by using four different sampling methods with various response rates, and applied the trained model on simulated data to classmate prediction. With a minimal response rate of 40 percent, we achieved an accuracy rate of 90 percent and a true positive rate of 86 percent. Chronological order sampling had the best prediction performance, followed closely by popularity sampling, then by random sampling, and lastly by unpopularity sampling.</p></div>","PeriodicalId":48353,"journal":{"name":"Social Networks","volume":"80 ","pages":"Pages 10-24"},"PeriodicalIF":2.9000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0378873324000455/pdfft?md5=cc4c9fafb61af2fdbf03825448a7f086&pid=1-s2.0-S0378873324000455-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Predicting network members from partial contact records on social media: A machine learning approach\",\"authors\":\"Shu-Mei Lai ,&nbsp;Tso-Jung Yen ,&nbsp;Ming-Yi Chang ,&nbsp;Yang-chih Fu ,&nbsp;Wei-Chung Liu\",\"doi\":\"10.1016/j.socnet.2024.08.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Surveys conducted on social groups often generate incomplete information due to imperfect response rates. Drawing on Facebook data from a nationally representative sample of graduating college students in Taiwan, we examined the extent to which partial contact records predict which Facebook users belong to a specific class. We first used data from classes with low to middle response rates to train a model for classmate prediction. Based on data from classes with high or perfect response rates, we simulated data by using four different sampling methods with various response rates, and applied the trained model on simulated data to classmate prediction. With a minimal response rate of 40 percent, we achieved an accuracy rate of 90 percent and a true positive rate of 86 percent. Chronological order sampling had the best prediction performance, followed closely by popularity sampling, then by random sampling, and lastly by unpopularity sampling.</p></div>\",\"PeriodicalId\":48353,\"journal\":{\"name\":\"Social Networks\",\"volume\":\"80 \",\"pages\":\"Pages 10-24\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2024-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0378873324000455/pdfft?md5=cc4c9fafb61af2fdbf03825448a7f086&pid=1-s2.0-S0378873324000455-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Social Networks\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378873324000455\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ANTHROPOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Social Networks","FirstCategoryId":"90","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378873324000455","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANTHROPOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

由于回复率不完全,针对社会群体进行的调查往往会产生不完整的信息。我们利用具有全国代表性的台湾即将毕业大学生的 Facebook 数据,研究了部分联系记录在多大程度上可以预测哪些 Facebook 用户属于某个特定班级。我们首先使用中低回复率班级的数据来训练同学预测模型。在高回复率或完美回复率班级数据的基础上,我们使用四种不同回复率的抽样方法模拟数据,并将训练好的模型应用于模拟数据的同学预测。在最低应答率为 40% 的情况下,我们取得了 90% 的准确率和 86% 的真阳性率。按时间顺序抽样的预测效果最好,受欢迎程度抽样紧随其后,然后是随机抽样,最后是不受欢迎程度抽样。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting network members from partial contact records on social media: A machine learning approach

Surveys conducted on social groups often generate incomplete information due to imperfect response rates. Drawing on Facebook data from a nationally representative sample of graduating college students in Taiwan, we examined the extent to which partial contact records predict which Facebook users belong to a specific class. We first used data from classes with low to middle response rates to train a model for classmate prediction. Based on data from classes with high or perfect response rates, we simulated data by using four different sampling methods with various response rates, and applied the trained model on simulated data to classmate prediction. With a minimal response rate of 40 percent, we achieved an accuracy rate of 90 percent and a true positive rate of 86 percent. Chronological order sampling had the best prediction performance, followed closely by popularity sampling, then by random sampling, and lastly by unpopularity sampling.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Social Networks
Social Networks Multiple-
CiteScore
5.90
自引率
12.90%
发文量
118
期刊介绍: Social Networks is an interdisciplinary and international quarterly. It provides a common forum for representatives of anthropology, sociology, history, social psychology, political science, human geography, biology, economics, communications science and other disciplines who share an interest in the study of the empirical structure of social relations and associations that may be expressed in network form. It publishes both theoretical and substantive papers. Critical reviews of major theoretical or methodological approaches using the notion of networks in the analysis of social behaviour are also included, as are reviews of recent books dealing with social networks and social structure.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信