Harnessing Mobile Phone Social Network Topology to Infer Users Demographic Attributes

J. Brea, Javier Burroni, Martin Minnoni, Carlos Sarraute
DOI: 10.1145/2659480.2659492 (https://doi.org/10.1145/2659480.2659492)
Published in: Proceedings of the ... IEEE/ACM International Conference on Advances in Social Network Analysis and Mining, volume 7 1, pages 1:1-1:9
Publication date: 2014-08-24
Citations: 18

Abstract

We study the structure of the social graph of mobile phone users in Mexico, focusing on the users' demographic attributes, specifically their age. We examine assortativity patterns in the graph and observe strong age homophily in communication preferences. We propose a graph-based algorithm for predicting the age of mobile phone users. The algorithm exploits the topology of the mobile phone network, together with a subset of users whose ages are known (seeds), to infer the ages of the remaining users. We detail the methodology and show experimental results on a network GT with more than 70 million users. By carefully examining the topological relations of the seeds to the rest of the nodes in GT, we find topological metrics that directly influence the performance of the algorithm. In particular, we characterize subsets of users for which the algorithm reaches 62% accuracy when predicting among 4 age categories (a purely random guess would yield 25%). We also show that the probabilistic information computed by the algorithm can be used to further increase its inference power to 72% on a significant subset of users.
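The abstract does not state the update rule the algorithm uses, so the following is only a minimal sketch of seed-based inference on a graph, under assumed choices: each node holds a probability vector over the 4 age categories, seeds stay fixed at their known category, and every other node repeatedly takes the average of its neighbors' vectors. The function names, the `min_confidence` parameter, and the toy graph are hypothetical and are not taken from the paper. The second function illustrates how the per-node probabilities might be used to keep only the predictions the model is most confident about.

```python
from collections import defaultdict

NUM_CATEGORIES = 4  # the abstract predicts among 4 age categories


def propagate_age_labels(edges, seeds, num_iterations=20):
    """Spread seed age categories over an undirected graph.

    edges: iterable of (u, v) pairs (hypothetical communication links).
    seeds: dict mapping node -> known category in range(NUM_CATEGORIES).
    Returns a dict mapping each node in the graph to a probability vector.
    Assumed update rule: neighbor averaging, not the paper's own method.
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    # Seeds start (and stay) one-hot; all other nodes start uniform.
    probs = {}
    for node in neighbors:
        if node in seeds:
            probs[node] = [1.0 if c == seeds[node] else 0.0
                           for c in range(NUM_CATEGORIES)]
        else:
            probs[node] = [1.0 / NUM_CATEGORIES] * NUM_CATEGORIES

    for _ in range(num_iterations):
        updated = {}
        for node, nbrs in neighbors.items():
            if node in seeds:
                updated[node] = probs[node]  # known ages are never overwritten
                continue
            # Sum the neighbors' vectors per category, then renormalize.
            total = [sum(probs[n][c] for n in nbrs)
                     for c in range(NUM_CATEGORIES)]
            norm = sum(total)
            updated[node] = [t / norm for t in total]
        probs = updated
    return probs


def predict(probs, seeds, min_confidence=0.0):
    """Return {node: (category, probability)} for non-seed nodes whose
    most likely category has probability >= min_confidence."""
    predictions = {}
    for node, p in probs.items():
        if node in seeds:
            continue
        best = max(range(NUM_CATEGORIES), key=lambda c: p[c])
        if p[best] >= min_confidence:
            predictions[node] = (best, p[best])
    return predictions


# Toy path graph a - b - c - d - e with the two end points as seeds.
toy_edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "e")]
toy_seeds = {"a": 0, "e": 3}
toy_probs = propagate_age_labels(toy_edges, toy_seeds)
print(predict(toy_probs, toy_seeds, min_confidence=0.7))
```

In this toy graph the middle node sits equally far from both seeds, so its vector stays split between two categories and the confidence filter drops it, while the nodes adjacent to a seed are kept. Trading coverage for accuracy in this way is one plausible reading of "using the probabilistic information" on a subset of users; the thresholds and accuracy figures in the abstract come from the authors' own experiments, not from this sketch.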