Yimin Liu;Xiangyang Luo;Zhiyuan Tao;Meng Zhang;Shaoyong Du
{"title":"UGCC: Social Media User Geolocation via Cyclic Coupling","authors":"Yimin Liu;Xiangyang Luo;Zhiyuan Tao;Meng Zhang;Shaoyong Du","doi":"10.1109/TBDATA.2023.3242961","DOIUrl":null,"url":null,"abstract":"Social media user geolocation is to infer users’ resident locations based on social media data, including user texts and social relationships. Existing methods mainly rely on the textual feature propagation in the social graph to fuse users’ textual and social information. The geolocation accuracy is susceptible to insufficient data sources and inadequate fusion. In this paper, a social media user geolocation algorithm based on cyclic coupling (called UGCC) is proposed. We collapse the social graph based on the neighbor location proximity, which reduces noisy information while enriching social relationships. Unlike existing methods that ignore the social graph's structure, UGCC measures the probability of users being in the candidate locations according to users’ structural location in the social sub-graph. Finally, we design a cyclic coupling mechanism to fuse the users’ textual and social information, which enables the two kinds of information to enhance each other and geolocate users cooperatively. Compared with ten typical existing methods (such as RELP and HGNN), experimental results show UGCC's superior performance. On two public datasets, the city-level accuracies of UGCC reach 40.8% and 50.1%; the median errors are 35.1% and 23.4% lower than the state-of-the-art methods.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 4","pages":"1128-1141"},"PeriodicalIF":7.5000,"publicationDate":"2023-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10040233/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Social media user geolocation is to infer users’ resident locations based on social media data, including user texts and social relationships. Existing methods mainly rely on the textual feature propagation in the social graph to fuse users’ textual and social information. The geolocation accuracy is susceptible to insufficient data sources and inadequate fusion. In this paper, a social media user geolocation algorithm based on cyclic coupling (called UGCC) is proposed. We collapse the social graph based on the neighbor location proximity, which reduces noisy information while enriching social relationships. Unlike existing methods that ignore the social graph's structure, UGCC measures the probability of users being in the candidate locations according to users’ structural location in the social sub-graph. Finally, we design a cyclic coupling mechanism to fuse the users’ textual and social information, which enables the two kinds of information to enhance each other and geolocate users cooperatively. Compared with ten typical existing methods (such as RELP and HGNN), experimental results show UGCC's superior performance. On two public datasets, the city-level accuracies of UGCC reach 40.8% and 50.1%; the median errors are 35.1% and 23.4% lower than the state-of-the-art methods.
期刊介绍:
The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.