Detecting Fake Users on Social Media with a Graph Database
Yichun Zhao, Jens Weber
The Arbutus Review, journal article, published 2021-10-25
DOI: 10.18357/tar121202120027 (https://doi.org/10.18357/tar121202120027)
Citation count: 1
Abstract
Social media has become a major part of people’s daily lives: it lets users connect with people, interact with friends, share personal content, and gather information. However, it also creates opportunities for fake users. If not detected, fake users may be perceived as popular and influential. They can spread false information or fake news by making it look real, manipulating real users into making certain decisions. In computer science, a social network can be treated as a graph, a data structure whose nodes are the social media users and whose edges are the connections between them. Graph data can be stored in a graph database for efficient analysis. In this paper, we propose using a graph database to improve scalability and accommodate larger graphs. Centrality measures are extracted as features for a random forest classifier, which detects fake users with high precision, recall, and accuracy. The results are promising, especially when compared with previous studies.
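To make the feature-extraction idea concrete, here is a minimal sketch of how centrality measures might be computed per user and collected into feature vectors. It is not the authors' implementation. The abstract does not say which centrality measures or which graph database were used, so degree and closeness centrality are assumptions chosen for illustration, a plain in-memory adjacency map stands in for the graph database, and the user names and edges are hypothetical.

```python
from collections import deque


def build_adjacency(edges):
    """Build an undirected adjacency map from (user, user) pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def degree_centrality(adj):
    """Fraction of the other nodes that each node is directly connected to."""
    n = len(adj)
    if n <= 1:
        return {u: 0.0 for u in adj}
    return {u: len(nbrs) / (n - 1) for u, nbrs in adj.items()}


def closeness_centrality(adj):
    """Reachable-node count divided by the sum of shortest-path distances."""
    result = {}
    for source in adj:
        # Breadth-first search gives shortest hop counts in an unweighted graph.
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total > 0 else 0.0
    return result


# Hypothetical toy network; "bot1" is a sparsely connected account.
edges = [
    ("alice", "bob"),
    ("alice", "carol"),
    ("bob", "carol"),
    ("carol", "dave"),
    ("dave", "bot1"),
]

adj = build_adjacency(edges)
deg = degree_centrality(adj)
clo = closeness_centrality(adj)

# One feature vector per user: [degree centrality, closeness centrality].
features = {user: [deg[user], clo[user]] for user in adj}

for user in sorted(features):
    print(user, features[user])
```

With labeled examples of real and fake accounts, vectors like these could then be passed to a random forest implementation (for instance, scikit-learn's `RandomForestClassifier`) for training and prediction. On a large network the centrality computation would be delegated to the graph database rather than done in application memory, which is the scalability argument the abstract makes.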