{"title":"Algorithmic enhancements to identify predictable components from users’ data and a framework to detect misinformation in social media","authors":"G. Dixit, A. Kushwaha","doi":"10.1080/2573234X.2022.2100834","DOIUrl":null,"url":null,"abstract":"ABSTRACT The flow of distorted information on social media platforms cannot always be handled. As a result, digital misinformation has become a significant social, political, and technological risk factor. Extant research on detecting misinformation in social networks has focused on using metadata or characteristics of influential actors (users) and their group dynamics in isolation, but less on the act (information content) itself and on developing an integrated approach. We unify them to produce a data science framework to detect valid instances of misinformation from social media such as Twitter. Here we develop novel and efficient algorithmic improvements to extract predictable components from users’ data. The model results demonstrate a significant increase in performance beyond typical incremental improvements. This research proposes a novel term weighting scheme, clique-based features, and a metadata-based feature. These contributions to the data science literature can be helpful for future studies in the social media context.","PeriodicalId":36417,"journal":{"name":"Journal of Business Analytics","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Business Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/2573234X.2022.2100834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
ABSTRACT The flow of distorted information on social media platforms cannot always be handled. As a result, digital misinformation has become a significant social, political, and technological risk factor. Extant research on detecting misinformation in social networks has focused on using metadata or characteristics of influential actors (users) and their group dynamics in isolation, but less on the act (information content) itself and on developing an integrated approach. We unify them to produce a data science framework to detect valid instances of misinformation from social media such as Twitter. Here we develop novel and efficient algorithmic improvements to extract predictable components from users’ data. The model results demonstrate a significant increase in performance beyond typical incremental improvements. This research proposes a novel term weighting scheme, clique-based features, and a metadata-based feature. These contributions to the data science literature can be helpful for future studies in the social media context.