Michael Jonathan Ekosputra, Angela Susanto, Ferdiana Haryanto, Derwin Suhartono
{"title":"Supervised Machine Learning Algorithms to Detect Instagram Fake Accounts","authors":"Michael Jonathan Ekosputra, Angela Susanto, Ferdiana Haryanto, Derwin Suhartono","doi":"10.1109/ISRITI54043.2021.9702833","DOIUrl":null,"url":null,"abstract":"Instagram is extremely popular because many celebrities and their fan pages use Instagram as the platform for them to communicate. Instagram offers many media sharing features and has proven to be the most popular social media platform for promoting many brands. As the most popular platform, Instagram also has fake users. Regrettably, some people do malicious activities using fake accounts such as impersonating artists or influencers, hate comments and spread rumors to become viral. Hence, this research aims to detect Instagram fake users based on the user's profile. There are several stages before account authenticity detection is successful, starting from data pre-processing, selecting a classification model, and classification evaluation. The algorithms that are used to create the supervised machine learning model are Logistic Regression, Bernoulli Naive Bayes, Random Forest, Support Vector Machine, and Artificial Neural Network (ANN). This paper tried two experiments. The first is that the default state of the model has no parameters, and no features are added. Second, to improve the accuracy, new features and tuning parameters were added in the experiment. Models that perform better than other models based on the second experiment with new features and parameters are Logistic Regression and Random Forest, with an accuracy of 0.93.","PeriodicalId":156265,"journal":{"name":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI54043.2021.9702833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Instagram is extremely popular because many celebrities and their fan pages use Instagram as the platform for them to communicate. Instagram offers many media sharing features and has proven to be the most popular social media platform for promoting many brands. As the most popular platform, Instagram also has fake users. Regrettably, some people do malicious activities using fake accounts such as impersonating artists or influencers, hate comments and spread rumors to become viral. Hence, this research aims to detect Instagram fake users based on the user's profile. There are several stages before account authenticity detection is successful, starting from data pre-processing, selecting a classification model, and classification evaluation. The algorithms that are used to create the supervised machine learning model are Logistic Regression, Bernoulli Naive Bayes, Random Forest, Support Vector Machine, and Artificial Neural Network (ANN). This paper tried two experiments. The first is that the default state of the model has no parameters, and no features are added. Second, to improve the accuracy, new features and tuning parameters were added in the experiment. Models that perform better than other models based on the second experiment with new features and parameters are Logistic Regression and Random Forest, with an accuracy of 0.93.