Supervised Machine Learning Algorithms to Detect Instagram Fake Accounts

Michael Jonathan Ekosputra, Angela Susanto, Ferdiana Haryanto, Derwin Suhartono
{"title":"Supervised Machine Learning Algorithms to Detect Instagram Fake Accounts","authors":"Michael Jonathan Ekosputra, Angela Susanto, Ferdiana Haryanto, Derwin Suhartono","doi":"10.1109/ISRITI54043.2021.9702833","DOIUrl":null,"url":null,"abstract":"Instagram is extremely popular because many celebrities and their fan pages use Instagram as the platform for them to communicate. Instagram offers many media sharing features and has proven to be the most popular social media platform for promoting many brands. As the most popular platform, Instagram also has fake users. Regrettably, some people do malicious activities using fake accounts such as impersonating artists or influencers, hate comments and spread rumors to become viral. Hence, this research aims to detect Instagram fake users based on the user's profile. There are several stages before account authenticity detection is successful, starting from data pre-processing, selecting a classification model, and classification evaluation. The algorithms that are used to create the supervised machine learning model are Logistic Regression, Bernoulli Naive Bayes, Random Forest, Support Vector Machine, and Artificial Neural Network (ANN). This paper tried two experiments. The first is that the default state of the model has no parameters, and no features are added. Second, to improve the accuracy, new features and tuning parameters were added in the experiment. Models that perform better than other models based on the second experiment with new features and parameters are Logistic Regression and Random Forest, with an accuracy of 0.93.","PeriodicalId":156265,"journal":{"name":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI54043.2021.9702833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Instagram is extremely popular because many celebrities and their fan pages use Instagram as the platform for them to communicate. Instagram offers many media sharing features and has proven to be the most popular social media platform for promoting many brands. As the most popular platform, Instagram also has fake users. Regrettably, some people do malicious activities using fake accounts such as impersonating artists or influencers, hate comments and spread rumors to become viral. Hence, this research aims to detect Instagram fake users based on the user's profile. There are several stages before account authenticity detection is successful, starting from data pre-processing, selecting a classification model, and classification evaluation. The algorithms that are used to create the supervised machine learning model are Logistic Regression, Bernoulli Naive Bayes, Random Forest, Support Vector Machine, and Artificial Neural Network (ANN). This paper tried two experiments. The first is that the default state of the model has no parameters, and no features are added. Second, to improve the accuracy, new features and tuning parameters were added in the experiment. Models that perform better than other models based on the second experiment with new features and parameters are Logistic Regression and Random Forest, with an accuracy of 0.93.
监督机器学习算法检测Instagram虚假账户
Instagram非常受欢迎,因为许多名人和他们的粉丝页面都把Instagram作为他们交流的平台。Instagram提供了许多媒体分享功能,并已被证明是推广许多品牌的最受欢迎的社交媒体平台。作为最受欢迎的平台,Instagram也有假用户。令人遗憾的是,一些人利用假冒艺术家或网红等虚假账号进行恶意活动,仇恨评论,传播谣言,使其成为病毒。因此,本研究旨在根据用户的个人资料来检测Instagram的假用户。在账户真实性检测成功之前有几个阶段,从数据预处理、选择分类模型到分类评估。用于创建监督机器学习模型的算法有逻辑回归、伯努利朴素贝叶斯、随机森林、支持向量机和人工神经网络(ANN)。本文进行了两个实验。首先,模型的默认状态没有参数,也没有添加任何特征。其次,为了提高精度,在实验中增加了新的特征和调谐参数。在第二次实验中,使用新特征和新参数的模型比其他模型表现更好的是Logistic Regression和Random Forest,准确率为0.93。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信