Febriora Nevia Pramitha, R. B. Hadiprakoso, Nurul Qomariasih, Girinoto
{"title":"Twitter Bot Account Detection Using Supervised Machine Learning","authors":"Febriora Nevia Pramitha, R. B. Hadiprakoso, Nurul Qomariasih, Girinoto","doi":"10.1109/ISRITI54043.2021.9702789","DOIUrl":null,"url":null,"abstract":"Twitter is a primary social media platform gaining popularity among social networking websites at an alarming rate. Twitter's popularity and relatively open nature make it an excellent target for automated programs known as bots, which are computer programs that run automatically. In addition to spamming, bots can be used for various purposes, such as inducing conversations to change the topic of discussion, modifying user popularity, contaminating materials to spread misinformation, and conducting propaganda. This study's goal was to provide a fresh perspective on estimating the possibility of an account being identified as a bot by applying Machine Learning algorithms to a variety of scenarios. Both Random Forest and XGBoost algorithms are used in this application. The inquiry began with exploratory data analysis to determine the current status of the dataset. Next comes the process of model engineering, which involves the steps of requirement gathering and specification, feature selection and optimization, hyperparameter tweaking, and algorithm benchmarking. The findings of this investigation suggest that the XGBoost algorithm outperforms Random Forest, with an accuracy of 0.8908 for XGBoost and 0.8762 for Random Forest.","PeriodicalId":156265,"journal":{"name":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI54043.2021.9702789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Twitter is a primary social media platform gaining popularity among social networking websites at an alarming rate. Twitter's popularity and relatively open nature make it an excellent target for automated programs known as bots, which are computer programs that run automatically. In addition to spamming, bots can be used for various purposes, such as inducing conversations to change the topic of discussion, modifying user popularity, contaminating materials to spread misinformation, and conducting propaganda. This study's goal was to provide a fresh perspective on estimating the possibility of an account being identified as a bot by applying Machine Learning algorithms to a variety of scenarios. Both Random Forest and XGBoost algorithms are used in this application. The inquiry began with exploratory data analysis to determine the current status of the dataset. Next comes the process of model engineering, which involves the steps of requirement gathering and specification, feature selection and optimization, hyperparameter tweaking, and algorithm benchmarking. The findings of this investigation suggest that the XGBoost algorithm outperforms Random Forest, with an accuracy of 0.8908 for XGBoost and 0.8762 for Random Forest.