S. M. P. C. Souza, Tito B Rezende, José Nascimento, Levy G. Chaves, Darlinne H P Soto, Soroor Salavati
{"title":"Tuning machine learning models to detect bots on Twitter","authors":"S. M. P. C. Souza, Tito B Rezende, José Nascimento, Levy G. Chaves, Darlinne H P Soto, Soroor Salavati","doi":"10.1109/WCNPS50723.2020.9263756","DOIUrl":null,"url":null,"abstract":"Bot generated content on social media can spread fake news and hate speech, manipulate public opinion and influence the community on relevant topics, such as elections. Thus, bot detection in social media platforms plays an important role for the health of the platforms and for the well-being of societies. In this work, we approach the detection of bots on Twitter as a binary output problem through the analysis of account features. We propose a pipeline for feature engineering and model training, tuning and selection. We test our pipeline using 3 publicly available bot datasets, comparing the accuracy of all trained models with the model selected at the end of our pipeline.","PeriodicalId":385668,"journal":{"name":"2020 Workshop on Communication Networks and Power Systems (WCNPS)","volume":"11 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Workshop on Communication Networks and Power Systems (WCNPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCNPS50723.2020.9263756","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Bot generated content on social media can spread fake news and hate speech, manipulate public opinion and influence the community on relevant topics, such as elections. Thus, bot detection in social media platforms plays an important role for the health of the platforms and for the well-being of societies. In this work, we approach the detection of bots on Twitter as a binary output problem through the analysis of account features. We propose a pipeline for feature engineering and model training, tuning and selection. We test our pipeline using 3 publicly available bot datasets, comparing the accuracy of all trained models with the model selected at the end of our pipeline.