S. M. P. C. Souza, Tito B Rezende, José Nascimento, Levy G. Chaves, Darlinne H P Soto, Soroor Salavati
{"title":"调整机器学习模型以检测Twitter上的机器人","authors":"S. M. P. C. Souza, Tito B Rezende, José Nascimento, Levy G. Chaves, Darlinne H P Soto, Soroor Salavati","doi":"10.1109/WCNPS50723.2020.9263756","DOIUrl":null,"url":null,"abstract":"Bot generated content on social media can spread fake news and hate speech, manipulate public opinion and influence the community on relevant topics, such as elections. Thus, bot detection in social media platforms plays an important role for the health of the platforms and for the well-being of societies. In this work, we approach the detection of bots on Twitter as a binary output problem through the analysis of account features. We propose a pipeline for feature engineering and model training, tuning and selection. We test our pipeline using 3 publicly available bot datasets, comparing the accuracy of all trained models with the model selected at the end of our pipeline.","PeriodicalId":385668,"journal":{"name":"2020 Workshop on Communication Networks and Power Systems (WCNPS)","volume":"11 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Tuning machine learning models to detect bots on Twitter\",\"authors\":\"S. M. P. C. Souza, Tito B Rezende, José Nascimento, Levy G. Chaves, Darlinne H P Soto, Soroor Salavati\",\"doi\":\"10.1109/WCNPS50723.2020.9263756\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bot generated content on social media can spread fake news and hate speech, manipulate public opinion and influence the community on relevant topics, such as elections. Thus, bot detection in social media platforms plays an important role for the health of the platforms and for the well-being of societies. In this work, we approach the detection of bots on Twitter as a binary output problem through the analysis of account features. We propose a pipeline for feature engineering and model training, tuning and selection. We test our pipeline using 3 publicly available bot datasets, comparing the accuracy of all trained models with the model selected at the end of our pipeline.\",\"PeriodicalId\":385668,\"journal\":{\"name\":\"2020 Workshop on Communication Networks and Power Systems (WCNPS)\",\"volume\":\"11 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 Workshop on Communication Networks and Power Systems (WCNPS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WCNPS50723.2020.9263756\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Workshop on Communication Networks and Power Systems (WCNPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCNPS50723.2020.9263756","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Tuning machine learning models to detect bots on Twitter
Bot generated content on social media can spread fake news and hate speech, manipulate public opinion and influence the community on relevant topics, such as elections. Thus, bot detection in social media platforms plays an important role for the health of the platforms and for the well-being of societies. In this work, we approach the detection of bots on Twitter as a binary output problem through the analysis of account features. We propose a pipeline for feature engineering and model training, tuning and selection. We test our pipeline using 3 publicly available bot datasets, comparing the accuracy of all trained models with the model selected at the end of our pipeline.