Gaurav N. Shetty, Ashwin Nair, Pradyumna Vishwanath, Ahuja Stuti
{"title":"Sentiment Analysis and Classification on Twitter Spam Account Dataset","authors":"Gaurav N. Shetty, Ashwin Nair, Pradyumna Vishwanath, Ahuja Stuti","doi":"10.1109/ACCTHPA49271.2020.9213206","DOIUrl":null,"url":null,"abstract":"The amount of people using social media is very large and is increasing day by day. The impact of public figures in social media is quite big. Fake accounts are created in social media platforms and are used for various purposes like inflating the follower list of a particular account. These accounts also called spam accounts usually post spam messages which are used for marketing certain products or spreading particular agendas. Such accounts can be dangerous as they may alter a normal user’s perspective on certain topics. These accounts are used to modify and help in creating a fake sense of popularity which can influence political and social situations. In this project, we try to examine some of the existing methods and approaches for fake Twitter accounts detection. We will make use of a public dataset which contains tweets and account information of both Legitimate accounts as well as spam accounts. We make use of account information to create a classifier which can easily classify whether the given account is a fake account or a legitimate account. We also apply sentiment analysis algorithms on the tweet to find patterns among them. We try to analyse the sentiments behind the tweets of different accounts. Comparing our model with the existing model we will improve the features present in our model. In the process of building a better model, we try to reduce overfitting. The final result is an optimum classifier, which can be used to separate a fake account from a list of legitimate accounts.","PeriodicalId":191794,"journal":{"name":"2020 Advanced Computing and Communication Technologies for High Performance Applications (ACCTHPA)","volume":"34 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Advanced Computing and Communication Technologies for High Performance Applications (ACCTHPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACCTHPA49271.2020.9213206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The amount of people using social media is very large and is increasing day by day. The impact of public figures in social media is quite big. Fake accounts are created in social media platforms and are used for various purposes like inflating the follower list of a particular account. These accounts also called spam accounts usually post spam messages which are used for marketing certain products or spreading particular agendas. Such accounts can be dangerous as they may alter a normal user’s perspective on certain topics. These accounts are used to modify and help in creating a fake sense of popularity which can influence political and social situations. In this project, we try to examine some of the existing methods and approaches for fake Twitter accounts detection. We will make use of a public dataset which contains tweets and account information of both Legitimate accounts as well as spam accounts. We make use of account information to create a classifier which can easily classify whether the given account is a fake account or a legitimate account. We also apply sentiment analysis algorithms on the tweet to find patterns among them. We try to analyse the sentiments behind the tweets of different accounts. Comparing our model with the existing model we will improve the features present in our model. In the process of building a better model, we try to reduce overfitting. The final result is an optimum classifier, which can be used to separate a fake account from a list of legitimate accounts.