Twitter垃圾邮件账户数据集的情感分析与分类

Gaurav N. Shetty, Ashwin Nair, Pradyumna Vishwanath, Ahuja Stuti
{"title":"Twitter垃圾邮件账户数据集的情感分析与分类","authors":"Gaurav N. Shetty, Ashwin Nair, Pradyumna Vishwanath, Ahuja Stuti","doi":"10.1109/ACCTHPA49271.2020.9213206","DOIUrl":null,"url":null,"abstract":"The amount of people using social media is very large and is increasing day by day. The impact of public figures in social media is quite big. Fake accounts are created in social media platforms and are used for various purposes like inflating the follower list of a particular account. These accounts also called spam accounts usually post spam messages which are used for marketing certain products or spreading particular agendas. Such accounts can be dangerous as they may alter a normal user’s perspective on certain topics. These accounts are used to modify and help in creating a fake sense of popularity which can influence political and social situations. In this project, we try to examine some of the existing methods and approaches for fake Twitter accounts detection. We will make use of a public dataset which contains tweets and account information of both Legitimate accounts as well as spam accounts. We make use of account information to create a classifier which can easily classify whether the given account is a fake account or a legitimate account. We also apply sentiment analysis algorithms on the tweet to find patterns among them. We try to analyse the sentiments behind the tweets of different accounts. Comparing our model with the existing model we will improve the features present in our model. In the process of building a better model, we try to reduce overfitting. The final result is an optimum classifier, which can be used to separate a fake account from a list of legitimate accounts.","PeriodicalId":191794,"journal":{"name":"2020 Advanced Computing and Communication Technologies for High Performance Applications (ACCTHPA)","volume":"34 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Sentiment Analysis and Classification on Twitter Spam Account Dataset\",\"authors\":\"Gaurav N. Shetty, Ashwin Nair, Pradyumna Vishwanath, Ahuja Stuti\",\"doi\":\"10.1109/ACCTHPA49271.2020.9213206\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The amount of people using social media is very large and is increasing day by day. The impact of public figures in social media is quite big. Fake accounts are created in social media platforms and are used for various purposes like inflating the follower list of a particular account. These accounts also called spam accounts usually post spam messages which are used for marketing certain products or spreading particular agendas. Such accounts can be dangerous as they may alter a normal user’s perspective on certain topics. These accounts are used to modify and help in creating a fake sense of popularity which can influence political and social situations. In this project, we try to examine some of the existing methods and approaches for fake Twitter accounts detection. We will make use of a public dataset which contains tweets and account information of both Legitimate accounts as well as spam accounts. We make use of account information to create a classifier which can easily classify whether the given account is a fake account or a legitimate account. We also apply sentiment analysis algorithms on the tweet to find patterns among them. We try to analyse the sentiments behind the tweets of different accounts. Comparing our model with the existing model we will improve the features present in our model. In the process of building a better model, we try to reduce overfitting. The final result is an optimum classifier, which can be used to separate a fake account from a list of legitimate accounts.\",\"PeriodicalId\":191794,\"journal\":{\"name\":\"2020 Advanced Computing and Communication Technologies for High Performance Applications (ACCTHPA)\",\"volume\":\"34 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 Advanced Computing and Communication Technologies for High Performance Applications (ACCTHPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACCTHPA49271.2020.9213206\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Advanced Computing and Communication Technologies for High Performance Applications (ACCTHPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACCTHPA49271.2020.9213206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

使用社交媒体的人数非常多,而且每天都在增加。公众人物在社交媒体上的影响力相当大。虚假账户是在社交媒体平台上创建的,用于各种目的,比如扩大特定账户的关注者名单。这些帐户也被称为垃圾邮件帐户,通常发布垃圾邮件,用于营销某些产品或传播特定议程。这样的账号可能很危险,因为它们可能会改变普通用户对某些话题的看法。这些账号被用来修改和帮助制造一种虚假的受欢迎感,从而影响政治和社会局势。在这个项目中,我们试图研究一些现有的方法和方法来检测虚假Twitter账户。我们将利用一个公共数据集,其中包含合法账户和垃圾账户的推文和账户信息。我们利用账户信息创建一个分类器,可以很容易地区分给定的账户是假账户还是合法账户。我们还对推文应用情感分析算法来发现它们之间的模式。我们试图分析不同账户推文背后的情绪。将我们的模型与现有模型进行比较,我们将改进模型中的特征。在构建更好的模型的过程中,我们尽量减少过拟合。最终的结果是一个最佳分类器,它可以用来将假帐户从合法帐户列表中分离出来。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sentiment Analysis and Classification on Twitter Spam Account Dataset
The amount of people using social media is very large and is increasing day by day. The impact of public figures in social media is quite big. Fake accounts are created in social media platforms and are used for various purposes like inflating the follower list of a particular account. These accounts also called spam accounts usually post spam messages which are used for marketing certain products or spreading particular agendas. Such accounts can be dangerous as they may alter a normal user’s perspective on certain topics. These accounts are used to modify and help in creating a fake sense of popularity which can influence political and social situations. In this project, we try to examine some of the existing methods and approaches for fake Twitter accounts detection. We will make use of a public dataset which contains tweets and account information of both Legitimate accounts as well as spam accounts. We make use of account information to create a classifier which can easily classify whether the given account is a fake account or a legitimate account. We also apply sentiment analysis algorithms on the tweet to find patterns among them. We try to analyse the sentiments behind the tweets of different accounts. Comparing our model with the existing model we will improve the features present in our model. In the process of building a better model, we try to reduce overfitting. The final result is an optimum classifier, which can be used to separate a fake account from a list of legitimate accounts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信