推特蜂鸣器检测系统采用推特相似度特征和支持向量机

A. Mustofa, Fitrah Maharani Humaira, Myrna Ermawati, Peni Sriwahyu Natasari, Akhmad Arif Kurdianto, Aries Alfian Prasetyo, A. Faisal
{"title":"推特蜂鸣器检测系统采用推特相似度特征和支持向量机","authors":"A. Mustofa, Fitrah Maharani Humaira, Myrna Ermawati, Peni Sriwahyu Natasari, Akhmad Arif Kurdianto, Aries Alfian Prasetyo, A. Faisal","doi":"10.36564/njca.v8i1.306","DOIUrl":null,"url":null,"abstract":"Over the past few years, people have been able to get and share information through social media easily. Some of that information can be a false issue created by a buzzer account that intends to influence people into a specific opinion. Politicians often use social media to maintain a good image in society by utilizing buzzer accounts. The main characteristic of a buzzer account is that they upload the same content repeatedly within a certain period. Before analyzing data taken from social media such as Twitter, we need a buzzer detection system to filter data from buzzer users.  This research attempts to build a buzzer detection system using text processing and classification method. We use the similarity of tweets as a feature for the buzzer detection system by applying Cosine Similarity to the Term Frequency - Inverse Document Frequency (TF-IDF) feature of the tweets. In addition, we will use other features such as the number of followers, number of followings, the intensity of tweets, the ratio of retweets, and the ratio of tweets that contain links as additional features in this study. This research uses these features as inputs to the Support Vector Machine model to determine whether an account is a buzzer or not. This system has promising results by having 89% accuracy, 86.67% precision, 70.91 % recall, and 78% F1-score.","PeriodicalId":102207,"journal":{"name":"NJCA (Nusantara Journal of Computers and Its Applications)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"TWITTER BUZZER DETECTION SYSTEM USING TWEET SIMILARITY FEATURE AND SUPPORT VECTOR MACHINE\",\"authors\":\"A. Mustofa, Fitrah Maharani Humaira, Myrna Ermawati, Peni Sriwahyu Natasari, Akhmad Arif Kurdianto, Aries Alfian Prasetyo, A. Faisal\",\"doi\":\"10.36564/njca.v8i1.306\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the past few years, people have been able to get and share information through social media easily. Some of that information can be a false issue created by a buzzer account that intends to influence people into a specific opinion. Politicians often use social media to maintain a good image in society by utilizing buzzer accounts. The main characteristic of a buzzer account is that they upload the same content repeatedly within a certain period. Before analyzing data taken from social media such as Twitter, we need a buzzer detection system to filter data from buzzer users.  This research attempts to build a buzzer detection system using text processing and classification method. We use the similarity of tweets as a feature for the buzzer detection system by applying Cosine Similarity to the Term Frequency - Inverse Document Frequency (TF-IDF) feature of the tweets. In addition, we will use other features such as the number of followers, number of followings, the intensity of tweets, the ratio of retweets, and the ratio of tweets that contain links as additional features in this study. This research uses these features as inputs to the Support Vector Machine model to determine whether an account is a buzzer or not. This system has promising results by having 89% accuracy, 86.67% precision, 70.91 % recall, and 78% F1-score.\",\"PeriodicalId\":102207,\"journal\":{\"name\":\"NJCA (Nusantara Journal of Computers and Its Applications)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NJCA (Nusantara Journal of Computers and Its Applications)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36564/njca.v8i1.306\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NJCA (Nusantara Journal of Computers and Its Applications)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36564/njca.v8i1.306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在过去的几年里,人们已经能够很容易地通过社交媒体获取和分享信息。其中一些信息可能是由蜂鸣器账户制造的虚假问题,旨在影响人们形成特定的观点。政客们经常利用社交媒体,利用蜂鸣器账号来维护自己在社会中的良好形象。蜂鸣器账号的主要特点是在一定时间内反复上传相同的内容。在分析从Twitter等社交媒体获取的数据之前,我们需要一个蜂鸣器检测系统来过滤蜂鸣器用户的数据。本研究试图利用文本处理和分类的方法构建一个蜂鸣器检测系统。我们将推文的相似度作为蜂鸣器检测系统的一个特征,方法是将余弦相似度应用于推文的词频-逆文档频率(TF-IDF)特征。此外,我们将使用其他特征,如关注者数量、关注者数量、推文强度、转发比例和包含链接的推文比例作为本研究的附加特征。本研究使用这些特征作为支持向量机模型的输入,以确定一个帐户是否是蜂鸣器。该系统具有89%的正确率、86.67%的精密度、70.91%的召回率和78%的f1得分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
TWITTER BUZZER DETECTION SYSTEM USING TWEET SIMILARITY FEATURE AND SUPPORT VECTOR MACHINE
Over the past few years, people have been able to get and share information through social media easily. Some of that information can be a false issue created by a buzzer account that intends to influence people into a specific opinion. Politicians often use social media to maintain a good image in society by utilizing buzzer accounts. The main characteristic of a buzzer account is that they upload the same content repeatedly within a certain period. Before analyzing data taken from social media such as Twitter, we need a buzzer detection system to filter data from buzzer users.  This research attempts to build a buzzer detection system using text processing and classification method. We use the similarity of tweets as a feature for the buzzer detection system by applying Cosine Similarity to the Term Frequency - Inverse Document Frequency (TF-IDF) feature of the tweets. In addition, we will use other features such as the number of followers, number of followings, the intensity of tweets, the ratio of retweets, and the ratio of tweets that contain links as additional features in this study. This research uses these features as inputs to the Support Vector Machine model to determine whether an account is a buzzer or not. This system has promising results by having 89% accuracy, 86.67% precision, 70.91 % recall, and 78% F1-score.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信