Detecting Indonesian Spammer on Twitter

2018 6th International Conference on Information and Communication Technology (ICoICT) Pub Date : 2018-05-01 DOI:10.1109/ICOICT.2018.8528773

E. B. Setiawan, D. H. Widyantoro, K. Surendro

引用次数: 5

Abstract

Nowadays, Twitter is one of the most popular social media today. However, Twitter has several problems that have negative impacts to the users, one of which is spam. We introduce a different approach compared to previous research are the scope of Indonesian-language Twitter, crawling automatically for user and tweets data, as well as the addition of new features. We use two features dimension, i.e., user-based and tweet-based. In this paper, we detect Indonesian spammers on Twitter using four classification algorithms, namely Naïve Bayes (NB), Support Vector Machine (SVM), Logistic Regression (Logit), and J48. The results are confirmed for having better accuracy that of the existing. The highest accuracy of 93,67% is achieved using Logistic Regression (Logit).

查看原文本刊更多论文

检测印尼垃圾邮件在Twitter上

如今，Twitter是当今最受欢迎的社交媒体之一。然而，Twitter有几个对用户有负面影响的问题，其中一个是垃圾邮件。与之前的研究相比，我们引入了一种不同的方法，即印尼语Twitter的范围，自动抓取用户和推文数据，以及添加新功能。我们使用两个特征维度，即基于用户和基于tweet。本文使用Naïve贝叶斯(NB)、支持向量机(SVM)、逻辑回归(Logit)和J48四种分类算法检测Twitter上的印尼垃圾邮件发送者。结果表明，该方法具有较好的精度。使用逻辑回归(Logit)达到了93.67%的最高准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 6th International Conference on Information and Communication Technology (ICoICT)

自引率

0.00%

发文量