An efficient hybrid Hopfield convolutional neural network for detecting spam bots in Twitter platform

A.V. Santhosh Kumar , N. Suresh Kumar , R. Kanniga Devi
{"title":"An efficient hybrid Hopfield convolutional neural network for detecting spam bots in Twitter platform","authors":"A.V. Santhosh Kumar ,&nbsp;N. Suresh Kumar ,&nbsp;R. Kanniga Devi","doi":"10.1016/j.ijcce.2025.05.003","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, social media platforms have become very popular as they offer unbelievable opportunities to their users. Twitter is one of the social media platforms on which a huge number of people exchange their messages by posting tweets. However, this platform is usually used by automated accounts called bots. Such bots are used to spread fake news, fake ideas, and products. Hence, it is essential to detect the presence of spam bots on Twitter. In order to detect spam bots on Twitter, an effective feature selection technique using a novel hybrid deep learning model is introduced in this paper. This paper proposes a novel spam bot detection system for the Twitter social network that combines profile and tweet-based features. Initially, the Twitter data are pre-processed to improve the accuracy of classification. The pre-processing stage involves various steps such as stopping word removal, tokenization, stemming, n-gram identification, user mention, and vocabulary density and richness. After pre-processing, the tweets are given to the next stage for feature extraction. In this stage, the user profile-based features such as name, screen name, location, and time, as well as the tweet-based features such as hashtags, retweeting of tweets, etc., are extracted from the tweets. The extracted features are then subjected to feature selection, where a meta-heuristic-based optimization algorithm called the Binary Golden Search Optimization algorithm (BGSO) is used. This method helps to reduce the feature dimensionality and overfitting issues. In order to improve the optimization algorithm’s searching ability, an X-shaped transfer function is used. Finally, the selected features are provided to the novel Hybrid Hopfield Dilated Depthwise Separable Convolutional Neural Network (HHD<sup>2</sup>SCNN) based classification model, where the output layer classifies the given tweets as spam bots or legitimate. The proposed method is experimentally verified, and the performance metrics are evaluated. Simulation is done using the Python tool, and the Cresci 2017 dataset is used. Simulation results show that the proposed HHD<sup>2</sup>SCNN model provides better accuracy, having an accuracy of 98.40 % compared to the existing techniques. Also, the proposed hybrid deep learning model achieved a precision of 98.40 %, recall of 98.40 %, specificity of 98.40 %, F-score of 98.40 %, and kappa of 96.80 %. Thus, the proposed technique achieves better results compared to the existing techniques.</div></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"6 ","pages":"Pages 569-587"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307425000270","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, social media platforms have become very popular as they offer unbelievable opportunities to their users. Twitter is one of the social media platforms on which a huge number of people exchange their messages by posting tweets. However, this platform is usually used by automated accounts called bots. Such bots are used to spread fake news, fake ideas, and products. Hence, it is essential to detect the presence of spam bots on Twitter. In order to detect spam bots on Twitter, an effective feature selection technique using a novel hybrid deep learning model is introduced in this paper. This paper proposes a novel spam bot detection system for the Twitter social network that combines profile and tweet-based features. Initially, the Twitter data are pre-processed to improve the accuracy of classification. The pre-processing stage involves various steps such as stopping word removal, tokenization, stemming, n-gram identification, user mention, and vocabulary density and richness. After pre-processing, the tweets are given to the next stage for feature extraction. In this stage, the user profile-based features such as name, screen name, location, and time, as well as the tweet-based features such as hashtags, retweeting of tweets, etc., are extracted from the tweets. The extracted features are then subjected to feature selection, where a meta-heuristic-based optimization algorithm called the Binary Golden Search Optimization algorithm (BGSO) is used. This method helps to reduce the feature dimensionality and overfitting issues. In order to improve the optimization algorithm’s searching ability, an X-shaped transfer function is used. Finally, the selected features are provided to the novel Hybrid Hopfield Dilated Depthwise Separable Convolutional Neural Network (HHD2SCNN) based classification model, where the output layer classifies the given tweets as spam bots or legitimate. The proposed method is experimentally verified, and the performance metrics are evaluated. Simulation is done using the Python tool, and the Cresci 2017 dataset is used. Simulation results show that the proposed HHD2SCNN model provides better accuracy, having an accuracy of 98.40 % compared to the existing techniques. Also, the proposed hybrid deep learning model achieved a precision of 98.40 %, recall of 98.40 %, specificity of 98.40 %, F-score of 98.40 %, and kappa of 96.80 %. Thus, the proposed technique achieves better results compared to the existing techniques.
基于混合Hopfield卷积神经网络的Twitter平台垃圾邮件机器人检测
最近,社交媒体平台变得非常受欢迎,因为它们为用户提供了难以置信的机会。推特是一种社交媒体平台,大量的人通过发布推特来交换信息。然而,这个平台通常被称为机器人的自动账户使用。这些机器人被用来传播假新闻、假想法和假产品。因此,检测Twitter上的垃圾邮件机器人是至关重要的。为了检测Twitter上的垃圾邮件机器人,本文介绍了一种使用新型混合深度学习模型的有效特征选择技术。本文提出了一种新的针对Twitter社交网络的垃圾邮件机器人检测系统,该系统结合了个人资料和基于tweet的特征。首先,对Twitter数据进行预处理,以提高分类的准确性。预处理阶段包括各种步骤,如停止单词删除、标记化、词干提取、n-gram识别、用户提及、词汇密度和丰富度。预处理后的推文进入下一阶段进行特征提取。在此阶段,从tweets中提取基于用户profile的特征,如姓名、屏幕名称、位置、时间等,以及基于tweet的特征,如hashtag、tweets的转发等。然后对提取的特征进行特征选择,其中使用基于元启发式的优化算法,称为二元黄金搜索优化算法(BGSO)。该方法有助于减少特征维数和过拟合问题。为了提高优化算法的搜索能力,采用了x形传递函数。最后,将选择的特征提供给基于混合Hopfield扩展深度可分离卷积神经网络(HHD2SCNN)的新型分类模型,其中输出层将给定的推文分类为垃圾邮件机器人或合法推文。实验验证了该方法的有效性,并对其性能指标进行了评价。仿真使用Python工具完成,并使用Cresci 2017数据集。仿真结果表明,与现有技术相比,所提出的HHD2SCNN模型具有更好的准确率,准确率达到98.40%。混合深度学习模型的准确率为98.40%,召回率为98.40%,特异性为98.40%,f分数为98.40%,kappa为96.80%。因此,与现有技术相比,所提出的技术取得了更好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
13.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信