针对网络欺凌检测的推文分析

Shipra Mathur, Shivam Isarka, Bhuvaneswar Dharmasivam, J. C. D.
{"title":"针对网络欺凌检测的推文分析","authors":"Shipra Mathur, Shivam Isarka, Bhuvaneswar Dharmasivam, J. C. D.","doi":"10.1109/ICSCCC58608.2023.10176416","DOIUrl":null,"url":null,"abstract":"Cyberbullying takes place online on gadgets like smartphones and computers. Cyberbullying can occur through social media platforms. This paper presents a real-time cyber-bullying detection system for Twitter using Natural Language Processing (NLP) and Machine Learning (ML). The system is trained on a dataset of cyberbullying tweets using several ML algorithms and their performance is compared. Random Forest was found to provide the best results after tuning. To achieve real-time analysis, Selenium was used to scrape tweets from a given Twitter account and store the timestamp of the already checked tweets. Additionally, an image captioning model was employed to generate descriptions for images posted on the account and compare them with user-written captions to filter out spam tweets. The proposed work aims to prevent cyberbullying and provides a valuable tool for online platforms to detect and remove harmful content. The results of this study have shown that the selection of appropriate ML algorithms and preprocessing techniques significantly impact the performance of cyberbullying detection on Twitter. Our model sheds light on the appropriateness of different ML algorithms for the detection of cyberbullying.","PeriodicalId":359466,"journal":{"name":"2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Analysis of Tweets for Cyberbullying Detection\",\"authors\":\"Shipra Mathur, Shivam Isarka, Bhuvaneswar Dharmasivam, J. C. D.\",\"doi\":\"10.1109/ICSCCC58608.2023.10176416\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cyberbullying takes place online on gadgets like smartphones and computers. Cyberbullying can occur through social media platforms. This paper presents a real-time cyber-bullying detection system for Twitter using Natural Language Processing (NLP) and Machine Learning (ML). The system is trained on a dataset of cyberbullying tweets using several ML algorithms and their performance is compared. Random Forest was found to provide the best results after tuning. To achieve real-time analysis, Selenium was used to scrape tweets from a given Twitter account and store the timestamp of the already checked tweets. Additionally, an image captioning model was employed to generate descriptions for images posted on the account and compare them with user-written captions to filter out spam tweets. The proposed work aims to prevent cyberbullying and provides a valuable tool for online platforms to detect and remove harmful content. The results of this study have shown that the selection of appropriate ML algorithms and preprocessing techniques significantly impact the performance of cyberbullying detection on Twitter. Our model sheds light on the appropriateness of different ML algorithms for the detection of cyberbullying.\",\"PeriodicalId\":359466,\"journal\":{\"name\":\"2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSCCC58608.2023.10176416\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCCC58608.2023.10176416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

网络欺凌发生在智能手机和电脑等网络设备上。网络欺凌可以通过社交媒体平台发生。本文提出了一种基于自然语言处理(NLP)和机器学习(ML)的Twitter实时网络欺凌检测系统。该系统使用几种机器学习算法在网络欺凌推文数据集上进行训练,并比较它们的性能。随机森林在调优后提供了最好的结果。为了实现实时分析,使用Selenium从给定的Twitter帐户抓取tweet,并存储已检查tweet的时间戳。此外,使用图像字幕模型对帐户上发布的图像生成描述,并将其与用户编写的标题进行比较,以过滤掉垃圾推文。拟议的工作旨在防止网络欺凌,并为在线平台检测和删除有害内容提供有价值的工具。本研究结果表明,选择合适的ML算法和预处理技术显著影响Twitter网络欺凌检测的性能。我们的模型揭示了检测网络欺凌的不同ML算法的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis of Tweets for Cyberbullying Detection
Cyberbullying takes place online on gadgets like smartphones and computers. Cyberbullying can occur through social media platforms. This paper presents a real-time cyber-bullying detection system for Twitter using Natural Language Processing (NLP) and Machine Learning (ML). The system is trained on a dataset of cyberbullying tweets using several ML algorithms and their performance is compared. Random Forest was found to provide the best results after tuning. To achieve real-time analysis, Selenium was used to scrape tweets from a given Twitter account and store the timestamp of the already checked tweets. Additionally, an image captioning model was employed to generate descriptions for images posted on the account and compare them with user-written captions to filter out spam tweets. The proposed work aims to prevent cyberbullying and provides a valuable tool for online platforms to detect and remove harmful content. The results of this study have shown that the selection of appropriate ML algorithms and preprocessing techniques significantly impact the performance of cyberbullying detection on Twitter. Our model sheds light on the appropriateness of different ML algorithms for the detection of cyberbullying.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信