{"title":"使用预先训练的深度学习架构对多模态数据进行网络欺凌检测","authors":"Subbaraju Pericherla, I. E","doi":"10.16925/2357-6014.2021.03.09","DOIUrl":null,"url":null,"abstract":"Cyberbullying is a big challenging task in the social media era. The forms of bullying are increasing with the increase of digital technologies. In the past, most of the bullying happened through text messages. Now bullies take advantage of technology, they try bullying others in different forms such as images, videos, and emojis. In this paper, we proposed an approach to identify cyberbullying on both text and image data combinations. We used RoBERTa and Xception deep learning architectures to generate word embeddings from the text data and the image respectively. LightGBM classifier is used to classify bullying and non-bullying tweets. The experiments conducted on 2100 samples of combined data of text and image. The proposed approach efficiently classifies bullying data with F1-score of 80% and outperforms as compared to existing approaches.","PeriodicalId":41023,"journal":{"name":"Ingenieria Solidaria","volume":" ","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2021-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Cyberbullying detection on multi-modal data using pre-trained deep learning architectures\",\"authors\":\"Subbaraju Pericherla, I. E\",\"doi\":\"10.16925/2357-6014.2021.03.09\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cyberbullying is a big challenging task in the social media era. The forms of bullying are increasing with the increase of digital technologies. In the past, most of the bullying happened through text messages. Now bullies take advantage of technology, they try bullying others in different forms such as images, videos, and emojis. In this paper, we proposed an approach to identify cyberbullying on both text and image data combinations. We used RoBERTa and Xception deep learning architectures to generate word embeddings from the text data and the image respectively. LightGBM classifier is used to classify bullying and non-bullying tweets. The experiments conducted on 2100 samples of combined data of text and image. The proposed approach efficiently classifies bullying data with F1-score of 80% and outperforms as compared to existing approaches.\",\"PeriodicalId\":41023,\"journal\":{\"name\":\"Ingenieria Solidaria\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2021-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ingenieria Solidaria\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.16925/2357-6014.2021.03.09\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ingenieria Solidaria","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.16925/2357-6014.2021.03.09","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Cyberbullying detection on multi-modal data using pre-trained deep learning architectures
Cyberbullying is a big challenging task in the social media era. The forms of bullying are increasing with the increase of digital technologies. In the past, most of the bullying happened through text messages. Now bullies take advantage of technology, they try bullying others in different forms such as images, videos, and emojis. In this paper, we proposed an approach to identify cyberbullying on both text and image data combinations. We used RoBERTa and Xception deep learning architectures to generate word embeddings from the text data and the image respectively. LightGBM classifier is used to classify bullying and non-bullying tweets. The experiments conducted on 2100 samples of combined data of text and image. The proposed approach efficiently classifies bullying data with F1-score of 80% and outperforms as compared to existing approaches.