Shuai Wang, Abdul Samad Shibghatullah, Thirupattur Javid Iqbal, Kay Hooi Keoy
{"title":"A review of multimodal-based emotion recognition techniques for cyberbullying detection in online social media platforms","authors":"Shuai Wang, Abdul Samad Shibghatullah, Thirupattur Javid Iqbal, Kay Hooi Keoy","doi":"10.1007/s00521-024-10371-3","DOIUrl":null,"url":null,"abstract":"<p>Cyberbullying is a serious issue in online social media platforms (OSMP), which requires effective detection and intervention systems. Multimodal emotion recognition (MER) technology can help prevent cyberbullying by analyzing emotions from textual messages, vision, facial expressions, tone of voice, and physiological signals. However, existing machine learning-based MER models have limitations in accuracy and generalization. Deep learning (DL) methods have achieved remarkable successes in various tasks and have been applied to learn high-level emotional features for MER. This paper provides a systematic review of the recent research on DL-based MER for cyberbullying detection (MERCD). We first introduce the concept of cyberbullying and the general framework of MERCD, as well as the commonly used multimodal emotion datasets. Then, we overview the principles and advancements of representative DL techniques. Next, we focus on the research progress of two key steps in MERCD: emotion feature extraction from speech, vision, and text modalities; and multimodal information fusion strategies. Finally, we discuss the challenges and opportunities in designing a cyberbullying prediction model and suggest possible directions in the MERCD area for future research.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00521-024-10371-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Cyberbullying is a serious issue in online social media platforms (OSMP), which requires effective detection and intervention systems. Multimodal emotion recognition (MER) technology can help prevent cyberbullying by analyzing emotions from textual messages, vision, facial expressions, tone of voice, and physiological signals. However, existing machine learning-based MER models have limitations in accuracy and generalization. Deep learning (DL) methods have achieved remarkable successes in various tasks and have been applied to learn high-level emotional features for MER. This paper provides a systematic review of the recent research on DL-based MER for cyberbullying detection (MERCD). We first introduce the concept of cyberbullying and the general framework of MERCD, as well as the commonly used multimodal emotion datasets. Then, we overview the principles and advancements of representative DL techniques. Next, we focus on the research progress of two key steps in MERCD: emotion feature extraction from speech, vision, and text modalities; and multimodal information fusion strategies. Finally, we discuss the challenges and opportunities in designing a cyberbullying prediction model and suggest possible directions in the MERCD area for future research.