{"title":"探索基于nlp的社交媒体节制挑战解决方案","authors":"Heba Saleous, Marton Gergely, Khaled Shuaib","doi":"10.1155/hbe2/9436490","DOIUrl":null,"url":null,"abstract":"<p>The rise of social media has revolutionized global communication, enabling users and businesses to connect, advertise, and monitor competitors. However, this expansion has also fueled toxic behaviors like hate speech and harassment, exposing innocent users to harmful content while overwhelming human moderators and impacting their well-being. To address these challenges, artificial intelligence (AI) and natural language processing (NLP) have been explored as potential solutions. The aim of this paper is to study existing AI-based moderation approaches to understand which models have been used, their effectiveness, and the challenges they face. This work conducts a targeted systematic literature review of research efforts that present a technical approach to the topic while sharing model results and highlighting the challenges encountered. The findings reveal that AI-driven moderation shows promise by achieving high accuracy but has some issues that need to be addressed, such as dataset imbalance, obstacles and inconsistencies, bias, and misinterpretation of message meanings. By summarizing existing research efforts and identifying key gaps, this study provides insights into the strengths and weaknesses of current AI-based solutions for content moderation.</p>","PeriodicalId":36408,"journal":{"name":"Human Behavior and Emerging Technologies","volume":"2025 1","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/hbe2/9436490","citationCount":"0","resultStr":"{\"title\":\"Exploring NLP-Based Solutions to Social Media Moderation Challenges\",\"authors\":\"Heba Saleous, Marton Gergely, Khaled Shuaib\",\"doi\":\"10.1155/hbe2/9436490\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The rise of social media has revolutionized global communication, enabling users and businesses to connect, advertise, and monitor competitors. However, this expansion has also fueled toxic behaviors like hate speech and harassment, exposing innocent users to harmful content while overwhelming human moderators and impacting their well-being. To address these challenges, artificial intelligence (AI) and natural language processing (NLP) have been explored as potential solutions. The aim of this paper is to study existing AI-based moderation approaches to understand which models have been used, their effectiveness, and the challenges they face. This work conducts a targeted systematic literature review of research efforts that present a technical approach to the topic while sharing model results and highlighting the challenges encountered. The findings reveal that AI-driven moderation shows promise by achieving high accuracy but has some issues that need to be addressed, such as dataset imbalance, obstacles and inconsistencies, bias, and misinterpretation of message meanings. By summarizing existing research efforts and identifying key gaps, this study provides insights into the strengths and weaknesses of current AI-based solutions for content moderation.</p>\",\"PeriodicalId\":36408,\"journal\":{\"name\":\"Human Behavior and Emerging Technologies\",\"volume\":\"2025 1\",\"pages\":\"\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1155/hbe2/9436490\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human Behavior and Emerging Technologies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1155/hbe2/9436490\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Behavior and Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/hbe2/9436490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}
Exploring NLP-Based Solutions to Social Media Moderation Challenges
The rise of social media has revolutionized global communication, enabling users and businesses to connect, advertise, and monitor competitors. However, this expansion has also fueled toxic behaviors like hate speech and harassment, exposing innocent users to harmful content while overwhelming human moderators and impacting their well-being. To address these challenges, artificial intelligence (AI) and natural language processing (NLP) have been explored as potential solutions. The aim of this paper is to study existing AI-based moderation approaches to understand which models have been used, their effectiveness, and the challenges they face. This work conducts a targeted systematic literature review of research efforts that present a technical approach to the topic while sharing model results and highlighting the challenges encountered. The findings reveal that AI-driven moderation shows promise by achieving high accuracy but has some issues that need to be addressed, such as dataset imbalance, obstacles and inconsistencies, bias, and misinterpretation of message meanings. By summarizing existing research efforts and identifying key gaps, this study provides insights into the strengths and weaknesses of current AI-based solutions for content moderation.
期刊介绍:
Human Behavior and Emerging Technologies is an interdisciplinary journal dedicated to publishing high-impact research that enhances understanding of the complex interactions between diverse human behavior and emerging digital technologies.