Mujahidul Islam, Maqsudur Rahman, M. T. Ahmed, Abu Zafor Muhammad Islam, Dipankar Das, M. M. Hoque
{"title":"Sexual Harassment Detection using Machine Learning and Deep Learning Techniques for Bangla Text","authors":"Mujahidul Islam, Maqsudur Rahman, M. T. Ahmed, Abu Zafor Muhammad Islam, Dipankar Das, M. M. Hoque","doi":"10.1109/ECCE57851.2023.10101522","DOIUrl":null,"url":null,"abstract":"Harassment is a kind of act that annoys or upsets someone. Harassment can be classified into different categories. Sexual harassment is one of them. Sexual harassment is a type of harassment that involves the use of implicit or explicit sexual overtones, including the inappropriate and unwelcome promises of rewards in exchange for sexual favors. At present time, the technology has become more advance and spread all over the place. That gave the toxic people a huge opportunity to spread toxicity in online platforms. Because of the increasing amount Bangla text in different social media platforms, we also need to filter such kinds of offensive Bangla texts. The objective of this research is to detect sexual harassment from Bangla text and classify them by using machine learning and deep learning algorithms as well as prevents them. In the experiment, we combined TF-IDF with different machine learning algorithms like Naive Bayes, Decision Tree, Random Forest, AdaBoost, SGD, Logistic Regression, KNN, SVM and got accuracy of 74.9%, 75.6%, 70.0%, 70.1%, 75.2%, 75.7%, 65.2%, 76.5% respectively. Deep learning algorithms like CNN, LSTM, hybrid CNN-LSTM were also used and achieved accuracy of 89% for all of them which is comparatively better than machine learning techniques.","PeriodicalId":131537,"journal":{"name":"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Electrical, Computer and Communication Engineering (ECCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECCE57851.2023.10101522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Harassment is a kind of act that annoys or upsets someone. Harassment can be classified into different categories. Sexual harassment is one of them. Sexual harassment is a type of harassment that involves the use of implicit or explicit sexual overtones, including the inappropriate and unwelcome promises of rewards in exchange for sexual favors. At present time, the technology has become more advance and spread all over the place. That gave the toxic people a huge opportunity to spread toxicity in online platforms. Because of the increasing amount Bangla text in different social media platforms, we also need to filter such kinds of offensive Bangla texts. The objective of this research is to detect sexual harassment from Bangla text and classify them by using machine learning and deep learning algorithms as well as prevents them. In the experiment, we combined TF-IDF with different machine learning algorithms like Naive Bayes, Decision Tree, Random Forest, AdaBoost, SGD, Logistic Regression, KNN, SVM and got accuracy of 74.9%, 75.6%, 70.0%, 70.1%, 75.2%, 75.7%, 65.2%, 76.5% respectively. Deep learning algorithms like CNN, LSTM, hybrid CNN-LSTM were also used and achieved accuracy of 89% for all of them which is comparatively better than machine learning techniques.