Shad Al Kaiser, S. Mandal, Ashraful Kalam Abid, Ekhfa Hossain, Ferdous Bin Ali, Intisar Tahmid Naheen
{"title":"Social Media Opinion Mining Based on Bangla Public Post of Facebook","authors":"Shad Al Kaiser, S. Mandal, Ashraful Kalam Abid, Ekhfa Hossain, Ferdous Bin Ali, Intisar Tahmid Naheen","doi":"10.1109/ICCIT54785.2021.9689860","DOIUrl":null,"url":null,"abstract":"Social media holds the freedom to express anyone as they are. Still, people fail to follow community standards and cross the boundary of self-limit, hurting other people, sometimes leading to cyberbullying. Social media mining is the frontier where researchers contend with ensuring safe cyberspace with the help of robust information retrieval and data mining techniques. In this paper, we are aiming towards achieving such a goal for Bangla language-spoken people. We have created a corpus that contains 11006 Bangla comments from Facebook, analyzed them demographically, annotated them to create robust classifiers to classify these comments as positive, negative, and neutral polarity. We have decomposed these polarities to further sentiments based on contents of the text varying from wishful thinking to gender-based hate speech. Our multiclass classification algorithm, consisting of TF-IDF vectorizer alongside uni-gram, bi-gram, and tri-gram followed by MNB, MNB, and KNN, gives 82.60%, 82.33%, and 79.63% accuracy, respectively.","PeriodicalId":166450,"journal":{"name":"2021 24th International Conference on Computer and Information Technology (ICCIT)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 24th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT54785.2021.9689860","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Social media holds the freedom to express anyone as they are. Still, people fail to follow community standards and cross the boundary of self-limit, hurting other people, sometimes leading to cyberbullying. Social media mining is the frontier where researchers contend with ensuring safe cyberspace with the help of robust information retrieval and data mining techniques. In this paper, we are aiming towards achieving such a goal for Bangla language-spoken people. We have created a corpus that contains 11006 Bangla comments from Facebook, analyzed them demographically, annotated them to create robust classifiers to classify these comments as positive, negative, and neutral polarity. We have decomposed these polarities to further sentiments based on contents of the text varying from wishful thinking to gender-based hate speech. Our multiclass classification algorithm, consisting of TF-IDF vectorizer alongside uni-gram, bi-gram, and tri-gram followed by MNB, MNB, and KNN, gives 82.60%, 82.33%, and 79.63% accuracy, respectively.