{"title":"Naïve Bayes with Negation Handling for Sentiment Analysis of Twitter Data","authors":"Lobna H. Kamal, Gerard McKee, N. A. Othman","doi":"10.1109/ISCMI56532.2022.10068474","DOIUrl":null,"url":null,"abstract":"This paper proposes an enhanced negation handling technique for sentiment analysis of Twitter data using the Naïve Bayes algorithm and Part-of-Speech (POS) tagging. Negation handling detects negated content in text and can thus improve sentiment prediction. The proposed technique focuses on the detection of direct negation words such as “not” and “no”, and implicitly negated content such as “could have been” and “should have been”. The paper compares the proposed negation handling technique with an existing negation handling technique. The Sentiment140 dataset is used in the experiments. Naïve Bayes with the proposed negation handling technique gave an accuracy of 77.57% while the accuracy of the Naïve Bayes with the existing negation handling was 76.93% and the accuracy of the standard Naïve Bayes was 76.12 % for a dataset of 1,000,000 tweets. Of these 1,000,000 tweets 197,381 contained one or more negations. Taking these negated tweets alone, the proposed technique showed an improvement over the existing technique and standard Naïve Bayes with accuracies respectively of 76.51%, 75.98%, and 75.09%. The improvements and shortcomings of the proposed technique are discussed.","PeriodicalId":340397,"journal":{"name":"2022 9th International Conference on Soft Computing & Machine Intelligence (ISCMI)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Soft Computing & Machine Intelligence (ISCMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCMI56532.2022.10068474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper proposes an enhanced negation handling technique for sentiment analysis of Twitter data using the Naïve Bayes algorithm and Part-of-Speech (POS) tagging. Negation handling detects negated content in text and can thus improve sentiment prediction. The proposed technique focuses on the detection of direct negation words such as “not” and “no”, and implicitly negated content such as “could have been” and “should have been”. The paper compares the proposed negation handling technique with an existing negation handling technique. The Sentiment140 dataset is used in the experiments. Naïve Bayes with the proposed negation handling technique gave an accuracy of 77.57% while the accuracy of the Naïve Bayes with the existing negation handling was 76.93% and the accuracy of the standard Naïve Bayes was 76.12 % for a dataset of 1,000,000 tweets. Of these 1,000,000 tweets 197,381 contained one or more negations. Taking these negated tweets alone, the proposed technique showed an improvement over the existing technique and standard Naïve Bayes with accuracies respectively of 76.51%, 75.98%, and 75.09%. The improvements and shortcomings of the proposed technique are discussed.
本文提出了一种使用Naïve贝叶斯算法和词性标注的Twitter数据情感分析的增强否定处理技术。否定处理检测文本中的否定内容,从而可以提高情绪预测。该技术侧重于直接否定词(如“not”和“no”)和隐含否定内容(如“could have been”和“should have been”)的检测。本文将提出的否定处理技术与现有的否定处理技术进行了比较。实验中使用Sentiment140数据集。对于1,000,000条推文数据集,Naïve贝叶斯的否定处理准确率为77.57%,而现有的Naïve贝叶斯的否定处理准确率为76.93%,标准Naïve贝叶斯的准确率为76.12%。在这1,000,000条tweet中,197,381条包含一个或多个否定。单独考虑这些否定推文,本文提出的技术比现有技术和标准Naïve贝叶斯有了改进,准确率分别为76.51%、75.98%和75.09%。讨论了该技术的改进和不足。