A Frame Work for Sentiment Analysis Classification based on Comparative Study

International Journal of Soft Computing and Engineering Pub Date : 2022-05-30 DOI:10.35940/ijsce.a3524.0512222

Zahir Younis, Nidal M. S. Kafri, Wael Hasouneh

{"title":"A Frame Work for Sentiment Analysis Classification based on Comparative Study","authors":"Zahir Younis, Nidal M. S. Kafri, Wael Hasouneh","doi":"10.35940/ijsce.a3524.0512222","DOIUrl":null,"url":null,"abstract":"A number of Feature Selection and Ensemble Methods for Sentiment Analysis Classification had been introduced in many searches. This paper presents A frame work for sentiment analysis classification based on comparative study on different classification algorithms i.e., comparison between combinations of classification algorithms: Bayes, SVM, Decision Tree. We also examined the effect of using feature selection methods (statistical, wrapper, or embedded), ensemble methods (Bagging, Boosting, Stacking, or Vote), tuning parameters of methods (SVMAttributeEval, Stacking), and the effect of merging feature subsets selected by embedded method on the classification accuracy. Particularly, the results showed that accuracy depends on the feature selection method, ensemble methods, number of selected features, type of classifier, and tuning parameters of the algorithms used. A high accuracy of up to 99.85% was achieved by merging features of two embedded methods when using stacking ensemble method. Also, a high accuracy of 99.5% was achieved by tuning parameters in stacking method, and it reached 99.95% and 100% by tuning parameters in SVMAttributeEval method using statistical and machine learning approaches, respectively. Furthermore, tuning algorithms' parameters reduced the time needed to select feature subsets. Thus, these combinations of algorithms can be followed as a frame work for sentiment analyses.","PeriodicalId":173799,"journal":{"name":"International Journal of Soft Computing and Engineering","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Soft Computing and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35940/ijsce.a3524.0512222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

A number of Feature Selection and Ensemble Methods for Sentiment Analysis Classification had been introduced in many searches. This paper presents A frame work for sentiment analysis classification based on comparative study on different classification algorithms i.e., comparison between combinations of classification algorithms: Bayes, SVM, Decision Tree. We also examined the effect of using feature selection methods (statistical, wrapper, or embedded), ensemble methods (Bagging, Boosting, Stacking, or Vote), tuning parameters of methods (SVMAttributeEval, Stacking), and the effect of merging feature subsets selected by embedded method on the classification accuracy. Particularly, the results showed that accuracy depends on the feature selection method, ensemble methods, number of selected features, type of classifier, and tuning parameters of the algorithms used. A high accuracy of up to 99.85% was achieved by merging features of two embedded methods when using stacking ensemble method. Also, a high accuracy of 99.5% was achieved by tuning parameters in stacking method, and it reached 99.95% and 100% by tuning parameters in SVMAttributeEval method using statistical and machine learning approaches, respectively. Furthermore, tuning algorithms' parameters reduced the time needed to select feature subsets. Thus, these combinations of algorithms can be followed as a frame work for sentiment analyses.

查看原文本刊更多论文

基于比较研究的情感分析分类框架

在许多搜索中，情感分析分类的特征选择和集成方法已经被引入。本文通过对不同分类算法的比较研究，即贝叶斯、支持向量机、决策树分类算法组合的比较，提出了一种情感分析分类框架。我们还研究了使用特征选择方法(统计、包装或嵌入)、集成方法(Bagging、Boosting、Stacking或Vote)、方法参数调优(SVMAttributeEval、Stacking)的效果，以及嵌入方法选择的合并特征子集对分类精度的影响。具体而言，结果表明，准确率取决于特征选择方法、集成方法、选择的特征数量、分类器类型和所用算法的调优参数。在使用叠加集成方法时，通过融合两种嵌入方法的特征，获得了高达99.85%的准确率。采用统计方法和机器学习方法对SVMAttributeEval方法进行参数调优，准确率分别达到99.95%和100%。此外，调优算法的参数减少了选择特征子集所需的时间。因此，这些算法的组合可以作为情感分析的框架。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Soft Computing and Engineering

自引率

0.00%

发文量