Comparative Analysis of Machine Learning Techniques using Customer Feedback Reviews of Oil and Gas Companies

Layth Nabeel Alrawi, Osama Ibraheem Ashour Ashour
{"title":"Comparative Analysis of Machine Learning Techniques using Customer Feedback Reviews of Oil and Gas Companies","authors":"Layth Nabeel Alrawi, Osama Ibraheem Ashour Ashour","doi":"10.1145/3436829.3436871","DOIUrl":null,"url":null,"abstract":"Sentiment analysis is the process of computationally identifying and categorizing opinions from a piece of text to determine whether the writer's attitude towards a practical topic, products or services is positive, negative or neutral. In this study, Machine Learning techniques are used to perform sentiment analysis on Oil and Gas customer feedback data. We present a comparison of different classification algorithms used for opinion mining, including Support Vector Machine (SVM), Naïve Bayes (NB), Instance Based Learning (IB3), Random Forest (RF), Partial Decision trees (PART), and Logit Boost (LB). Many studies have been performed on sentiment analysis in different sectors, but research into Oil and Gas customer feedback has been limited. Therefore, we have targeted a pathless sector, namely the Petroleum sector, where companies express their opinions towards specific products or services. Waikato Environment for Knowledge Analysis (WEKA) is used for experimental results. The WEKA environment is open source software entailing a collection of machine learning algorithms to solve data mining problems. The main aim of this study is to evaluate the efficiency of the above mentioned classifiers in terms of Precision, Recall, F-Measure and Accuracy. The findings of the comparison analysis indicate that the Naïve-Bayes classifier gives the best Accuracy of all classifiers. A small dataset could be considered as a limitation to our study due to the difficulty of gaining more datasets at the time of the research. However, this research will play a vital role for researchers in making decisions about the algorithm that they are going to use to solve their data mining problems.","PeriodicalId":162157,"journal":{"name":"Proceedings of the 9th International Conference on Software and Information Engineering","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 9th International Conference on Software and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3436829.3436871","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Sentiment analysis is the process of computationally identifying and categorizing opinions from a piece of text to determine whether the writer's attitude towards a practical topic, products or services is positive, negative or neutral. In this study, Machine Learning techniques are used to perform sentiment analysis on Oil and Gas customer feedback data. We present a comparison of different classification algorithms used for opinion mining, including Support Vector Machine (SVM), Naïve Bayes (NB), Instance Based Learning (IB3), Random Forest (RF), Partial Decision trees (PART), and Logit Boost (LB). Many studies have been performed on sentiment analysis in different sectors, but research into Oil and Gas customer feedback has been limited. Therefore, we have targeted a pathless sector, namely the Petroleum sector, where companies express their opinions towards specific products or services. Waikato Environment for Knowledge Analysis (WEKA) is used for experimental results. The WEKA environment is open source software entailing a collection of machine learning algorithms to solve data mining problems. The main aim of this study is to evaluate the efficiency of the above mentioned classifiers in terms of Precision, Recall, F-Measure and Accuracy. The findings of the comparison analysis indicate that the Naïve-Bayes classifier gives the best Accuracy of all classifiers. A small dataset could be considered as a limitation to our study due to the difficulty of gaining more datasets at the time of the research. However, this research will play a vital role for researchers in making decisions about the algorithm that they are going to use to solve their data mining problems.
基于油气公司客户反馈评论的机器学习技术对比分析
情感分析是从一篇文章中计算识别和分类观点的过程,以确定作者对实际话题、产品或服务的态度是积极、消极还是中立。在本研究中,机器学习技术用于对石油和天然气客户反馈数据进行情感分析。我们比较了用于意见挖掘的不同分类算法,包括支持向量机(SVM)、Naïve贝叶斯(NB)、基于实例的学习(IB3)、随机森林(RF)、部分决策树(PART)和Logit Boost (LB)。许多研究都是针对不同行业的情绪分析进行的,但对油气行业客户反馈的研究却很有限。因此,我们瞄准了一个无路径的行业,即石油行业,公司对特定的产品或服务表达自己的意见。实验结果采用怀卡托知识分析环境(WEKA)。WEKA环境是一个开源软件,它包含一组机器学习算法来解决数据挖掘问题。本研究的主要目的是评估上述分类器在Precision, Recall, F-Measure和Accuracy方面的效率。对比分析结果表明,Naïve-Bayes分类器在所有分类器中准确率最高。由于在研究时难以获得更多的数据集,因此小数据集可能被认为是我们研究的限制。然而,这项研究将对研究人员在决定他们将要使用的算法来解决他们的数据挖掘问题方面发挥至关重要的作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信