Sentiment Analysis Using AI: A Comparative Study Comparative Study of 5 Different Algorithms and Benchmarking Them with A Qualitative Analysis of Training time, Prediction time, and Accuracy
J. Biswas, M. Haid, Ishak Boyaci, Indrashis Nath, Bharat Hegde, Oliver Janssen, Oliver Kohl, Andreas Minarski
{"title":"Sentiment Analysis Using AI: A Comparative Study Comparative Study of 5 Different Algorithms and Benchmarking Them with A Qualitative Analysis of Training time, Prediction time, and Accuracy","authors":"J. Biswas, M. Haid, Ishak Boyaci, Indrashis Nath, Bharat Hegde, Oliver Janssen, Oliver Kohl, Andreas Minarski","doi":"10.1145/3459104.3459165","DOIUrl":null,"url":null,"abstract":"Sentiment analysis also called Opinion Mining has been one of the most ancient topics in Natural Language Processing (NLP). NLP which is a subfield of AI enables machines to read, understand, interpret and manipulate human languages. The netizens often express their sentiments through tweets, reviews and ratings. Therefore, it's becoming increasingly popular to analyse these sentiments for commercial applications such as market analysis, product reviews, customer services and many more. This analysis can either be lexicon-based or machine learning-based. Lexicon-based techniques use dictionaries of words, which is then used to calculate a score for the polarity. Machine learning-based classifiers use a dataset to train a model which is used to find the sentiment of a sentence or a paragraph. ML-based approaches are more robust compared to the lexicon-based approach. The main contribution of this paper is to demonstrate the performance of various Machine Learning and Deep Learning models in Sentiment Analysis. In this paper, various models viz. Multinomial Naive Bayes(MNB), Support Vector Machine(SVM), Hidden Markov Model(HMM), Long Short Term Memory(LSTM) and Bidirectional Encoder Representations from Transformers(BERT), are trained through supervised learning, and then they are benchmarked through a qualitative analysis of accuracy, effectiveness and speed to provide an extensive overview of the various algorithms used in the field. The performance comparison of the various models presents an in-depth understanding of the different techniques within the framework of a comprehensive evaluation and discussion.","PeriodicalId":142284,"journal":{"name":"2021 International Symposium on Electrical, Electronics and Information Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Electrical, Electronics and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3459104.3459165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Sentiment analysis also called Opinion Mining has been one of the most ancient topics in Natural Language Processing (NLP). NLP which is a subfield of AI enables machines to read, understand, interpret and manipulate human languages. The netizens often express their sentiments through tweets, reviews and ratings. Therefore, it's becoming increasingly popular to analyse these sentiments for commercial applications such as market analysis, product reviews, customer services and many more. This analysis can either be lexicon-based or machine learning-based. Lexicon-based techniques use dictionaries of words, which is then used to calculate a score for the polarity. Machine learning-based classifiers use a dataset to train a model which is used to find the sentiment of a sentence or a paragraph. ML-based approaches are more robust compared to the lexicon-based approach. The main contribution of this paper is to demonstrate the performance of various Machine Learning and Deep Learning models in Sentiment Analysis. In this paper, various models viz. Multinomial Naive Bayes(MNB), Support Vector Machine(SVM), Hidden Markov Model(HMM), Long Short Term Memory(LSTM) and Bidirectional Encoder Representations from Transformers(BERT), are trained through supervised learning, and then they are benchmarked through a qualitative analysis of accuracy, effectiveness and speed to provide an extensive overview of the various algorithms used in the field. The performance comparison of the various models presents an in-depth understanding of the different techniques within the framework of a comprehensive evaluation and discussion.