Sentiment Analysis and Classification of Forest Fires in Indonesia

Ilkom Jurnal Ilmiah Pub Date : 2023-04-07 DOI:10.33096/ilkom.v15i1.1337.175-185

I. Irawanto, Cynthia Widodo, Atin Hasanah, Prema Adhitya Dharma Kusumah, Kusirini Kusrini, Kusnawi Kusnawi

{"title":"Sentiment Analysis and Classification of Forest Fires in Indonesia","authors":"I. Irawanto, Cynthia Widodo, Atin Hasanah, Prema Adhitya Dharma Kusumah, Kusirini Kusrini, Kusnawi Kusnawi","doi":"10.33096/ilkom.v15i1.1337.175-185","DOIUrl":null,"url":null,"abstract":"so the preprocessing process must be carried out as was done in research [5], which retrieved Twitter data on the theme of COVID 2019. Moreover, weighting must be applied to tasks that must be completed prior to classification. This study uses VADER or commonly known as the lexicon. [6] It uses a lexicon that combines lexical dictionary features as a polarity assessment. Sentiment scores of 5 additional criteria, namely exclamation marks, large alphabet, level of word order, polarity shift due to the term \"but,\" and using the tri-gram feature to study negation [7]. Once the text has been labeled, we will classify it using the sentiment analysis. The Nave Bayes technique, Random Forest, and SVM are some reliable classifications that have been demonstrated in numerous research (Support Vector Machine). A popular algorithm that is frequently employed by researchers is Naïve Bayes. The following researchers have used the Naive Bayes method for sentiment analysis research: : [8] analyzing the online store JD.ID, [9] regarding awareness of procedures to prevent COVID 2019. Random Forest is rarely implemented in research on sentiment analysis, although it has recently been investigated to gauge its accuracy. It is used by a number of researchers, including [10], who achieves an accuracy of about 0.829. Moreover, the SVM (Support Vector Machine) approach, whose accuracy is 85%, is also being investigated in sentiment analysis study by [11]. Hence, researchers want to compare the values of the 3 methods namely Naïve Bayes, Random Forest and SVM (Support Vector Machine) to find out the difference in accuracy of the three when using the same data. As for the accuracy will be calculated using the calculation on the confusion matrix. In addition, the researcher also wants to compare the results of classifying sentiment statements which are divided into positive, negative and neutral sentiments.","PeriodicalId":33690,"journal":{"name":"Ilkom Jurnal Ilmiah","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ilkom Jurnal Ilmiah","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33096/ilkom.v15i1.1337.175-185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

so the preprocessing process must be carried out as was done in research [5], which retrieved Twitter data on the theme of COVID 2019. Moreover, weighting must be applied to tasks that must be completed prior to classification. This study uses VADER or commonly known as the lexicon. [6] It uses a lexicon that combines lexical dictionary features as a polarity assessment. Sentiment scores of 5 additional criteria, namely exclamation marks, large alphabet, level of word order, polarity shift due to the term "but," and using the tri-gram feature to study negation [7]. Once the text has been labeled, we will classify it using the sentiment analysis. The Nave Bayes technique, Random Forest, and SVM are some reliable classifications that have been demonstrated in numerous research (Support Vector Machine). A popular algorithm that is frequently employed by researchers is Naïve Bayes. The following researchers have used the Naive Bayes method for sentiment analysis research: : [8] analyzing the online store JD.ID, [9] regarding awareness of procedures to prevent COVID 2019. Random Forest is rarely implemented in research on sentiment analysis, although it has recently been investigated to gauge its accuracy. It is used by a number of researchers, including [10], who achieves an accuracy of about 0.829. Moreover, the SVM (Support Vector Machine) approach, whose accuracy is 85%, is also being investigated in sentiment analysis study by [11]. Hence, researchers want to compare the values of the 3 methods namely Naïve Bayes, Random Forest and SVM (Support Vector Machine) to find out the difference in accuracy of the three when using the same data. As for the accuracy will be calculated using the calculation on the confusion matrix. In addition, the researcher also wants to compare the results of classifying sentiment statements which are divided into positive, negative and neutral sentiments.

查看原文本刊更多论文

印尼森林火灾的情绪分析与分类

因此，必须像研究[5]中所做的那样进行预处理过程，该研究检索了2019冠状病毒病主题的推特数据。此外，必须对分类前必须完成的任务进行加权。这项研究使用VADER或俗称的词典。[6] 它使用一个结合了词典特征的词典作为极性评估。五个附加标准的情感得分，即感叹号、大字母表、语序水平、“但是”引起的极性变化，以及使用三元图特征研究否定[7]。一旦文本被标记，我们将使用情感分析对其进行分类。Nave Bayes技术、随机森林和SVM是一些可靠的分类，已经在许多研究（支持向量机）中得到了证明。研究人员经常使用的一种流行算法是朴素贝叶斯。以下研究人员使用朴素贝叶斯方法进行情绪分析研究：[8]分析在线商店JD.ID，[9]关于预防2019冠状病毒病的程序意识。随机森林很少用于情绪分析的研究，尽管最近已经对其准确性进行了调查。包括[10]在内的许多研究人员都在使用它，他们的准确度约为0.829。此外，[11]还在情感分析研究中研究了SVM（支持向量机）方法，其准确率为85%。因此，研究人员希望比较三种方法的值，即Naïve Bayes、随机森林和SVM（支持向量机），以找出在使用相同数据时三种方法准确性的差异。至于精度，将使用对混淆矩阵的计算来计算。此外，研究人员还想比较情绪陈述的分类结果，情绪陈述分为积极情绪、消极情绪和中性情绪。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ilkom Jurnal Ilmiah

自引率

0.00%

发文量

审稿时长

4 weeks