ENSEMBLE STACKING DALAM ANALISA SENTIMEN REAKSI VETERAN MILITER AS TERHADAP PENGAMBILALIHAN AFGHANISTAN OLEH TALIBAN

Henny Leidiyana
{"title":"ENSEMBLE STACKING DALAM ANALISA SENTIMEN REAKSI VETERAN MILITER AS TERHADAP PENGAMBILALIHAN AFGHANISTAN OLEH TALIBAN","authors":"Henny Leidiyana","doi":"10.33480/inti.v18i1.4175","DOIUrl":null,"url":null,"abstract":"Abstrak— Sentiment analysis can be used to glean information about user opinions and identify social or political trends. There have been many studies on sentiment analysis using machine learning or lexicon-based methods that have been quite impressive. However, machine learning models often have difficulty generalizing to new data due to various reasons, such as overfitting and limited training data. These models are also prone to bias and variance, which negatively affect the accuracy of their predictions. This study discusses the application of the ensemble stacking method in sentiment analysis with the topic of the takeover of Afghanistan by the Taliban. By monitoring social media, the author uses a dataset in the form of comments on YouTube news channels related to the topic raised. Several studies have shown how the ensemble stacking method predicts better than the single model. The research was carried out by creating a sentiment classification model with logistic regression machine learning algorithms, SVM, KNN, and CART then the ensemble stacking classifier formed by the base learner of the four algorithms. As a result, for a single classifier, the highest average accuracy is the logistic regression algorithm of 74.6 percent. The four algorithms are compiled and predicted by logistic regression, and the stacking ensemble classifier that is applied produces better accuracy than the stand-alone classifier, which is 75.3 percent","PeriodicalId":197142,"journal":{"name":"INTI Nusa Mandiri","volume":"316 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"INTI Nusa Mandiri","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33480/inti.v18i1.4175","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstrak— Sentiment analysis can be used to glean information about user opinions and identify social or political trends. There have been many studies on sentiment analysis using machine learning or lexicon-based methods that have been quite impressive. However, machine learning models often have difficulty generalizing to new data due to various reasons, such as overfitting and limited training data. These models are also prone to bias and variance, which negatively affect the accuracy of their predictions. This study discusses the application of the ensemble stacking method in sentiment analysis with the topic of the takeover of Afghanistan by the Taliban. By monitoring social media, the author uses a dataset in the form of comments on YouTube news channels related to the topic raised. Several studies have shown how the ensemble stacking method predicts better than the single model. The research was carried out by creating a sentiment classification model with logistic regression machine learning algorithms, SVM, KNN, and CART then the ensemble stacking classifier formed by the base learner of the four algorithms. As a result, for a single classifier, the highest average accuracy is the logistic regression algorithm of 74.6 percent. The four algorithms are compiled and predicted by logistic regression, and the stacking ensemble classifier that is applied produces better accuracy than the stand-alone classifier, which is 75.3 percent
摘要:情感分析可以用来收集用户意见的信息,并确定社会或政治趋势。已经有很多关于使用机器学习或基于词典的方法进行情感分析的研究,这些研究都令人印象深刻。然而,由于各种原因,如过度拟合和有限的训练数据,机器学习模型往往难以泛化到新的数据。这些模型也容易出现偏差和方差,这对其预测的准确性产生了负面影响。本研究以塔利班接管阿富汗为主题,探讨集合堆叠方法在情绪分析中的应用。通过对社交媒体的监测,作者使用了与所提出话题相关的YouTube新闻频道评论形式的数据集。几项研究表明,集成叠加方法的预测效果优于单一模型。研究采用逻辑回归机器学习算法、SVM、KNN和CART建立情感分类模型,然后由四种算法的基础学习器形成集成堆叠分类器。因此,对于单个分类器,最高的平均准确率是逻辑回归算法,为74.6%。四种算法通过逻辑回归进行编译和预测,应用的堆叠集成分类器比独立分类器产生更好的准确率,为75.3%
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信