基于方面的CNN情感分析方法

B. M. Mulyo, D. H. Widyantoro
{"title":"基于方面的CNN情感分析方法","authors":"B. M. Mulyo, D. H. Widyantoro","doi":"10.1109/EECSI.2018.8752857","DOIUrl":null,"url":null,"abstract":"Lots of research has been done on the domain of Sentiment Analysis, for example, research that conducted by Bing Liu's (2012) [1]. Other research conducted in a SemEval competition, the domain of sentiment analysis research has been developed further up to the aspect or commonly called Aspect Based Sentiment Analysis (ABSA) [2]. The domain problem of Aspect Based Sentiment Analysis (ABSA) from SemEval is quite diverse, all of those problems arise mostly from the real data provided. Some existing problems include Implicit, Multi-label, Out Of Vocabulary (OOV), Expression extraction, and the detection of aspects and polarities. This research only focuses on classification aspect and classification of sentiment. This study uses an existing method of Convolution Neural Network (CNN) method, which was introduced again by Alex K. The study by Alex K reduces the error rate by 15%, compared in the previous year the decrease was only 5%. This research would like to propose CNN methods that have been optimized, and use Threshold (CNN-T) to select the best data in training data. This method can produce more than one aspect using one data test. The average result of this experiment using CNN-T got better F-Measure compared to CNN and 3 classic Machine Learning method, i.e. SVM, Naive Bayes, and KNN. The overall F1 score of CNN-T is 0.71, which is greater than the other comparable methods.","PeriodicalId":6543,"journal":{"name":"2018 5th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)","volume":"2 1","pages":"142-147"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Aspect-Based Sentiment Analysis Approach with CNN\",\"authors\":\"B. M. Mulyo, D. H. Widyantoro\",\"doi\":\"10.1109/EECSI.2018.8752857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Lots of research has been done on the domain of Sentiment Analysis, for example, research that conducted by Bing Liu's (2012) [1]. Other research conducted in a SemEval competition, the domain of sentiment analysis research has been developed further up to the aspect or commonly called Aspect Based Sentiment Analysis (ABSA) [2]. The domain problem of Aspect Based Sentiment Analysis (ABSA) from SemEval is quite diverse, all of those problems arise mostly from the real data provided. Some existing problems include Implicit, Multi-label, Out Of Vocabulary (OOV), Expression extraction, and the detection of aspects and polarities. This research only focuses on classification aspect and classification of sentiment. This study uses an existing method of Convolution Neural Network (CNN) method, which was introduced again by Alex K. The study by Alex K reduces the error rate by 15%, compared in the previous year the decrease was only 5%. This research would like to propose CNN methods that have been optimized, and use Threshold (CNN-T) to select the best data in training data. This method can produce more than one aspect using one data test. The average result of this experiment using CNN-T got better F-Measure compared to CNN and 3 classic Machine Learning method, i.e. SVM, Naive Bayes, and KNN. The overall F1 score of CNN-T is 0.71, which is greater than the other comparable methods.\",\"PeriodicalId\":6543,\"journal\":{\"name\":\"2018 5th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)\",\"volume\":\"2 1\",\"pages\":\"142-147\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EECSI.2018.8752857\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EECSI.2018.8752857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

在情感分析领域已经做了大量的研究,例如刘兵(2012)[1]的研究。在SemEval竞赛中进行的其他研究,情感分析研究领域已经进一步发展到方面或通常称为基于方面的情感分析(ABSA)[2]。SemEval基于方面的情感分析(ABSA)的领域问题是多种多样的,所有这些问题大多来自于所提供的真实数据。存在的问题包括隐式、多标签、无词汇、表达提取、方面和极性检测等。本研究仅关注分类方面和情感分类。本研究使用了卷积神经网络(CNN)方法的现有方法,该方法由Alex K再次引入,Alex K的研究将错误率降低了15%,而去年的错误率仅为5%。本研究拟提出经过优化的CNN方法,并使用阈值(CNN- t)在训练数据中选择最佳数据。此方法可以使用一个数据测试产生多个方面。与CNN和3种经典的机器学习方法(SVM, Naive Bayes, KNN)相比,本实验使用CNN- t的平均结果得到了更好的F-Measure。CNN-T的F1总分为0.71,高于其他可比较的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Aspect-Based Sentiment Analysis Approach with CNN
Lots of research has been done on the domain of Sentiment Analysis, for example, research that conducted by Bing Liu's (2012) [1]. Other research conducted in a SemEval competition, the domain of sentiment analysis research has been developed further up to the aspect or commonly called Aspect Based Sentiment Analysis (ABSA) [2]. The domain problem of Aspect Based Sentiment Analysis (ABSA) from SemEval is quite diverse, all of those problems arise mostly from the real data provided. Some existing problems include Implicit, Multi-label, Out Of Vocabulary (OOV), Expression extraction, and the detection of aspects and polarities. This research only focuses on classification aspect and classification of sentiment. This study uses an existing method of Convolution Neural Network (CNN) method, which was introduced again by Alex K. The study by Alex K reduces the error rate by 15%, compared in the previous year the decrease was only 5%. This research would like to propose CNN methods that have been optimized, and use Threshold (CNN-T) to select the best data in training data. This method can produce more than one aspect using one data test. The average result of this experiment using CNN-T got better F-Measure compared to CNN and 3 classic Machine Learning method, i.e. SVM, Naive Bayes, and KNN. The overall F1 score of CNN-T is 0.71, which is greater than the other comparable methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信