Unsupervised Aspect Extraction Algorithm for opinion mining using topic modeling

Azizkhan F Pathan , Chetana Prakash
{"title":"Unsupervised Aspect Extraction Algorithm for opinion mining using topic modeling","authors":"Azizkhan F Pathan ,&nbsp;Chetana Prakash","doi":"10.1016/j.gltp.2021.08.005","DOIUrl":null,"url":null,"abstract":"<div><p>With the massive use of electronic gadgets and the developing fame of web-based media, a great deal of text information is being produced at the rate never observed. It is not feasible for people to pursue all information produced and discover what is being investigated in their area of interest. To determine topics in large textual documents Topic modeling is used. Topic Modeling Algorithms are Unsupervised Machine Learning approaches which are widely used and have proven to be successful in the area of Aspect-based Opinion Mining to extract ‘latent’ topics, which are aspects of interest. In this paper, the approaches that are widely used for topic modeling are examined and compared to find their importance in detecting topics based on metrics such as Perplexity and Coherence. As a result, Latent Dirichlet Allocation is a good topic modeling algorithm compared to Latent Semantic Analysis and Hierarchical Dirichlet Process for aspect extraction process in aspect-based opinion mining. Also, we have proposed an unsupervised aspect extraction algorithm based on topic models for Aspect-based Opinion mining.</p></div>","PeriodicalId":100588,"journal":{"name":"Global Transitions Proceedings","volume":"2 2","pages":"Pages 492-499"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.gltp.2021.08.005","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Transitions Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666285X21000339","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

With the massive use of electronic gadgets and the developing fame of web-based media, a great deal of text information is being produced at the rate never observed. It is not feasible for people to pursue all information produced and discover what is being investigated in their area of interest. To determine topics in large textual documents Topic modeling is used. Topic Modeling Algorithms are Unsupervised Machine Learning approaches which are widely used and have proven to be successful in the area of Aspect-based Opinion Mining to extract ‘latent’ topics, which are aspects of interest. In this paper, the approaches that are widely used for topic modeling are examined and compared to find their importance in detecting topics based on metrics such as Perplexity and Coherence. As a result, Latent Dirichlet Allocation is a good topic modeling algorithm compared to Latent Semantic Analysis and Hierarchical Dirichlet Process for aspect extraction process in aspect-based opinion mining. Also, we have proposed an unsupervised aspect extraction algorithm based on topic models for Aspect-based Opinion mining.

基于主题建模的意见挖掘无监督方面提取算法
随着电子产品的大量使用和网络媒体的发展,大量的文字信息正在以前所未有的速度产生。人们不可能追求所有产生的信息,并发现在他们感兴趣的领域正在调查什么。为了确定大型文本文档中的主题,需要使用主题建模。主题建模算法是一种被广泛使用的无监督机器学习方法,并且在基于方面的意见挖掘领域被证明是成功的,可以提取“潜在”主题,即感兴趣的方面。在本文中,对广泛用于主题建模的方法进行了检查和比较,以发现它们在基于诸如Perplexity和Coherence等度量来检测主题方面的重要性。结果表明,在基于方面的意见挖掘中,相对于潜在语义分析和层次狄利克雷过程,潜在狄利克雷分配是一种较好的主题建模算法。此外,我们还提出了一种基于主题模型的无监督方面提取算法,用于基于方面的意见挖掘。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信