Improved Opinion Mining for Unstructured Data Using Machine Learning Enabling Business Intelligence

IF 1.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of Advances in Information Technology Pub Date : 2023-01-01 DOI:10.12720/jait.14.4.821-829

Ruchi Sharma, P. Shrinath

{"title":"Improved Opinion Mining for Unstructured Data Using Machine Learning Enabling Business Intelligence","authors":"Ruchi Sharma, P. Shrinath","doi":"10.12720/jait.14.4.821-829","DOIUrl":null,"url":null,"abstract":"—There has been an exponential increase in usage of social informatics in recent years. This makes opinion mining more complex, especially for unstructured data available online. Although a substantial amount of research has been conducted on the COVID pandemic, post-pandemic research is lacking. Our research focuses on design and implementation of opinion mining framework for unstructured data input for business intelligence dealing with post pandemic work environment in industries. In this paper, we implement opinion mining algorithm in combination with machine learning approaches providing a hybrid approach. Transformer architecture Bidirectional Encoder Representations from Transformers language model is implemented to obtain sentence level feature vector of the document corpus and t-distributed stochastic neighbor embedding is implemented for clustering experimental evaluation. In this work, performance evaluation is undertaken using the Intertopic Distance map. By applying a hybrid strategy of natural language processing and machine learning, the results of this study indicate efficient framework development and anticipated to contribute to the improvement of efficacy of opinion mining models compared to existing approaches. This research is significant and will benefit businesses in gaining valuable insights that will lead to improved decision-making and business insights.","PeriodicalId":36452,"journal":{"name":"Journal of Advances in Information Technology","volume":"1 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12720/jait.14.4.821-829","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

—There has been an exponential increase in usage of social informatics in recent years. This makes opinion mining more complex, especially for unstructured data available online. Although a substantial amount of research has been conducted on the COVID pandemic, post-pandemic research is lacking. Our research focuses on design and implementation of opinion mining framework for unstructured data input for business intelligence dealing with post pandemic work environment in industries. In this paper, we implement opinion mining algorithm in combination with machine learning approaches providing a hybrid approach. Transformer architecture Bidirectional Encoder Representations from Transformers language model is implemented to obtain sentence level feature vector of the document corpus and t-distributed stochastic neighbor embedding is implemented for clustering experimental evaluation. In this work, performance evaluation is undertaken using the Intertopic Distance map. By applying a hybrid strategy of natural language processing and machine learning, the results of this study indicate efficient framework development and anticipated to contribute to the improvement of efficacy of opinion mining models compared to existing approaches. This research is significant and will benefit businesses in gaining valuable insights that will lead to improved decision-making and business insights.

查看原文本刊更多论文

使用支持商业智能的机器学习改进非结构化数据的意见挖掘

近年来，社会信息学的使用呈指数级增长。这使得意见挖掘变得更加复杂，特别是对于在线可用的非结构化数据。虽然对COVID大流行进行了大量研究，但缺乏大流行后的研究。我们的研究重点是为非结构化数据输入的意见挖掘框架的设计和实现，用于处理疫情后工业工作环境的商业智能。在本文中，我们将意见挖掘算法与机器学习方法相结合，提供了一种混合方法。实现了Transformer语言模型的双向编码器表示来获取文档语料库的句子级特征向量，并实现了t分布随机邻居嵌入来进行聚类实验评价。在这项工作中，使用主题间距离图进行绩效评估。通过应用自然语言处理和机器学习的混合策略，本研究的结果表明，与现有方法相比，有效的框架开发和期望有助于提高意见挖掘模型的有效性。这项研究意义重大，将有利于企业获得有价值的见解，从而改进决策和业务见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Advances in Information Technology Computer Science-Information Systems

CiteScore

4.20

自引率

20.00%

发文量