{"title":"使用支持商业智能的机器学习改进非结构化数据的意见挖掘","authors":"Ruchi Sharma, P. Shrinath","doi":"10.12720/jait.14.4.821-829","DOIUrl":null,"url":null,"abstract":"—There has been an exponential increase in usage of social informatics in recent years. This makes opinion mining more complex, especially for unstructured data available online. Although a substantial amount of research has been conducted on the COVID pandemic, post-pandemic research is lacking. Our research focuses on design and implementation of opinion mining framework for unstructured data input for business intelligence dealing with post pandemic work environment in industries. In this paper, we implement opinion mining algorithm in combination with machine learning approaches providing a hybrid approach. Transformer architecture Bidirectional Encoder Representations from Transformers language model is implemented to obtain sentence level feature vector of the document corpus and t-distributed stochastic neighbor embedding is implemented for clustering experimental evaluation. In this work, performance evaluation is undertaken using the Intertopic Distance map. By applying a hybrid strategy of natural language processing and machine learning, the results of this study indicate efficient framework development and anticipated to contribute to the improvement of efficacy of opinion mining models compared to existing approaches. This research is significant and will benefit businesses in gaining valuable insights that will lead to improved decision-making and business insights.","PeriodicalId":36452,"journal":{"name":"Journal of Advances in Information Technology","volume":"1 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved Opinion Mining for Unstructured Data Using Machine Learning Enabling Business Intelligence\",\"authors\":\"Ruchi Sharma, P. Shrinath\",\"doi\":\"10.12720/jait.14.4.821-829\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"—There has been an exponential increase in usage of social informatics in recent years. This makes opinion mining more complex, especially for unstructured data available online. Although a substantial amount of research has been conducted on the COVID pandemic, post-pandemic research is lacking. Our research focuses on design and implementation of opinion mining framework for unstructured data input for business intelligence dealing with post pandemic work environment in industries. In this paper, we implement opinion mining algorithm in combination with machine learning approaches providing a hybrid approach. Transformer architecture Bidirectional Encoder Representations from Transformers language model is implemented to obtain sentence level feature vector of the document corpus and t-distributed stochastic neighbor embedding is implemented for clustering experimental evaluation. In this work, performance evaluation is undertaken using the Intertopic Distance map. By applying a hybrid strategy of natural language processing and machine learning, the results of this study indicate efficient framework development and anticipated to contribute to the improvement of efficacy of opinion mining models compared to existing approaches. This research is significant and will benefit businesses in gaining valuable insights that will lead to improved decision-making and business insights.\",\"PeriodicalId\":36452,\"journal\":{\"name\":\"Journal of Advances in Information Technology\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Advances in Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.12720/jait.14.4.821-829\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12720/jait.14.4.821-829","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Improved Opinion Mining for Unstructured Data Using Machine Learning Enabling Business Intelligence
—There has been an exponential increase in usage of social informatics in recent years. This makes opinion mining more complex, especially for unstructured data available online. Although a substantial amount of research has been conducted on the COVID pandemic, post-pandemic research is lacking. Our research focuses on design and implementation of opinion mining framework for unstructured data input for business intelligence dealing with post pandemic work environment in industries. In this paper, we implement opinion mining algorithm in combination with machine learning approaches providing a hybrid approach. Transformer architecture Bidirectional Encoder Representations from Transformers language model is implemented to obtain sentence level feature vector of the document corpus and t-distributed stochastic neighbor embedding is implemented for clustering experimental evaluation. In this work, performance evaluation is undertaken using the Intertopic Distance map. By applying a hybrid strategy of natural language processing and machine learning, the results of this study indicate efficient framework development and anticipated to contribute to the improvement of efficacy of opinion mining models compared to existing approaches. This research is significant and will benefit businesses in gaining valuable insights that will lead to improved decision-making and business insights.