Big Data Analytics Based Sentiment Analysis Using Superior Expectation-Maximization Vector Neural Network in Tourism

Chingakham Nirma Devi, R. Renuga Devi
{"title":"Big Data Analytics Based Sentiment Analysis Using Superior Expectation-Maximization Vector Neural Network in Tourism","authors":"Chingakham Nirma Devi, R. Renuga Devi","doi":"10.1109/ICCMC53470.2022.9753738","DOIUrl":null,"url":null,"abstract":"Tourism experience shared through social media has become a highly influential source of information and has a multi-faceted impact on tourism. With the vast development of the Internet, text data has become one of the leading formats of big tourism data. Text analytics of such data has great potential to express tourists' opinions effectively. Sentiment analysis is an essential component of tourism big data because it can detect positive and negative opinions in texts. Tourist comments are essential for the development of tourism but still, the number of comments complicates the analysis of essential aspects of the comments by the owner. Big data-based sentiment analysis is one of the most challenging problems globally, and the amount of data is enormous. To resolve this problem, the proposed big data approaches can help detect new words, especially with sentiment analysis and detection of proper nouns and emotional words useful for subsequent tasks as word vectors. The proposed system follows the three steps: text analysis and cleaning, Word vector similarity analysis, and final sentiment classification. First step is used to remove the noise of the data and detect the symbols. The next step is the ID3 (Iterative Dichotomiser) Maximum Word Vector Dimensionality Posteriorl method, which discovers all travel review corpora's main problem and uses it to enrich the vocabulary vector representation of words in context. Attention mechanisms are used to learn words and the overall meaning of different weights text attributes. According to the classification, the final Superior Expectation-Maximization Vector Neural Network (SEMVNN) is used for classifying sentiment analysis level. The SEMVNN method gives accuracy, time complexity, precision, recall and F-measure values to achieve better results than the previous system.","PeriodicalId":345346,"journal":{"name":"2022 6th International Conference on Computing Methodologies and Communication (ICCMC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Computing Methodologies and Communication (ICCMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCMC53470.2022.9753738","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Tourism experience shared through social media has become a highly influential source of information and has a multi-faceted impact on tourism. With the vast development of the Internet, text data has become one of the leading formats of big tourism data. Text analytics of such data has great potential to express tourists' opinions effectively. Sentiment analysis is an essential component of tourism big data because it can detect positive and negative opinions in texts. Tourist comments are essential for the development of tourism but still, the number of comments complicates the analysis of essential aspects of the comments by the owner. Big data-based sentiment analysis is one of the most challenging problems globally, and the amount of data is enormous. To resolve this problem, the proposed big data approaches can help detect new words, especially with sentiment analysis and detection of proper nouns and emotional words useful for subsequent tasks as word vectors. The proposed system follows the three steps: text analysis and cleaning, Word vector similarity analysis, and final sentiment classification. First step is used to remove the noise of the data and detect the symbols. The next step is the ID3 (Iterative Dichotomiser) Maximum Word Vector Dimensionality Posteriorl method, which discovers all travel review corpora's main problem and uses it to enrich the vocabulary vector representation of words in context. Attention mechanisms are used to learn words and the overall meaning of different weights text attributes. According to the classification, the final Superior Expectation-Maximization Vector Neural Network (SEMVNN) is used for classifying sentiment analysis level. The SEMVNN method gives accuracy, time complexity, precision, recall and F-measure values to achieve better results than the previous system.
基于大数据分析的旅游情感分析——基于优期望最大化向量神经网络
通过社交媒体分享的旅游体验已成为极具影响力的信息来源,对旅游业产生了多方面的影响。随着互联网的迅猛发展,文本数据已经成为旅游大数据的主要形式之一。对这些数据进行文本分析,可以有效地表达游客的意见。情感分析是旅游大数据的重要组成部分,因为它可以检测文本中的积极和消极观点。游客评论对旅游业的发展至关重要,但评论的数量仍然使业主对评论本质方面的分析复杂化。基于大数据的情感分析是全球最具挑战性的问题之一,数据量巨大。为了解决这个问题,提出的大数据方法可以帮助检测新词,特别是情感分析和专有名词和情感词的检测,这些词作为词向量对后续任务很有用。该系统分为三个步骤:文本分析和清理,词向量相似度分析,最后进行情感分类。第一步是去除数据中的噪声并检测符号。下一步是ID3(迭代二分法)最大词向量维数后验方法,该方法发现所有旅游评论语料库的主要问题,并用它来丰富单词在上下文中的词汇向量表示。注意机制用于学习单词和不同权重文本属性的整体含义。根据分类结果,利用最终的超期望最大化向量神经网络(SEMVNN)对情感分析水平进行分类。SEMVNN方法给出了准确度、时间复杂度、精密度、召回率和f测量值,取得了比以前系统更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信