Mining Tourists’ Opinions on Popular Indian Tourism Hotspots using Sentiment Analysis and Topic Modeling

Shefali Singh, Tureen Chauhan, Vibhas Wahi, P. Meel
{"title":"Mining Tourists’ Opinions on Popular Indian Tourism Hotspots using Sentiment Analysis and Topic Modeling","authors":"Shefali Singh, Tureen Chauhan, Vibhas Wahi, P. Meel","doi":"10.1109/ICCMC51019.2021.9418341","DOIUrl":null,"url":null,"abstract":"User-generated content is an exploration area of interest with regards to web 2.0. The development of social networks and community-based websites have changed the manner in which individuals utilize the Internet. It makes individuals no longer restricted to pursuing the data given by professional channels, but to making individual profiles, producing personalized content, or sharing photographs, recordings, blogs, and so forth. This sort of data comprises the current online user-generated content. With the continuous development of the travel industry, the quantity of online travel review websites has also increased. Indian Tourism is popular for its rich culture and diversity and hence Government of India has increased the number of new tourist destinations to expand their popularity and presence. Researchers have proposed various studies to increase tourism network using Big Data. Techniques of Sentiment Analysis along with Topic Modelling have been used to unearth patterns and observations from online reviews. This paper aims to mine reviews of 10 popular travel destinations in India. Using sentiment analysis technique, the proposed research work has explored the polarity of various reviews extracted from TripAdvisor. Data collection was done by using the web framework Scrapy to acquire more than 10,000 reviews for these destinations. This paper also analyzes the result of doing Topic Modeling on reviews for individual destinations. Results conclude that Joy is the most common emotion in all the visitor’s experiences. Indian tourism decision quality can be improved by the help of the results from this study.","PeriodicalId":131747,"journal":{"name":"2021 5th International Conference on Computing Methodologies and Communication (ICCMC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th International Conference on Computing Methodologies and Communication (ICCMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCMC51019.2021.9418341","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

User-generated content is an exploration area of interest with regards to web 2.0. The development of social networks and community-based websites have changed the manner in which individuals utilize the Internet. It makes individuals no longer restricted to pursuing the data given by professional channels, but to making individual profiles, producing personalized content, or sharing photographs, recordings, blogs, and so forth. This sort of data comprises the current online user-generated content. With the continuous development of the travel industry, the quantity of online travel review websites has also increased. Indian Tourism is popular for its rich culture and diversity and hence Government of India has increased the number of new tourist destinations to expand their popularity and presence. Researchers have proposed various studies to increase tourism network using Big Data. Techniques of Sentiment Analysis along with Topic Modelling have been used to unearth patterns and observations from online reviews. This paper aims to mine reviews of 10 popular travel destinations in India. Using sentiment analysis technique, the proposed research work has explored the polarity of various reviews extracted from TripAdvisor. Data collection was done by using the web framework Scrapy to acquire more than 10,000 reviews for these destinations. This paper also analyzes the result of doing Topic Modeling on reviews for individual destinations. Results conclude that Joy is the most common emotion in all the visitor’s experiences. Indian tourism decision quality can be improved by the help of the results from this study.
利用情感分析和话题建模挖掘游客对印度热门旅游热点的看法
关于web 2.0,用户生成内容是一个有趣的探索领域。社交网络和社区网站的发展改变了个人使用互联网的方式。它使个人不再局限于追求专业渠道提供的数据,而是创建个人档案,生产个性化内容,或共享照片、录音、博客等。这类数据包括当前在线用户生成的内容。随着旅游业的不断发展,在线旅游点评网站的数量也在不断增加。印度旅游业因其丰富的文化和多样性而广受欢迎,因此印度政府增加了新的旅游目的地的数量,以扩大其知名度和存在。研究人员提出了各种研究,以利用大数据增加旅游网络。情感分析和主题建模技术已被用于从在线评论中挖掘模式和观察结果。本文旨在挖掘印度10个热门旅游目的地的评论。利用情感分析技术,本研究探索了从TripAdvisor提取的各种评论的极性。数据收集是通过使用web框架Scrapy来获取这些目的地的10,000多条评论。本文还分析了对各个目的地的评论进行主题建模的结果。结果表明,在所有游客的体验中,快乐是最常见的情绪。本文的研究结果有助于提高印度旅游决策质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信