Discovering a tourism destination with social media data: BERT-based sentiment analysis

IF 5.3 3区 管理学 Q1 HOSPITALITY, LEISURE, SPORT & TOURISM
M. Viñán-Ludeña, Luis M. de Campos
{"title":"Discovering a tourism destination with social media data: BERT-based sentiment analysis","authors":"M. Viñán-Ludeña, Luis M. de Campos","doi":"10.1108/jhtt-09-2021-0259","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThe main purpose of this paper is to analyze a tourist destination using sentiment analysis techniques with data from Twitter and Instagram to find the most representative entities (or places) and perceptions (or aspects) of the users.\n\n\nDesign/methodology/approach\nThe authors used 90,725 Instagram posts and 235,755 Twitter tweets to analyze tourism in Granada (Spain) to identify the important places and perceptions mentioned by travelers on both social media sites. The authors used several approaches for sentiment classification for English and Spanish texts, including deep learning models.\n\n\nFindings\nThe best results in a test set were obtained using a bidirectional encoder representations from transformers (BERT) model for Spanish texts and Tweeteval for English texts, and these were subsequently used to analyze the data sets. It was then possible to identify the most important entities and aspects, and this, in turn, provided interesting insights for researchers, practitioners, travelers and tourism managers so that services could be improved and better marketing strategies formulated.\n\n\nResearch limitations/implications\nThe authors propose a Spanish-Tourism-BERT model for performing sentiment classification together with a process to find places through hashtags and to reveal the important negative aspects of each place.\n\n\nPractical implications\nThe study enables managers and practitioners to implement the Spanish-BERT model with our Spanish Tourism data set that the authors released for adoption in applications to find both positive and negative perceptions.\n\n\nOriginality/value\nThis study presents a novel approach on how to apply sentiment analysis in the tourism domain. First, the way to evaluate the different existing models and tools is presented; second, a model is trained using BERT (deep learning model); third, an approach of how to identify the acceptance of the places of a destination through hashtags is presented and, finally, the evaluation of why the users express positivity (negativity) through the identification of entities and aspects.\n","PeriodicalId":51611,"journal":{"name":"Journal of Hospitality and Tourism Technology","volume":" ","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hospitality and Tourism Technology","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/jhtt-09-2021-0259","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HOSPITALITY, LEISURE, SPORT & TOURISM","Score":null,"Total":0}
引用次数: 6

Abstract

Purpose The main purpose of this paper is to analyze a tourist destination using sentiment analysis techniques with data from Twitter and Instagram to find the most representative entities (or places) and perceptions (or aspects) of the users. Design/methodology/approach The authors used 90,725 Instagram posts and 235,755 Twitter tweets to analyze tourism in Granada (Spain) to identify the important places and perceptions mentioned by travelers on both social media sites. The authors used several approaches for sentiment classification for English and Spanish texts, including deep learning models. Findings The best results in a test set were obtained using a bidirectional encoder representations from transformers (BERT) model for Spanish texts and Tweeteval for English texts, and these were subsequently used to analyze the data sets. It was then possible to identify the most important entities and aspects, and this, in turn, provided interesting insights for researchers, practitioners, travelers and tourism managers so that services could be improved and better marketing strategies formulated. Research limitations/implications The authors propose a Spanish-Tourism-BERT model for performing sentiment classification together with a process to find places through hashtags and to reveal the important negative aspects of each place. Practical implications The study enables managers and practitioners to implement the Spanish-BERT model with our Spanish Tourism data set that the authors released for adoption in applications to find both positive and negative perceptions. Originality/value This study presents a novel approach on how to apply sentiment analysis in the tourism domain. First, the way to evaluate the different existing models and tools is presented; second, a model is trained using BERT (deep learning model); third, an approach of how to identify the acceptance of the places of a destination through hashtags is presented and, finally, the evaluation of why the users express positivity (negativity) through the identification of entities and aspects.
利用社交媒体数据发现旅游目的地:基于BERT的情绪分析
目的本文的主要目的是使用情绪分析技术,利用Twitter和Instagram的数据分析旅游目的地,以找到用户最具代表性的实体(或地方)和感知(或方面)。设计/方法/方法作者使用90725条Instagram帖子和235755条推特推文分析了格拉纳达(西班牙)的旅游业,以确定游客在两个社交媒体网站上提到的重要地点和看法。作者使用了几种方法对英语和西班牙语文本进行情感分类,包括深度学习模型。发现在测试集中使用双向编码器表示从转换器(BERT)模型(西班牙语文本)和Tweeteval(英语文本)获得最佳结果,随后将其用于分析数据集。然后就可以确定最重要的实体和方面,这反过来又为研究人员、从业者、旅行者和旅游经理提供了有趣的见解,从而可以改进服务并制定更好的营销策略。研究局限性/含义作者提出了一个西班牙旅游BERT模型,用于进行情绪分类,以及通过标签寻找地点并揭示每个地方重要负面方面的过程。实际含义该研究使管理者和从业者能够利用我们的西班牙旅游数据集实施西班牙BERT模型,作者发布了这些数据集供应用程序采用,以发现积极和消极的看法。创意/价值这项研究为如何在旅游领域应用情感分析提供了一种新的方法。首先,提出了对不同的现有模型和工具进行评估的方法;其次,使用BERT(深度学习模型)对模型进行训练;第三,提出了一种如何通过标签识别目的地的接受度的方法,最后,通过识别实体和方面来评估用户为什么表达积极性(消极性)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Hospitality and Tourism Technology
Journal of Hospitality and Tourism Technology HOSPITALITY, LEISURE, SPORT & TOURISM-
CiteScore
8.40
自引率
12.80%
发文量
41
期刊介绍: The Journal of Hospitality and Tourism Technology is the only journal dedicated solely for research in technology and e-business in tourism and hospitality. It is a bridge between academia and industry through the intellectual exchange of ideas, trends and paradigmatic changes in the fields of hospitality, IT and e-business. It covers: -E-Marketplaces, electronic distribution channels, or e-Intermediaries -Internet or e-commerce business models -Self service technologies -E-Procurement -Social dynamics of e-communication -Relationship Development and Retention -E-governance -Security of transactions -Mobile/Wireless technologies in commerce -IT control and preparation for disaster -Virtual reality applications -Word of Mouth. -Cross-Cultural differences in IT use -GPS and Location-based services -Biometric applications -Business intelligence visualization -Radio Frequency Identification applications -Service-Oriented Architecture of business systems -Technology in New Product Development
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信