{"title":"推进斯里兰卡的旅游需求预测:评估机器学习模型的性能和社交媒体数据整合的影响","authors":"Isuru Udayangani Hewapathirana","doi":"10.1108/jtf-06-2023-0149","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>This study explores the pioneering approach of utilising machine learning (ML) models and integrating social media data for predicting tourist arrivals in Sri Lanka.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>Two sets of experiments are performed in this research. First, the predictive accuracy of three ML models, support vector regression (SVR), random forest (RF) and artificial neural network (ANN), is compared against the seasonal autoregressive integrated moving average (SARIMA) model using historical tourist arrivals as features. Subsequently, the impact of incorporating social media data from TripAdvisor and Google Trends as additional features is investigated.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>The findings reveal that the ML models generally outperform the SARIMA model, particularly from 2019 to 2021, when several unexpected events occurred in Sri Lanka. When integrating social media data, the RF model performs significantly better during most years, whereas the SVR model does not exhibit significant improvement. Although adding social media data to the ANN model does not yield superior forecasts, it exhibits proficiency in capturing data trends.</p><!--/ Abstract__block -->\n<h3>Practical implications</h3>\n<p>The findings offer substantial implications for the industry's growth and resilience, allowing stakeholders to make accurate data-driven decisions to navigate the unpredictable dynamics of Sri Lanka's tourism sector.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>This study presents the first exploration of ML models and the integration of social media data for forecasting Sri Lankan tourist arrivals, contributing to the advancement of research in this domain.</p><!--/ Abstract__block -->","PeriodicalId":45881,"journal":{"name":"Journal of Tourism Futures","volume":null,"pages":null},"PeriodicalIF":5.8000,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advancing tourism demand forecasting in Sri Lanka: evaluating the performance of machine learning models and the impact of social media data integration\",\"authors\":\"Isuru Udayangani Hewapathirana\",\"doi\":\"10.1108/jtf-06-2023-0149\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3>Purpose</h3>\\n<p>This study explores the pioneering approach of utilising machine learning (ML) models and integrating social media data for predicting tourist arrivals in Sri Lanka.</p><!--/ Abstract__block -->\\n<h3>Design/methodology/approach</h3>\\n<p>Two sets of experiments are performed in this research. First, the predictive accuracy of three ML models, support vector regression (SVR), random forest (RF) and artificial neural network (ANN), is compared against the seasonal autoregressive integrated moving average (SARIMA) model using historical tourist arrivals as features. Subsequently, the impact of incorporating social media data from TripAdvisor and Google Trends as additional features is investigated.</p><!--/ Abstract__block -->\\n<h3>Findings</h3>\\n<p>The findings reveal that the ML models generally outperform the SARIMA model, particularly from 2019 to 2021, when several unexpected events occurred in Sri Lanka. When integrating social media data, the RF model performs significantly better during most years, whereas the SVR model does not exhibit significant improvement. Although adding social media data to the ANN model does not yield superior forecasts, it exhibits proficiency in capturing data trends.</p><!--/ Abstract__block -->\\n<h3>Practical implications</h3>\\n<p>The findings offer substantial implications for the industry's growth and resilience, allowing stakeholders to make accurate data-driven decisions to navigate the unpredictable dynamics of Sri Lanka's tourism sector.</p><!--/ Abstract__block -->\\n<h3>Originality/value</h3>\\n<p>This study presents the first exploration of ML models and the integration of social media data for forecasting Sri Lankan tourist arrivals, contributing to the advancement of research in this domain.</p><!--/ Abstract__block -->\",\"PeriodicalId\":45881,\"journal\":{\"name\":\"Journal of Tourism Futures\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2023-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Tourism Futures\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1108/jtf-06-2023-0149\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HOSPITALITY, LEISURE, SPORT & TOURISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Tourism Futures","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/jtf-06-2023-0149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HOSPITALITY, LEISURE, SPORT & TOURISM","Score":null,"Total":0}
引用次数: 0
摘要
目的本研究探讨了利用机器学习(ML)模型和社交媒体数据预测斯里兰卡游客抵达人数的开创性方法。首先,将支持向量回归(SVR)、随机森林(RF)和人工神经网络(ANN)这三种 ML 模型的预测准确性与季节性自回归综合移动平均(SARIMA)模型进行了比较,后者以历史游客抵达人数为特征。结果研究结果表明,ML 模型总体上优于 SARIMA 模型,尤其是在 2019 年至 2021 年期间,斯里兰卡发生了多起突发事件。在整合社交媒体数据时,RF 模型在大多数年份的表现明显更好,而 SVR 模型则没有明显改善。虽然将社交媒体数据添加到 ANN 模型中并不会产生更优越的预测结果,但它在捕捉数据趋势方面表现出了一定的能力。原创性/价值本研究首次探索了 ML 模型和社交媒体数据的整合,用于预测斯里兰卡游客抵达人数,为推动该领域的研究做出了贡献。
Advancing tourism demand forecasting in Sri Lanka: evaluating the performance of machine learning models and the impact of social media data integration
Purpose
This study explores the pioneering approach of utilising machine learning (ML) models and integrating social media data for predicting tourist arrivals in Sri Lanka.
Design/methodology/approach
Two sets of experiments are performed in this research. First, the predictive accuracy of three ML models, support vector regression (SVR), random forest (RF) and artificial neural network (ANN), is compared against the seasonal autoregressive integrated moving average (SARIMA) model using historical tourist arrivals as features. Subsequently, the impact of incorporating social media data from TripAdvisor and Google Trends as additional features is investigated.
Findings
The findings reveal that the ML models generally outperform the SARIMA model, particularly from 2019 to 2021, when several unexpected events occurred in Sri Lanka. When integrating social media data, the RF model performs significantly better during most years, whereas the SVR model does not exhibit significant improvement. Although adding social media data to the ANN model does not yield superior forecasts, it exhibits proficiency in capturing data trends.
Practical implications
The findings offer substantial implications for the industry's growth and resilience, allowing stakeholders to make accurate data-driven decisions to navigate the unpredictable dynamics of Sri Lanka's tourism sector.
Originality/value
This study presents the first exploration of ML models and the integration of social media data for forecasting Sri Lankan tourist arrivals, contributing to the advancement of research in this domain.