{"title":"文本挖掘旅游文学","authors":"Ajda Pretnar, T. Curk","doi":"10.18690/978-961-286-515-3.9","DOIUrl":null,"url":null,"abstract":"Literature reviews are essential for understanding a specific domain as they map the main topics of current re-search. Our aim was to provide a framework for retrieving articles from online databases and analyzing them in a single script. We provide the analytical pipeline as open-source (https://github.com/tourism4-0/BibMine). The main research focus was on analyzing 318 abstracts from scientific papers on tourism and innovation, which we report in Zach et al. (2019). We used LDA topic modeling to uncover ten main topics, which we analyzed using pyLDAvis visualization. We used saliency and relevance scores to determine the main words that de-scribe a topic. The uncovered topics range from climate change and land use to smart destinations, travel expe-riences, and ICT. We performed similar analyses for the term \"stakeholders,\" where we also observed the main verbs related to the query. Since verbs best define an activity, we used them to determine how stakeholders are involved in tourism development. Finally, we analyzed papers with the keyword \"technology,\" where energy efficiency, VR, web technology, and augmented tourist experiences were the main topics.","PeriodicalId":210771,"journal":{"name":"Zbornik konference »Turizem 4.0 in znanost«","volume":"758 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Text Mining Tourism Literature\",\"authors\":\"Ajda Pretnar, T. Curk\",\"doi\":\"10.18690/978-961-286-515-3.9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Literature reviews are essential for understanding a specific domain as they map the main topics of current re-search. Our aim was to provide a framework for retrieving articles from online databases and analyzing them in a single script. We provide the analytical pipeline as open-source (https://github.com/tourism4-0/BibMine). The main research focus was on analyzing 318 abstracts from scientific papers on tourism and innovation, which we report in Zach et al. (2019). We used LDA topic modeling to uncover ten main topics, which we analyzed using pyLDAvis visualization. We used saliency and relevance scores to determine the main words that de-scribe a topic. The uncovered topics range from climate change and land use to smart destinations, travel expe-riences, and ICT. We performed similar analyses for the term \\\"stakeholders,\\\" where we also observed the main verbs related to the query. Since verbs best define an activity, we used them to determine how stakeholders are involved in tourism development. Finally, we analyzed papers with the keyword \\\"technology,\\\" where energy efficiency, VR, web technology, and augmented tourist experiences were the main topics.\",\"PeriodicalId\":210771,\"journal\":{\"name\":\"Zbornik konference »Turizem 4.0 in znanost«\",\"volume\":\"758 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Zbornik konference »Turizem 4.0 in znanost«\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18690/978-961-286-515-3.9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zbornik konference »Turizem 4.0 in znanost«","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18690/978-961-286-515-3.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
文献综述对于理解特定领域是必不可少的,因为它们映射了当前研究的主要主题。我们的目标是提供一个框架,用于从在线数据库中检索文章并在一个脚本中分析它们。我们提供开源的分析管道(https://github.com/tourism4-0/BibMine)。主要研究重点是分析318篇关于旅游和创新的科学论文摘要,我们在Zach et al.(2019)中报道了这些摘要。我们使用LDA主题建模来发现10个主要主题,并使用pyLDAvis可视化对其进行分析。我们使用显著性和相关性分数来确定描述主题的主要词汇。所涉及的主题从气候变化和土地利用到智能目的地、旅游体验和信息通信技术。我们对术语“涉众”执行了类似的分析,我们还观察了与查询相关的主要动词。由于动词最好地定义了一项活动,我们使用它们来确定利益相关者如何参与旅游发展。最后,我们分析了以“技术”为关键词的论文,其中能源效率、虚拟现实、网络技术和增强旅游体验是主要主题。
Literature reviews are essential for understanding a specific domain as they map the main topics of current re-search. Our aim was to provide a framework for retrieving articles from online databases and analyzing them in a single script. We provide the analytical pipeline as open-source (https://github.com/tourism4-0/BibMine). The main research focus was on analyzing 318 abstracts from scientific papers on tourism and innovation, which we report in Zach et al. (2019). We used LDA topic modeling to uncover ten main topics, which we analyzed using pyLDAvis visualization. We used saliency and relevance scores to determine the main words that de-scribe a topic. The uncovered topics range from climate change and land use to smart destinations, travel expe-riences, and ICT. We performed similar analyses for the term "stakeholders," where we also observed the main verbs related to the query. Since verbs best define an activity, we used them to determine how stakeholders are involved in tourism development. Finally, we analyzed papers with the keyword "technology," where energy efficiency, VR, web technology, and augmented tourist experiences were the main topics.