Extracting locations from sport and exercise-related social media messages using a neural network-based bilingual toponym recognition model

Impact factor: 1.8; JCR Q2 (Geography)
Pengyuan Liu, Sonja Koivisto, Tuomo Hiippala, Charlotte Van der Lijn, Tuomas Vaisanen, Marisofia Nurmi, T. Toivonen, Kirsi Vehkakoski, Janne Pyykonen, Ilkka Virmasalo, Mikko Simula, Elina Hasanen, Anna-Katriina Salmikangas, P. Muukkonen
Journal of Spatial Information Science. Published 2022-06-20. DOI: 10.5311/josis.2022.24.167 (https://doi.org/10.5311/josis.2022.24.167)
Citations: 3

Abstract

Sport and exercise contribute to health and well-being in cities. While previous research has mainly focused on activities at specific locations such as sport facilities, "informal sport" that occurs at arbitrary locations across the city has been largely neglected. Such activities are more challenging to observe, but this challenge may be addressed using data collected from social media platforms, because social media users regularly generate content related to sports and exercise at given locations. This allows studying all sport, including "informal sport" at arbitrary locations, to better understand sports and exercise-related activities in cities. However, user-generated geographical information available on social media platforms is becoming scarcer and coarser. This places increased emphasis on extracting location information from free-form text content on social media, which is complicated by multilingualism and informal language. To support this effort, this article presents an end-to-end deep learning-based bilingual toponym recognition model for extracting location information from social media content related to sports and exercise. We show that our approach outperforms five state-of-the-art deep learning and machine learning models. We further demonstrate how our model can be deployed in a geoparsing framework to support city planners in promoting healthy and active lifestyles.
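
To make the geoparsing idea concrete, the two stages it involves (recognizing toponyms in free-form text, then resolving them to coordinates) can be sketched as follows. This is only an illustration under stated assumptions, not the model presented in the article: a toy gazetteer lookup stands in for the neural toponym recognizer, and the `GAZETTEER` entries, their approximate coordinates, the example languages, and the function names `recognize` and `geoparse` are all hypothetical.

```python
import re
from typing import List, NamedTuple, Tuple


class Toponym(NamedTuple):
    text: str   # surface form as it appears in the message
    start: int  # character offset where the match begins
    end: int    # character offset just past the match
    lat: float
    lon: float


# Hypothetical toy gazetteer: lower-cased place name -> approximate (lat, lon).
# Entries in two languages point at the same place to mimic bilingual input.
GAZETTEER = {
    "töölönlahti": (60.18, 24.93),
    "töölö bay": (60.18, 24.93),
    "keskuspuisto": (60.22, 24.92),
    "central park": (60.22, 24.92),
}

# Longest names first so multi-word names win over any shorter overlap.
_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(n) for n in sorted(GAZETTEER, key=len, reverse=True))
    + r")\b",
    re.IGNORECASE,
)


def recognize(text: str) -> List[Tuple[str, int, int]]:
    """Stage 1 (stand-in for the neural model): find toponym spans."""
    return [(m.group(0), m.start(), m.end()) for m in _PATTERN.finditer(text)]


def geoparse(text: str) -> List[Toponym]:
    """Stage 2: resolve each recognized span to gazetteer coordinates."""
    results = []
    for surface, start, end in recognize(text):
        lat, lon = GAZETTEER[surface.lower()]
        results.append(Toponym(surface, start, end, lat, lon))
    return results


if __name__ == "__main__":
    message = "Morning run around Töölönlahti, then yoga in Central Park!"
    for toponym in geoparse(message):
        print(toponym)
```

Exact string matching of this kind misses inflected, misspelled, or abbreviated place names, which is one reason informal, multilingual text calls for a learned recognizer in the first stage.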
Source journal metrics: CiteScore 5.10; self-citation rate 0.00%; articles published 5; review time 9 weeks.