Weirong Li, Kai Sun, Shu Wang, Yunqiang Zhu, Xiaoliang Dai, Lei Hu
{"title":"DePNR:基于 DeBERTa 的深度学习模型,具有完整的位置嵌入,可用于地理文献中的地名识别","authors":"Weirong Li, Kai Sun, Shu Wang, Yunqiang Zhu, Xiaoliang Dai, Lei Hu","doi":"10.1111/tgis.13170","DOIUrl":null,"url":null,"abstract":"Place names play an important role in linking physical places to human perception and are highly frequently used in the daily lives of people to refer to places in natural language. However, many place names may not be recorded in typical gazetteers due to their new establishment, colloquial nature, and different concerns. These unrecorded toponyms are often discussed in geographical literature; thus, it is necessary to automatically identify them from geographical literature and update existing gazetteers using computational approaches. Currently, the most advanced approaches are deep learning‐based models. However, existing models used only partial position information rather than complete position information of words in a sentence, which limits their performance in recognizing toponyms. To this end, we develop DePNR, a DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature. We train DePNR on two datasets and test it on a real dataset from geographical literature to evaluate its performance. The results show that DePNR achieves an <jats:italic>F</jats:italic>‐score of 0.8282, outperforming previous approaches, and can recognize new toponyms from literature text, potentially enriching existing gazetteers.","PeriodicalId":47842,"journal":{"name":"Transactions in GIS","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DePNR: A DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature\",\"authors\":\"Weirong Li, Kai Sun, Shu Wang, Yunqiang Zhu, Xiaoliang Dai, Lei Hu\",\"doi\":\"10.1111/tgis.13170\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Place names play an important role in linking physical places to human perception and are highly frequently used in the daily lives of people to refer to places in natural language. However, many place names may not be recorded in typical gazetteers due to their new establishment, colloquial nature, and different concerns. These unrecorded toponyms are often discussed in geographical literature; thus, it is necessary to automatically identify them from geographical literature and update existing gazetteers using computational approaches. Currently, the most advanced approaches are deep learning‐based models. However, existing models used only partial position information rather than complete position information of words in a sentence, which limits their performance in recognizing toponyms. To this end, we develop DePNR, a DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature. We train DePNR on two datasets and test it on a real dataset from geographical literature to evaluate its performance. The results show that DePNR achieves an <jats:italic>F</jats:italic>‐score of 0.8282, outperforming previous approaches, and can recognize new toponyms from literature text, potentially enriching existing gazetteers.\",\"PeriodicalId\":47842,\"journal\":{\"name\":\"Transactions in GIS\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transactions in GIS\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1111/tgis.13170\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOGRAPHY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transactions in GIS","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1111/tgis.13170","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0
摘要
地名在将自然地点与人类感知联系起来方面发挥着重要作用,在人们的日常生活中被频繁使用,以自然语言指代地点。然而,许多地名由于其新建立、口语化和不同的关注点,可能没有被记录在典型的地名录中。这些未记录的地名经常在地理文献中被讨论;因此,有必要使用计算方法从地理文献中自动识别这些地名并更新现有地名录。目前,最先进的方法是基于深度学习的模型。然而,现有模型仅使用了部分位置信息,而非单词在句子中的完整位置信息,这限制了其识别地名的性能。为此,我们开发了基于 DeBERTa 的深度学习模型 DePNR,该模型具有完整的位置嵌入,可用于地理文献中的地名识别。我们在两个数据集上对 DePNR 进行了训练,并在地理文献的真实数据集上对其进行了测试,以评估其性能。结果表明,DePNR 的 F 分数达到 0.8282,优于之前的方法,并且可以从文献文本中识别新地名,从而丰富现有的地名录。
DePNR: A DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature
Place names play an important role in linking physical places to human perception and are highly frequently used in the daily lives of people to refer to places in natural language. However, many place names may not be recorded in typical gazetteers due to their new establishment, colloquial nature, and different concerns. These unrecorded toponyms are often discussed in geographical literature; thus, it is necessary to automatically identify them from geographical literature and update existing gazetteers using computational approaches. Currently, the most advanced approaches are deep learning‐based models. However, existing models used only partial position information rather than complete position information of words in a sentence, which limits their performance in recognizing toponyms. To this end, we develop DePNR, a DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature. We train DePNR on two datasets and test it on a real dataset from geographical literature to evaluate its performance. The results show that DePNR achieves an F‐score of 0.8282, outperforming previous approaches, and can recognize new toponyms from literature text, potentially enriching existing gazetteers.
期刊介绍:
Transactions in GIS is an international journal which provides a forum for high quality, original research articles, review articles, short notes and book reviews that focus on: - practical and theoretical issues influencing the development of GIS - the collection, analysis, modelling, interpretation and display of spatial data within GIS - the connections between GIS and related technologies - new GIS applications which help to solve problems affecting the natural or built environments, or business