DePNR: A DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature
Weirong Li, Kai Sun, Shu Wang, Yunqiang Zhu, Xiaoliang Dai, Lei Hu
Transactions in GIS, published 2024-05-03. DOI: 10.1111/tgis.13170 (https://doi.org/10.1111/tgis.13170)
Abstract
Place names link physical places to human perception and are used frequently in everyday language to refer to places. However, many place names are not recorded in typical gazetteers because they are newly established, colloquial, or fall outside the concerns of a given gazetteer. Such unrecorded toponyms are often discussed in geographical literature, so computational approaches are needed to identify them automatically in that literature and to update existing gazetteers. The most advanced approaches at present are deep learning‐based models. However, existing models use only partial rather than complete position information for the words in a sentence, which limits their performance in recognizing toponyms. To address this, we develop DePNR, a DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature. We train DePNR on two datasets and test it on a real dataset from geographical literature to evaluate its performance. The results show that DePNR achieves an F‐score of 0.8282, outperforming previous approaches, and can recognize new toponyms from literature text, potentially enriching existing gazetteers.
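For readers unfamiliar with how an F‐score is usually obtained in place name recognition, the following is a minimal sketch of span-level scoring over BIO tag sequences. The abstract does not state the exact evaluation protocol used for DePNR, so the exact-match criterion, the `B-LOC`/`I-LOC`/`O` label names, and the helper names `extract_spans` and `f_score` are assumptions made purely for illustration.

```python
from typing import List, Set, Tuple

Span = Tuple[int, int]  # (start index, end index exclusive)


def extract_spans(tags: List[str]) -> Set[Span]:
    """Collect toponym spans from a BIO sequence (assumed labels: B-LOC, I-LOC, O)."""
    spans: Set[Span] = set()
    start = None
    for i, tag in enumerate(tags):
        if tag == "B-LOC":
            if start is not None:      # close the previous span first
                spans.add((start, i))
            start = i
        elif tag == "I-LOC":
            if start is None:          # lenient: a stray I-LOC opens a span
                start = i
        else:                          # "O" closes any open span
            if start is not None:
                spans.add((start, i))
                start = None
    if start is not None:              # span running to the end of the sentence
        spans.add((start, len(tags)))
    return spans


def f_score(gold: List[str], pred: List[str]) -> float:
    """Exact-match span F-score: harmonic mean of precision and recall."""
    gold_spans = extract_spans(gold)
    pred_spans = extract_spans(pred)
    true_pos = len(gold_spans & pred_spans)
    precision = true_pos / len(pred_spans) if pred_spans else 0.0
    recall = true_pos / len(gold_spans) if gold_spans else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy sentence: "The Mekong Delta lies south of Ho Chi Minh City"
gold = ["O", "B-LOC", "I-LOC", "O", "O", "O", "B-LOC", "I-LOC", "I-LOC", "I-LOC"]
pred = ["O", "B-LOC", "I-LOC", "O", "O", "O", "B-LOC", "I-LOC", "I-LOC", "O"]
print(f_score(gold, pred))
```

In this toy case one of the two predicted spans matches a gold span exactly, so precision and recall are both 0.5. Under exact matching, a prediction that truncates a multi-word toponym counts as both a false positive and a false negative.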
About the journal:
Transactions in GIS is an international journal which provides a forum for high quality, original research articles, review articles, short notes and book reviews that focus on:
- practical and theoretical issues influencing the development of GIS
- the collection, analysis, modelling, interpretation and display of spatial data within GIS
- the connections between GIS and related technologies
- new GIS applications which help to solve problems affecting the natural or built environments, or business