利用 OpenStreetMap 标签的大语言模型嵌入丰富建筑功能分类

IF 3 4区地球科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Earth Science Informatics Pub Date : 2024-08-27 DOI:10.1007/s12145-024-01463-8

Abdulkadir Memduhoğlu, Nir Fulman, Alexander Zipf

{"title":"利用 OpenStreetMap 标签的大语言模型嵌入丰富建筑功能分类","authors":"Abdulkadir Memduhoğlu, Nir Fulman, Alexander Zipf","doi":"10.1007/s12145-024-01463-8","DOIUrl":null,"url":null,"abstract":"<p>Automated methods for building function classification are essential due to restricted access to official building use data. Existing approaches utilize traditional Natural Language Processing (NLP) techniques to analyze textual data representing human activities, but they struggle with the ambiguity of semantic contexts. In contrast, Large Language Models (LLMs) excel at capturing the broader context of language. This study presents a method that uses LLMs to interpret OpenStreetMap (OSM) tags, combining them with physical and spatial metrics to classify urban building functions. We employed an XGBoost model trained on 32 features from six city datasets to classify urban building functions, demonstrating varying F1 scores from 67.80% in Madrid to 91.59% in Liberec. Integrating LLM embeddings enhanced the model's performance by an average of 12.5% across all cities compared to models using only physical and spatial metrics. Moreover, integrating LLM embeddings improved the model's performance by 6.2% over models that incorporate OSM tags as one-hot encodings, and when predicting based solely on OSM tags, the LLM approach outperforms traditional NLP methods in 5 out of 6 cities. These results suggest that deep contextual understanding, as captured by LLM embeddings more effectively than traditional NLP approaches, is beneficial for classification. Finally, a Pearson correlation coefficient of approximately -0.858 between population density and F1-scores suggests that denser areas present greater classification challenges. Moving forward, we recommend investigation into discrepancies in model performance across and within cities, aiming to identify generalized models.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":"2 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags\",\"authors\":\"Abdulkadir Memduhoğlu, Nir Fulman, Alexander Zipf\",\"doi\":\"10.1007/s12145-024-01463-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Automated methods for building function classification are essential due to restricted access to official building use data. Existing approaches utilize traditional Natural Language Processing (NLP) techniques to analyze textual data representing human activities, but they struggle with the ambiguity of semantic contexts. In contrast, Large Language Models (LLMs) excel at capturing the broader context of language. This study presents a method that uses LLMs to interpret OpenStreetMap (OSM) tags, combining them with physical and spatial metrics to classify urban building functions. We employed an XGBoost model trained on 32 features from six city datasets to classify urban building functions, demonstrating varying F1 scores from 67.80% in Madrid to 91.59% in Liberec. Integrating LLM embeddings enhanced the model's performance by an average of 12.5% across all cities compared to models using only physical and spatial metrics. Moreover, integrating LLM embeddings improved the model's performance by 6.2% over models that incorporate OSM tags as one-hot encodings, and when predicting based solely on OSM tags, the LLM approach outperforms traditional NLP methods in 5 out of 6 cities. These results suggest that deep contextual understanding, as captured by LLM embeddings more effectively than traditional NLP approaches, is beneficial for classification. Finally, a Pearson correlation coefficient of approximately -0.858 between population density and F1-scores suggests that denser areas present greater classification challenges. Moving forward, we recommend investigation into discrepancies in model performance across and within cities, aiming to identify generalized models.</p>\",\"PeriodicalId\":49318,\"journal\":{\"name\":\"Earth Science Informatics\",\"volume\":\"2 1\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Earth Science Informatics\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s12145-024-01463-8\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-024-01463-8","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

由于获取官方建筑使用数据的途径有限，因此建筑功能分类的自动化方法至关重要。现有的方法利用传统的自然语言处理 (NLP) 技术来分析代表人类活动的文本数据，但这些方法难以解决语义上下文的模糊性问题。相比之下，大型语言模型（LLM）则擅长捕捉更广泛的语言语境。本研究介绍了一种使用 LLMs 解释 OpenStreetMap (OSM) 标签的方法，该方法将 LLMs 与物理和空间指标相结合，对城市建筑功能进行分类。我们使用基于六个城市数据集 32 个特征训练的 XGBoost 模型对城市建筑功能进行分类，结果显示 F1 分数从马德里的 67.80% 到利贝雷茨的 91.59% 不等。与仅使用物理和空间指标的模型相比，整合 LLM 嵌入的模型在所有城市的平均性能提高了 12.5%。此外，与将 OSM 标签作为单击编码的模型相比，整合 LLM 嵌入的模型性能提高了 6.2%，而当仅基于 OSM 标签进行预测时，LLM 方法在 6 个城市中的 5 个城市的表现优于传统的 NLP 方法。这些结果表明，LLM 嵌入比传统的 NLP 方法更有效地捕捉到了深层次的上下文理解，有利于分类。最后，人口密度与 F1 分数之间的皮尔逊相关系数约为-0.858，这表明人口密集的地区面临着更大的分类挑战。展望未来，我们建议对城市间和城市内的模型性能差异进行调查，以确定通用模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags

查看原文本刊更多论文

Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags

Automated methods for building function classification are essential due to restricted access to official building use data. Existing approaches utilize traditional Natural Language Processing (NLP) techniques to analyze textual data representing human activities, but they struggle with the ambiguity of semantic contexts. In contrast, Large Language Models (LLMs) excel at capturing the broader context of language. This study presents a method that uses LLMs to interpret OpenStreetMap (OSM) tags, combining them with physical and spatial metrics to classify urban building functions. We employed an XGBoost model trained on 32 features from six city datasets to classify urban building functions, demonstrating varying F1 scores from 67.80% in Madrid to 91.59% in Liberec. Integrating LLM embeddings enhanced the model's performance by an average of 12.5% across all cities compared to models using only physical and spatial metrics. Moreover, integrating LLM embeddings improved the model's performance by 6.2% over models that incorporate OSM tags as one-hot encodings, and when predicting based solely on OSM tags, the LLM approach outperforms traditional NLP methods in 5 out of 6 cities. These results suggest that deep contextual understanding, as captured by LLM embeddings more effectively than traditional NLP approaches, is beneficial for classification. Finally, a Pearson correlation coefficient of approximately -0.858 between population density and F1-scores suggests that denser areas present greater classification challenges. Moving forward, we recommend investigation into discrepancies in model performance across and within cities, aiming to identify generalized models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Earth Science Informatics COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-GEOSCIENCES, MULTIDISCIPLINARY

CiteScore

4.60

自引率

3.60%

发文量

157

审稿时长

4.3 months

期刊介绍： The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.