测量细微差别的步行性：利用ChatGPT的视觉推理与多源空间数据

IF 8.3 1区地球科学 Q1 ENVIRONMENTAL STUDIES

Computers Environment and Urban Systems Pub Date : 2025-06-18 DOI:10.1016/j.compenvurbsys.2025.102319

Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee

{"title":"测量细微差别的步行性：利用ChatGPT的视觉推理与多源空间数据","authors":"Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee","doi":"10.1016/j.compenvurbsys.2025.102319","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.</div></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"121 ","pages":"Article 102319"},"PeriodicalIF":8.3000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Measuring nuanced walkability: Leveraging ChatGPT's vision reasoning with multisource spatial data\",\"authors\":\"Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee\",\"doi\":\"10.1016/j.compenvurbsys.2025.102319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.</div></div>\",\"PeriodicalId\":48241,\"journal\":{\"name\":\"Computers Environment and Urban Systems\",\"volume\":\"121 \",\"pages\":\"Article 102319\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers Environment and Urban Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0198971525000729\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0198971525000729","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}

引用次数: 0

摘要

城市分析工具的最新进展，特别是街景图像（SVI）数据和计算机视觉（CV）算法，如语义分割，通过自动评估中尺度特征（如绿化比例），增强了步行性测量。然而，虽然SVI数据包含丰富的环境信息，但现成的CV模型通常难以捕捉到与中尺度元素（如绿化质量或人行道维护）相关的微尺度特征-设计细节。此外，由于CV算法通常孤立地评估环境特征，它们往往无法考虑空间安排和特征之间的视觉和谐，从而限制了它们支持可步行性整体评估的能力。最近，多模态大型语言模型（mllm），特别是ChatGPT，通过模仿人类感知引入了一种变革性的图像分析方法。本研究提出了一个综合的步行性测量框架，该框架利用ChatGPT在多个空间数据中的视觉推理，包括svi和GIS土地利用和道路网络地图。为了验证这一框架，我们将chatgpt生成的步行性评级与人类评估进行了比较，并检查了它们与报告的步行行为数据的关系。此外，通过将ChatGPT生成的结果与传统的步行性测量工具（如GIS和现成的CV模型）的评估结果进行比较，我们强调了ChatGPT在步行性评估中的新贡献。本研究通过引入基于chatgpt的框架来进行更全面的步行性评估，从而推动了文献的发展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Measuring nuanced walkability: Leveraging ChatGPT's vision reasoning with multisource spatial data

Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers Environment and Urban Systems Multiple-

CiteScore

13.30

自引率

7.40%

发文量

111

审稿时长

32 days

期刊介绍： Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.