测量细微差别的步行性:利用ChatGPT的视觉推理与多源空间数据

IF 7.1 1区 地球科学 Q1 ENVIRONMENTAL STUDIES
Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee
{"title":"测量细微差别的步行性:利用ChatGPT的视觉推理与多源空间数据","authors":"Donghwan Ki ,&nbsp;Hojun Lee ,&nbsp;Keundeok Park ,&nbsp;Jaehyun Ha ,&nbsp;Sugie Lee","doi":"10.1016/j.compenvurbsys.2025.102319","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.</div></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"121 ","pages":"Article 102319"},"PeriodicalIF":7.1000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Measuring nuanced walkability: Leveraging ChatGPT's vision reasoning with multisource spatial data\",\"authors\":\"Donghwan Ki ,&nbsp;Hojun Lee ,&nbsp;Keundeok Park ,&nbsp;Jaehyun Ha ,&nbsp;Sugie Lee\",\"doi\":\"10.1016/j.compenvurbsys.2025.102319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.</div></div>\",\"PeriodicalId\":48241,\"journal\":{\"name\":\"Computers Environment and Urban Systems\",\"volume\":\"121 \",\"pages\":\"Article 102319\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers Environment and Urban Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0198971525000729\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0198971525000729","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
引用次数: 0

摘要

城市分析工具的最新进展,特别是街景图像(SVI)数据和计算机视觉(CV)算法,如语义分割,通过自动评估中尺度特征(如绿化比例),增强了步行性测量。然而,虽然SVI数据包含丰富的环境信息,但现成的CV模型通常难以捕捉到与中尺度元素(如绿化质量或人行道维护)相关的微尺度特征-设计细节。此外,由于CV算法通常孤立地评估环境特征,它们往往无法考虑空间安排和特征之间的视觉和谐,从而限制了它们支持可步行性整体评估的能力。最近,多模态大型语言模型(mllm),特别是ChatGPT,通过模仿人类感知引入了一种变革性的图像分析方法。本研究提出了一个综合的步行性测量框架,该框架利用ChatGPT在多个空间数据中的视觉推理,包括svi和GIS土地利用和道路网络地图。为了验证这一框架,我们将chatgpt生成的步行性评级与人类评估进行了比较,并检查了它们与报告的步行行为数据的关系。此外,通过将ChatGPT生成的结果与传统的步行性测量工具(如GIS和现成的CV模型)的评估结果进行比较,我们强调了ChatGPT在步行性评估中的新贡献。本研究通过引入基于chatgpt的框架来进行更全面的步行性评估,从而推动了文献的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Measuring nuanced walkability: Leveraging ChatGPT's vision reasoning with multisource spatial data
Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
13.30
自引率
7.40%
发文量
111
审稿时长
32 days
期刊介绍: Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信