Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee
{"title":"测量细微差别的步行性:利用ChatGPT的视觉推理与多源空间数据","authors":"Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee","doi":"10.1016/j.compenvurbsys.2025.102319","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.</div></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"121 ","pages":"Article 102319"},"PeriodicalIF":7.1000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Measuring nuanced walkability: Leveraging ChatGPT's vision reasoning with multisource spatial data\",\"authors\":\"Donghwan Ki , Hojun Lee , Keundeok Park , Jaehyun Ha , Sugie Lee\",\"doi\":\"10.1016/j.compenvurbsys.2025.102319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.</div></div>\",\"PeriodicalId\":48241,\"journal\":{\"name\":\"Computers Environment and Urban Systems\",\"volume\":\"121 \",\"pages\":\"Article 102319\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers Environment and Urban Systems\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0198971525000729\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0198971525000729","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
Measuring nuanced walkability: Leveraging ChatGPT's vision reasoning with multisource spatial data
Recent advances in urban analytical tools, particularly street view image (SVI) data and computer vision (CV) algorithms, such as semantic segmentation, have enhanced walkability measurement by enabling the automated assessment of mesoscale features, such as greenery proportions. However, while SVI data contain rich environmental information, off-the-shelf CV models generally struggle to capture microscale features—design details attached to mesoscale elements, such as the quality of greenery or sidewalk maintenance. Moreover, because CV algorithms typically evaluate environmental features in isolation, they often fail to account for spatial arrangements and visual harmony among features, limiting their ability to support a holistic assessment of walkability. Recently, multimodal large language models (MLLMs), particularly ChatGPT, have introduced a transformative approach to image analysis by mimicking human perception. This study proposes a comprehensive walkability measurement framework that leverages ChatGPT's vision reasoning across multiple spatial data, including SVIs and GIS land use and road network maps. To validate this framework, we compare ChatGPT-generated walkability ratings with human assessments and examine their relationship with reported walking behavior data. Furthermore, by comparing ChatGPT-generated outcomes with evaluations from conventional walkability measurement tools, such as GIS and off-the-shelf CV models, we highlight the novel contribution of ChatGPT in walkability assessment. This research advances the literature by introducing a ChatGPT-based framework for a more comprehensive walkability assessment.
期刊介绍:
Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.