{"title":"A flexible framework for identifying urban villages using Sentinel-2 observations and deep learning","authors":"Yizhen Wu , Xi Li , Yu Gong , Dennis Mwaniki","doi":"10.1016/j.jag.2025.104631","DOIUrl":null,"url":null,"abstract":"<div><div>Identifying urban villages (UVs) is essential for supporting urban renewal and governance, as these informal areas present significant challenges to urban sustainability. Existing multi-modal deep-learning studies have accurately identified UVs in individual cities; however, their scalability is constrained by substantial computational demands and data availability. Currently, efficient deep-learning models tailored for large-scale, consistent UV maps using freely released Sentinel-2 data remain underexplored. Therefore, we proposed a flexible framework leveraging multiple Sentinel-2 observations and deep-learning models to cost-effectively support large-scale UV mapping. We specifically employed a dual-branch encoder to consolidate discriminative features: one branch extracted static information from annual images, while the other captured seasonal dynamics induced by building shadows across four seasonal images. These features were then jointly represented using fusion modules and used for UV identification through skip connections. The model was trained and tested on self-compiled Guangzhou-Shenzhen samples, with the Pearl River Delta serving to analyze applicability and transferability. Results demonstrated incorporating dynamic information notably enhanced the accuracy of UV identification. The proposed framework exhibited optimal performance (mIoU: 83.33 %) with robust spatial generalization, surpassing traditional methods (mIoU improvements: 0.46–12.49 %). Compared to multi-modal data-driven studies, our framework can be conveniently implemented in complex urban environments while maintaining both efficiency and accuracy in large-scale applications. This cost-effective solution offers high-quality, large-scale UV mapping, thereby advancing UV management and planning efforts.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"141 ","pages":"Article 104631"},"PeriodicalIF":7.6000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156984322500278X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying urban villages (UVs) is essential for supporting urban renewal and governance, as these informal areas present significant challenges to urban sustainability. Existing multi-modal deep-learning studies have accurately identified UVs in individual cities; however, their scalability is constrained by substantial computational demands and data availability. Currently, efficient deep-learning models tailored for large-scale, consistent UV maps using freely released Sentinel-2 data remain underexplored. Therefore, we proposed a flexible framework leveraging multiple Sentinel-2 observations and deep-learning models to cost-effectively support large-scale UV mapping. We specifically employed a dual-branch encoder to consolidate discriminative features: one branch extracted static information from annual images, while the other captured seasonal dynamics induced by building shadows across four seasonal images. These features were then jointly represented using fusion modules and used for UV identification through skip connections. The model was trained and tested on self-compiled Guangzhou-Shenzhen samples, with the Pearl River Delta serving to analyze applicability and transferability. Results demonstrated incorporating dynamic information notably enhanced the accuracy of UV identification. The proposed framework exhibited optimal performance (mIoU: 83.33 %) with robust spatial generalization, surpassing traditional methods (mIoU improvements: 0.46–12.49 %). Compared to multi-modal data-driven studies, our framework can be conveniently implemented in complex urban environments while maintaining both efficiency and accuracy in large-scale applications. This cost-effective solution offers high-quality, large-scale UV mapping, thereby advancing UV management and planning efforts.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.