João P. da Silva , José F. Rodrigues-Jr , João P. de Albuquerque
{"title":"On the power of CNNs to detect slums in Brazil","authors":"João P. da Silva , José F. Rodrigues-Jr , João P. de Albuquerque","doi":"10.1016/j.compenvurbsys.2025.102306","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid expansion of slums poses a critical challenge for urban planning in Low- and Middle-Income Countries (LMICs), where traditional data collection methods like censuses are often outdated and insufficient. This study examines the transferability and generalization capabilities of deep learning models, specifically Convolutional Neural Networks (CNNs), for automated slum detection across six Brazilian cities with varying urban morphologies: São Paulo, Rio de Janeiro, Belo Horizonte, Brasília, Salvador, and Porto Alegre. Utilizing Very High Resolution (VHR) and High Resolution (HR) satellite imagery, we trained and evaluated models based on the EfficientNetV2L architecture. Our experimental results show that CNN models trained on data from a single city achieved high accuracy within that city (F1 scores exceeding 0.90 with VHR imagery), but their performance significantly decreased when applied to other cities (F1 scores dropping below 0.80), highlighting the impact of regional variations in urban morphology. Conversely, a generalized model trained on combined data from all six cities maintained robust performance across all cities, achieving F1 scores above 0.80 with VHR imagery. These findings indicate that while CNNs are effective for automated slum mapping, regional diversity necessitates training on diverse datasets to ensure generalization. We provide a comprehensive methodology over an openly shared dataset, and code to facilitate future research and applications in urban geoscience. The aim is to enhance the scalability and generalization of remote sensing and deep learning methods for slum identification across diverse urban environments.</div></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"121 ","pages":"Article 102306"},"PeriodicalIF":7.1000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0198971525000596","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid expansion of slums poses a critical challenge for urban planning in Low- and Middle-Income Countries (LMICs), where traditional data collection methods like censuses are often outdated and insufficient. This study examines the transferability and generalization capabilities of deep learning models, specifically Convolutional Neural Networks (CNNs), for automated slum detection across six Brazilian cities with varying urban morphologies: São Paulo, Rio de Janeiro, Belo Horizonte, Brasília, Salvador, and Porto Alegre. Utilizing Very High Resolution (VHR) and High Resolution (HR) satellite imagery, we trained and evaluated models based on the EfficientNetV2L architecture. Our experimental results show that CNN models trained on data from a single city achieved high accuracy within that city (F1 scores exceeding 0.90 with VHR imagery), but their performance significantly decreased when applied to other cities (F1 scores dropping below 0.80), highlighting the impact of regional variations in urban morphology. Conversely, a generalized model trained on combined data from all six cities maintained robust performance across all cities, achieving F1 scores above 0.80 with VHR imagery. These findings indicate that while CNNs are effective for automated slum mapping, regional diversity necessitates training on diverse datasets to ensure generalization. We provide a comprehensive methodology over an openly shared dataset, and code to facilitate future research and applications in urban geoscience. The aim is to enhance the scalability and generalization of remote sensing and deep learning methods for slum identification across diverse urban environments.
期刊介绍:
Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.