How Accurate Are High Resolution Settlement Maps at Predicting Population Counts in Data Scarce Settings?

IF 2.6 2区 社会学 Q1 DEMOGRAPHY
Edith Darin, Ridhi Kashyap, Douglas R. Leasure
{"title":"How Accurate Are High Resolution Settlement Maps at Predicting Population Counts in Data Scarce Settings?","authors":"Edith Darin,&nbsp;Ridhi Kashyap,&nbsp;Douglas R. Leasure","doi":"10.1002/psp.70083","DOIUrl":null,"url":null,"abstract":"<p>Despite the recent milestone of the world population surpassing 8 billion, disparities in population data reliability persist, with many countries facing outdated or incomplete census data. Such inaccuracies have far-reaching implications for various sectors, including public health, urban planning, and resource allocation. The study leverages the rich data environment provided by the detailed 2018 Colombian census data and its coverage indicator, which create a high-quality controlled environment to assess the performance of census-independent population estimation approaches. Drawing from a diverse range of environmental landscapes in Colombia, we evaluate the effectiveness of satellite imagery-derived settlement maps in conjunction with various modeling techniques. We explore two estimation approaches based on settlement maps: a data-driven machine learning approach exemplified by a random forest model and a process-driven probabilistic approach exemplified by a hierarchical Bayesian model. Our findings underscore the efficacy of Bayesian modeling in addressing data scarcity and bias, providing robust estimates and quantifying model uncertainty. However, the random forest model performs better when data inputs are detailed and unbiased. We further emphasize the importance of considering settlement map characteristics in the modeling process, while recognizing the overall limitations of relying solely on satellite imagery for population counts. Through a rigorous evaluation of different stages of the population modeling pipeline—data input, model selection, and outcome assessment—this study provides key insights into the challenges and requirements of using satellite imagery-derived settlement maps for population estimation in data-scarce contexts.</p>","PeriodicalId":48067,"journal":{"name":"Population Space and Place","volume":"31 6","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/psp.70083","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Population Space and Place","FirstCategoryId":"90","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/psp.70083","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
引用次数: 0

Abstract

Despite the recent milestone of the world population surpassing 8 billion, disparities in population data reliability persist, with many countries facing outdated or incomplete census data. Such inaccuracies have far-reaching implications for various sectors, including public health, urban planning, and resource allocation. The study leverages the rich data environment provided by the detailed 2018 Colombian census data and its coverage indicator, which create a high-quality controlled environment to assess the performance of census-independent population estimation approaches. Drawing from a diverse range of environmental landscapes in Colombia, we evaluate the effectiveness of satellite imagery-derived settlement maps in conjunction with various modeling techniques. We explore two estimation approaches based on settlement maps: a data-driven machine learning approach exemplified by a random forest model and a process-driven probabilistic approach exemplified by a hierarchical Bayesian model. Our findings underscore the efficacy of Bayesian modeling in addressing data scarcity and bias, providing robust estimates and quantifying model uncertainty. However, the random forest model performs better when data inputs are detailed and unbiased. We further emphasize the importance of considering settlement map characteristics in the modeling process, while recognizing the overall limitations of relying solely on satellite imagery for population counts. Through a rigorous evaluation of different stages of the population modeling pipeline—data input, model selection, and outcome assessment—this study provides key insights into the challenges and requirements of using satellite imagery-derived settlement maps for population estimation in data-scarce contexts.

Abstract Image

在数据稀缺的情况下,高分辨率定居点地图在预测人口数量方面有多准确?
尽管最近世界人口超过80亿,但人口数据可靠性方面的差异仍然存在,许多国家面临过时或不完整的人口普查数据。这种不准确对包括公共卫生、城市规划和资源分配在内的各个部门产生了深远的影响。该研究利用了详细的2018年哥伦比亚人口普查数据及其覆盖指标提供的丰富数据环境,为评估独立于人口普查的人口估计方法的绩效创造了一个高质量的受控环境。从哥伦比亚的各种环境景观中,我们评估了卫星图像衍生的聚落地图与各种建模技术相结合的有效性。我们探索了两种基于定居点地图的估计方法:以随机森林模型为例的数据驱动的机器学习方法和以分层贝叶斯模型为例的过程驱动的概率方法。我们的研究结果强调了贝叶斯模型在解决数据稀缺性和偏差、提供稳健估计和量化模型不确定性方面的有效性。然而,当数据输入详细且无偏时,随机森林模型表现更好。我们进一步强调在建模过程中考虑住区地图特征的重要性,同时认识到完全依靠卫星图像进行人口统计的总体局限性。通过对人口建模管道的不同阶段(数据输入、模型选择和结果评估)的严格评估,本研究为在数据稀缺的情况下使用卫星图像衍生的定居图进行人口估计的挑战和要求提供了关键见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.00
自引率
12.50%
发文量
87
期刊介绍: Population, Space and Place aims to be the leading English-language research journal in the field of geographical population studies. It intends to: - Inform population researchers of the best theoretical and empirical research on topics related to population, space and place - Promote and further enhance the international standing of population research through the exchange of views on what constitutes best research practice - Facilitate debate on issues of policy relevance and encourage the widest possible discussion and dissemination of the applications of research on populations - Review and evaluate the significance of recent research findings and provide an international platform where researchers can discuss the future course of population research
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信