{"title":"在数据稀缺的情况下,高分辨率定居点地图在预测人口数量方面有多准确?","authors":"Edith Darin, Ridhi Kashyap, Douglas R. Leasure","doi":"10.1002/psp.70083","DOIUrl":null,"url":null,"abstract":"<p>Despite the recent milestone of the world population surpassing 8 billion, disparities in population data reliability persist, with many countries facing outdated or incomplete census data. Such inaccuracies have far-reaching implications for various sectors, including public health, urban planning, and resource allocation. The study leverages the rich data environment provided by the detailed 2018 Colombian census data and its coverage indicator, which create a high-quality controlled environment to assess the performance of census-independent population estimation approaches. Drawing from a diverse range of environmental landscapes in Colombia, we evaluate the effectiveness of satellite imagery-derived settlement maps in conjunction with various modeling techniques. We explore two estimation approaches based on settlement maps: a data-driven machine learning approach exemplified by a random forest model and a process-driven probabilistic approach exemplified by a hierarchical Bayesian model. Our findings underscore the efficacy of Bayesian modeling in addressing data scarcity and bias, providing robust estimates and quantifying model uncertainty. However, the random forest model performs better when data inputs are detailed and unbiased. We further emphasize the importance of considering settlement map characteristics in the modeling process, while recognizing the overall limitations of relying solely on satellite imagery for population counts. Through a rigorous evaluation of different stages of the population modeling pipeline—data input, model selection, and outcome assessment—this study provides key insights into the challenges and requirements of using satellite imagery-derived settlement maps for population estimation in data-scarce contexts.</p>","PeriodicalId":48067,"journal":{"name":"Population Space and Place","volume":"31 6","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/psp.70083","citationCount":"0","resultStr":"{\"title\":\"How Accurate Are High Resolution Settlement Maps at Predicting Population Counts in Data Scarce Settings?\",\"authors\":\"Edith Darin, Ridhi Kashyap, Douglas R. Leasure\",\"doi\":\"10.1002/psp.70083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Despite the recent milestone of the world population surpassing 8 billion, disparities in population data reliability persist, with many countries facing outdated or incomplete census data. Such inaccuracies have far-reaching implications for various sectors, including public health, urban planning, and resource allocation. The study leverages the rich data environment provided by the detailed 2018 Colombian census data and its coverage indicator, which create a high-quality controlled environment to assess the performance of census-independent population estimation approaches. Drawing from a diverse range of environmental landscapes in Colombia, we evaluate the effectiveness of satellite imagery-derived settlement maps in conjunction with various modeling techniques. We explore two estimation approaches based on settlement maps: a data-driven machine learning approach exemplified by a random forest model and a process-driven probabilistic approach exemplified by a hierarchical Bayesian model. Our findings underscore the efficacy of Bayesian modeling in addressing data scarcity and bias, providing robust estimates and quantifying model uncertainty. However, the random forest model performs better when data inputs are detailed and unbiased. We further emphasize the importance of considering settlement map characteristics in the modeling process, while recognizing the overall limitations of relying solely on satellite imagery for population counts. Through a rigorous evaluation of different stages of the population modeling pipeline—data input, model selection, and outcome assessment—this study provides key insights into the challenges and requirements of using satellite imagery-derived settlement maps for population estimation in data-scarce contexts.</p>\",\"PeriodicalId\":48067,\"journal\":{\"name\":\"Population Space and Place\",\"volume\":\"31 6\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/psp.70083\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Population Space and Place\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/psp.70083\",\"RegionNum\":2,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DEMOGRAPHY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Population Space and Place","FirstCategoryId":"90","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/psp.70083","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
How Accurate Are High Resolution Settlement Maps at Predicting Population Counts in Data Scarce Settings?
Despite the recent milestone of the world population surpassing 8 billion, disparities in population data reliability persist, with many countries facing outdated or incomplete census data. Such inaccuracies have far-reaching implications for various sectors, including public health, urban planning, and resource allocation. The study leverages the rich data environment provided by the detailed 2018 Colombian census data and its coverage indicator, which create a high-quality controlled environment to assess the performance of census-independent population estimation approaches. Drawing from a diverse range of environmental landscapes in Colombia, we evaluate the effectiveness of satellite imagery-derived settlement maps in conjunction with various modeling techniques. We explore two estimation approaches based on settlement maps: a data-driven machine learning approach exemplified by a random forest model and a process-driven probabilistic approach exemplified by a hierarchical Bayesian model. Our findings underscore the efficacy of Bayesian modeling in addressing data scarcity and bias, providing robust estimates and quantifying model uncertainty. However, the random forest model performs better when data inputs are detailed and unbiased. We further emphasize the importance of considering settlement map characteristics in the modeling process, while recognizing the overall limitations of relying solely on satellite imagery for population counts. Through a rigorous evaluation of different stages of the population modeling pipeline—data input, model selection, and outcome assessment—this study provides key insights into the challenges and requirements of using satellite imagery-derived settlement maps for population estimation in data-scarce contexts.
期刊介绍:
Population, Space and Place aims to be the leading English-language research journal in the field of geographical population studies. It intends to: - Inform population researchers of the best theoretical and empirical research on topics related to population, space and place - Promote and further enhance the international standing of population research through the exchange of views on what constitutes best research practice - Facilitate debate on issues of policy relevance and encourage the widest possible discussion and dissemination of the applications of research on populations - Review and evaluate the significance of recent research findings and provide an international platform where researchers can discuss the future course of population research