{"title":"Integrating Survey and Geospatial Data for Geographical Targeting of the Poor and Vulnerable: Evidence from Malawi","authors":"M. Gualavisi, David Newhouse","doi":"10.1093/wber/lhae025","DOIUrl":null,"url":null,"abstract":"\n To address the challenge of identifying the poorest villages in developing countries, this study introduces a cost-effective strategy that leverages a combination of household consumption surveys, geospatial data, and a partial registry. The study simulates a partial registry, containing data from 450 villages across 10 impoverished districts of Malawi, and contains proxy poverty indicators. These indicators are used to impute estimates of household per capita consumption, which in turn are used to train a prediction model using publicly available geospatial data. This method is evaluated against an imputed reference of village welfare, derived from the 2016 household survey. The partial registry approach is benchmarked against three alternatives: proxy means test scores, the Meta Relative Wealth Index, and predictions from household surveys with geospatial indicators. Results show the partial registry model's rank correlation with actual welfare measures at 0.75, outperforming the other methods significantly, which ranged from −0.02 to 0.2. These findings hold under various robustness checks, including the addition of Gaussian noise, indicating that collecting household-level proxy poverty data in low-income areas can significantly improve the performance of machine learning models that integrate survey and satellite imagery data for village-level geographic targeting.","PeriodicalId":361118,"journal":{"name":"The World Bank Economic Review","volume":"51 20","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The World Bank Economic Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/wber/lhae025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To address the challenge of identifying the poorest villages in developing countries, this study introduces a cost-effective strategy that leverages a combination of household consumption surveys, geospatial data, and a partial registry. The study simulates a partial registry, containing data from 450 villages across 10 impoverished districts of Malawi, and contains proxy poverty indicators. These indicators are used to impute estimates of household per capita consumption, which in turn are used to train a prediction model using publicly available geospatial data. This method is evaluated against an imputed reference of village welfare, derived from the 2016 household survey. The partial registry approach is benchmarked against three alternatives: proxy means test scores, the Meta Relative Wealth Index, and predictions from household surveys with geospatial indicators. Results show the partial registry model's rank correlation with actual welfare measures at 0.75, outperforming the other methods significantly, which ranged from −0.02 to 0.2. These findings hold under various robustness checks, including the addition of Gaussian noise, indicating that collecting household-level proxy poverty data in low-income areas can significantly improve the performance of machine learning models that integrate survey and satellite imagery data for village-level geographic targeting.