Differential Privacy and the Accuracy of County-Level Net Migration Estimates.

Richelle L Winkler, Jaclyn L Butler, Katherine J Curtis, David Egan-Robertson
{"title":"Differential Privacy and the Accuracy of County-Level Net Migration Estimates.","authors":"Richelle L Winkler, Jaclyn L Butler, Katherine J Curtis, David Egan-Robertson","doi":"10.1007/s11113-021-09664-5","DOIUrl":null,"url":null,"abstract":"<p><p>Each decade since the 1950s, demographers have generated high quality net migration estimates by age, sex, and race for US counties using decennial census data as starting and ending populations. The estimates have been downloaded tens of thousands of times and widely used for planning, diverse applications, and research. Census 2020 should allow the series to extend through the 2010-2020 decade. The accuracy of new estimates, however, could be challenged by differentially private (DP) disclosure avoidance techniques in Census 2020 data products. This research brief estimates the impact of DP implementation on the accuracy of county-level net migration estimates. Using differentially private Census 2010 demonstration data, we construct a hypothetical set of DP migration estimates for 2000-2010 and compare them to published estimates, using common accuracy metrics and spatial analysis. Findings show that based on demonstration data released in 2020, net migration estimates by five-year age groups would only be accurate enough for use in about half of counties. Inaccuracies are larger in counties with populations less than 50,000, among age groups 65 and over, and among Hispanics. These problems are not fully resolved by grouping into broader age groups. Moreover, errors tend to cluster spatially in some regions of the country. Ultimately, the ability to generate accurate net migration estimates at the same level of detail as in the past will depend on the Census Bureau's allocation of the privacy loss budget.</p>","PeriodicalId":80485,"journal":{"name":"Annals of the Association of American Geographers. Association of American Geographers","volume":"72 1","pages":"417-435"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11485725/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of the Association of American Geographers. Association of American Geographers","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1007/s11113-021-09664-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/7/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Each decade since the 1950s, demographers have generated high quality net migration estimates by age, sex, and race for US counties using decennial census data as starting and ending populations. The estimates have been downloaded tens of thousands of times and widely used for planning, diverse applications, and research. Census 2020 should allow the series to extend through the 2010-2020 decade. The accuracy of new estimates, however, could be challenged by differentially private (DP) disclosure avoidance techniques in Census 2020 data products. This research brief estimates the impact of DP implementation on the accuracy of county-level net migration estimates. Using differentially private Census 2010 demonstration data, we construct a hypothetical set of DP migration estimates for 2000-2010 and compare them to published estimates, using common accuracy metrics and spatial analysis. Findings show that based on demonstration data released in 2020, net migration estimates by five-year age groups would only be accurate enough for use in about half of counties. Inaccuracies are larger in counties with populations less than 50,000, among age groups 65 and over, and among Hispanics. These problems are not fully resolved by grouping into broader age groups. Moreover, errors tend to cluster spatially in some regions of the country. Ultimately, the ability to generate accurate net migration estimates at the same level of detail as in the past will depend on the Census Bureau's allocation of the privacy loss budget.

差异隐私与县级净移民估算的准确性。
自 20 世纪 50 年代以来,每十年人口学家都会使用十年一次的人口普查数据作为起始和终止人口,按年龄、性别和种族对美国各县的人口净迁移进行高质量的估算。这些估算数据已被下载数万次,并被广泛用于规划、各种应用和研究。2020 年人口普查将使该系列数据延续到 2010-2020 十年。然而,2020 年人口普查数据产品中的差异保密 (DP) 披露规避技术可能会对新估算值的准确性提出挑战。本研究简报估算了 DP 的实施对县级净移民估算准确性的影响。利用 2010 年非公开人口普查示范数据,我们构建了一套假设的 2000-2010 年非公开人口普查移民估计值,并利用通用的准确性指标和空间分析,将其与已公布的估计值进行比较。研究结果表明,基于 2020 年发布的示范数据,按五年年龄组划分的净移民估算值的准确性仅足以在约一半的县中使用。在人口少于 50,000 的县、65 岁及以上年龄组和西班牙裔中,不准确度更高。这些问题并不能通过更广泛的年龄组分组完全解决。此外,误差往往集中在国内某些地区。最终,能否在与过去相同的详细程度上生成准确的净迁移估计值,将取决于人口普查局对隐私损失预算的分配。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信