行政记录中个人种族预测的错误分类和偏差

IF 5.9 1区 社会学 Q1 POLITICAL SCIENCE
Lisa P. Argyle, Michael J. Barber
{"title":"行政记录中个人种族预测的错误分类和偏差","authors":"Lisa P. Argyle, Michael J. Barber","doi":"10.1017/s0003055423000229","DOIUrl":null,"url":null,"abstract":"We show that a common method of predicting individuals’ race in administrative records, Bayesian Improved Surname Geocoding (BISG), produces misclassification errors that are strongly correlated with demographic and socioeconomic factors. In addition to the high error rates for some racial subgroups, the misclassification rates are correlated with the political and economic characteristics of a voter’s neighborhood. Racial and ethnic minorities who live in wealthy, highly educated, and politically active areas are most likely to be misclassified as white by BISG. Inferences about the relationship between sociodemographic factors and political outcomes, like voting, are likely to be biased in models using BISG to infer race. We develop an improved method in which the BISG estimates are incorporated into a machine learning model that accounts for class imbalance and incorporates individual and neighborhood characteristics. Our model decreases the misclassification rates among non-white individuals, in some cases by as much as 50%.","PeriodicalId":48451,"journal":{"name":"American Political Science Review","volume":" ","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2023-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Misclassification and Bias in Predictions of Individual Ethnicity from Administrative Records\",\"authors\":\"Lisa P. Argyle, Michael J. Barber\",\"doi\":\"10.1017/s0003055423000229\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We show that a common method of predicting individuals’ race in administrative records, Bayesian Improved Surname Geocoding (BISG), produces misclassification errors that are strongly correlated with demographic and socioeconomic factors. In addition to the high error rates for some racial subgroups, the misclassification rates are correlated with the political and economic characteristics of a voter’s neighborhood. Racial and ethnic minorities who live in wealthy, highly educated, and politically active areas are most likely to be misclassified as white by BISG. Inferences about the relationship between sociodemographic factors and political outcomes, like voting, are likely to be biased in models using BISG to infer race. We develop an improved method in which the BISG estimates are incorporated into a machine learning model that accounts for class imbalance and incorporates individual and neighborhood characteristics. Our model decreases the misclassification rates among non-white individuals, in some cases by as much as 50%.\",\"PeriodicalId\":48451,\"journal\":{\"name\":\"American Political Science Review\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.9000,\"publicationDate\":\"2023-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American Political Science Review\",\"FirstCategoryId\":\"90\",\"ListUrlMain\":\"https://doi.org/10.1017/s0003055423000229\",\"RegionNum\":1,\"RegionCategory\":\"社会学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"POLITICAL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Political Science Review","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1017/s0003055423000229","RegionNum":1,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"POLITICAL SCIENCE","Score":null,"Total":0}
引用次数: 2

摘要

我们发现,在行政记录中预测个人种族的一种常见方法,贝叶斯改进姓氏地理编码(BISG),会产生与人口统计和社会经济因素密切相关的错误分类错误。除了一些种族亚组的高错误率外,错误分类率还与选民所在社区的政治和经济特征有关。生活在富裕、受过高等教育和政治活跃地区的种族和少数民族最有可能被BISG错误地归类为白人。在使用BISG推断种族的模型中,对社会人口因素和政治结果(如投票)之间关系的推断可能存在偏见。我们开发了一种改进的方法,在该方法中,BISG估计被纳入机器学习模型,该模型考虑了类不平衡,并纳入了个体和邻域特征。我们的模型降低了非白人个体的错误分类率,在某些情况下高达50%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Misclassification and Bias in Predictions of Individual Ethnicity from Administrative Records
We show that a common method of predicting individuals’ race in administrative records, Bayesian Improved Surname Geocoding (BISG), produces misclassification errors that are strongly correlated with demographic and socioeconomic factors. In addition to the high error rates for some racial subgroups, the misclassification rates are correlated with the political and economic characteristics of a voter’s neighborhood. Racial and ethnic minorities who live in wealthy, highly educated, and politically active areas are most likely to be misclassified as white by BISG. Inferences about the relationship between sociodemographic factors and political outcomes, like voting, are likely to be biased in models using BISG to infer race. We develop an improved method in which the BISG estimates are incorporated into a machine learning model that accounts for class imbalance and incorporates individual and neighborhood characteristics. Our model decreases the misclassification rates among non-white individuals, in some cases by as much as 50%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.80
自引率
5.90%
发文量
119
期刊介绍: American Political Science Review is political science''s premier scholarly research journal, providing peer-reviewed articles and review essays from subfields throughout the discipline. Areas covered include political theory, American politics, public policy, public administration, comparative politics, and international relations. APSR has published continuously since 1906. American Political Science Review is sold ONLY as part of a joint subscription with Perspectives on Politics and PS: Political Science & Politics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信