{"title":"Algorithmic fairness in credit scoring","authors":"Teresa Bono, Karen Croxson, Adam Giles","doi":"10.1093/oxrep/grab020","DOIUrl":null,"url":null,"abstract":"\n The use of machine learning as an input into decision-making is on the rise, owing to its ability to uncover hidden patterns in large data and improve prediction accuracy. Questions have been raised, however, about the potential distributional impacts of these technologies, with one concern being that they may perpetuate or even amplify human biases from the past. Exploiting detailed credit file data for 800,000 UK borrowers, we simulate a switch from a traditional (logit) credit scoring model to ensemble machine-learning methods. We confirm that machine-learning models are more accurate overall. We also find that they do as well as the simpler traditional model on relevant fairness criteria, where these criteria pertain to overall accuracy and error rates for population subgroups defined along protected or sensitive lines (gender, race, health status, and deprivation). We do observe some differences in the way credit-scoring models perform for different subgroups, but these manifest under a traditional modelling approach and switching to machine learning neither exacerbates nor eliminates these issues. The paper discusses some of the mechanical and data factors that may contribute to statistical fairness issues in the context of credit scoring.","PeriodicalId":48024,"journal":{"name":"Oxford Review of Economic Policy","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oxford Review of Economic Policy","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1093/oxrep/grab020","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 13
Abstract
The use of machine learning as an input into decision-making is on the rise, owing to its ability to uncover hidden patterns in large data and improve prediction accuracy. Questions have been raised, however, about the potential distributional impacts of these technologies, with one concern being that they may perpetuate or even amplify human biases from the past. Exploiting detailed credit file data for 800,000 UK borrowers, we simulate a switch from a traditional (logit) credit scoring model to ensemble machine-learning methods. We confirm that machine-learning models are more accurate overall. We also find that they do as well as the simpler traditional model on relevant fairness criteria, where these criteria pertain to overall accuracy and error rates for population subgroups defined along protected or sensitive lines (gender, race, health status, and deprivation). We do observe some differences in the way credit-scoring models perform for different subgroups, but these manifest under a traditional modelling approach and switching to machine learning neither exacerbates nor eliminates these issues. The paper discusses some of the mechanical and data factors that may contribute to statistical fairness issues in the context of credit scoring.
期刊介绍:
The Oxford Review of Economic Policy is a refereed journal which is published quarterly. Each issue concentrates on a current theme in economic policy, with a balance between macro- and microeconomics, and comprises an assessment and a number of articles. It gives a valuable appraisal of economic policies worldwide. While the analysis is challenging and at the forefront of current thinking, articles are presented in non-technical language to make them readily accessible to all readers. The Oxford Review is aimed at a wide audience including government, business and policy-makers, as well as academics and students. It is required reading for those who need to know where research is leading.