关于NIJ累犯预测挑战中乘法公平得分的说明

IF 3.1 Q1 CRIMINOLOGY & PENOLOGY

Crime Science Pub Date : 2021-08-06 DOI:10.1186/s40163-021-00152-x

Mohler, George, Porter, Michael D.

{"title":"关于NIJ累犯预测挑战中乘法公平得分的说明","authors":"Mohler, George, Porter, Michael D.","doi":"10.1186/s40163-021-00152-x","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Background</h3><p>The 2021 NIJ recidivism forecasting challenge asks participants to construct predictive models of recidivism while balancing false positive rates across groups of Black and white individuals through a multiplicative fairness score. We investigate the performance of several models for forecasting 1-year recidivism and optimizing the NIJ multiplicative fairness metric.</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>We consider standard linear and logistic regression, a penalized regression that optimizes a convex surrogate loss (that we show has an analytical solution), two post-processing techniques, linear regression with re-balanced data, a black-box general purpose optimizer applied directly to the NIJ metric and a gradient boosting machine learning approach.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>For the set of models investigated, we find that a simple heuristic of truncating scores at the decision threshold (thus predicting no recidivism across the data) yields as good or better NIJ fairness scores on held out data compared to other, more sophisticated approaches. We also find that when the cutoff is further away from the base rate of recidivism, as is the case in the competition where the base rate is 0.29 and the cutoff is 0.5, then simply optimizing the mean square error gives nearly optimal NIJ fairness metric solutions.</p><h3 data-test=\"abstract-sub-heading\">Conclusions</h3><p>The multiplicative metric in the 2021 NIJ recidivism forecasting competition encourages solutions that simply optimize MSE and/or use truncation, therefore yielding trivial solutions that forecast no one will recidivate.</p>","PeriodicalId":37844,"journal":{"name":"Crime Science","volume":"16 3","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2021-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"A note on the multiplicative fairness score in the NIJ recidivism forecasting challenge\",\"authors\":\"Mohler, George, Porter, Michael D.\",\"doi\":\"10.1186/s40163-021-00152-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Background</h3><p>The 2021 NIJ recidivism forecasting challenge asks participants to construct predictive models of recidivism while balancing false positive rates across groups of Black and white individuals through a multiplicative fairness score. We investigate the performance of several models for forecasting 1-year recidivism and optimizing the NIJ multiplicative fairness metric.</p><h3 data-test=\\\"abstract-sub-heading\\\">Methods</h3><p>We consider standard linear and logistic regression, a penalized regression that optimizes a convex surrogate loss (that we show has an analytical solution), two post-processing techniques, linear regression with re-balanced data, a black-box general purpose optimizer applied directly to the NIJ metric and a gradient boosting machine learning approach.</p><h3 data-test=\\\"abstract-sub-heading\\\">Results</h3><p>For the set of models investigated, we find that a simple heuristic of truncating scores at the decision threshold (thus predicting no recidivism across the data) yields as good or better NIJ fairness scores on held out data compared to other, more sophisticated approaches. We also find that when the cutoff is further away from the base rate of recidivism, as is the case in the competition where the base rate is 0.29 and the cutoff is 0.5, then simply optimizing the mean square error gives nearly optimal NIJ fairness metric solutions.</p><h3 data-test=\\\"abstract-sub-heading\\\">Conclusions</h3><p>The multiplicative metric in the 2021 NIJ recidivism forecasting competition encourages solutions that simply optimize MSE and/or use truncation, therefore yielding trivial solutions that forecast no one will recidivate.</p>\",\"PeriodicalId\":37844,\"journal\":{\"name\":\"Crime Science\",\"volume\":\"16 3\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2021-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Crime Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s40163-021-00152-x\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CRIMINOLOGY & PENOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crime Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40163-021-00152-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CRIMINOLOGY & PENOLOGY","Score":null,"Total":0}

引用次数: 7

摘要

2021年NIJ累犯预测挑战要求参与者构建累犯预测模型，同时通过乘法公平得分平衡黑人和白人群体的假阳性率。我们研究了几种预测1年再犯的模型的性能，并优化了NIJ乘法公平指标。我们考虑了标准线性和逻辑回归，优化凸代理损失的惩罚回归(我们显示有解析解)，两种后处理技术，重新平衡数据的线性回归，直接应用于NIJ度量的黑盒通用优化器和梯度增强机器学习方法。对于所调查的模型集，我们发现，与其他更复杂的方法相比，在决策阈值处截断分数的简单启发式方法(从而预测整个数据中没有再犯)在保留数据上产生良好或更好的NIJ公平分数。我们还发现，当临界值离累犯的基本率更远时，就像在比赛中，基本率为0.29，临界值为0.5的情况下，那么简单地优化均方误差就会给出接近最优的NIJ公平度量解决方案。2021年NIJ累犯预测竞赛中的乘法指标鼓励简单优化MSE和/或使用截断的解决方案，因此产生预测没有人会再犯的平凡解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A note on the multiplicative fairness score in the NIJ recidivism forecasting challenge

Background

The 2021 NIJ recidivism forecasting challenge asks participants to construct predictive models of recidivism while balancing false positive rates across groups of Black and white individuals through a multiplicative fairness score. We investigate the performance of several models for forecasting 1-year recidivism and optimizing the NIJ multiplicative fairness metric.

Methods

We consider standard linear and logistic regression, a penalized regression that optimizes a convex surrogate loss (that we show has an analytical solution), two post-processing techniques, linear regression with re-balanced data, a black-box general purpose optimizer applied directly to the NIJ metric and a gradient boosting machine learning approach.

Results

For the set of models investigated, we find that a simple heuristic of truncating scores at the decision threshold (thus predicting no recidivism across the data) yields as good or better NIJ fairness scores on held out data compared to other, more sophisticated approaches. We also find that when the cutoff is further away from the base rate of recidivism, as is the case in the competition where the base rate is 0.29 and the cutoff is 0.5, then simply optimizing the mean square error gives nearly optimal NIJ fairness metric solutions.

Conclusions

The multiplicative metric in the 2021 NIJ recidivism forecasting competition encourages solutions that simply optimize MSE and/or use truncation, therefore yielding trivial solutions that forecast no one will recidivate.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Crime Science Social Sciences-Cultural Studies

CiteScore

11.90

自引率

8.20%

发文量

审稿时长

13 weeks

期刊介绍： Crime Science is an international, interdisciplinary, peer-reviewed journal with an applied focus. The journal''s main focus is on research articles and systematic reviews that reflect the growing cooperation among a variety of fields, including environmental criminology, economics, engineering, geography, public health, psychology, statistics and urban planning, on improving the detection, prevention and understanding of crime and disorder. Crime Science will publish theoretical articles that are relevant to the field, for example, approaches that integrate theories from different disciplines. The goal of the journal is to broaden the scientific base for the understanding, analysis and control of crime and disorder. It is aimed at researchers, practitioners and policy-makers with an interest in crime reduction. It will also publish short contributions on timely topics including crime patterns, technological advances for detection and prevention, and analytical techniques, and on the crime reduction applications of research from a wide range of fields. Crime Science publishes research articles, systematic reviews, short contributions and theoretical articles. While Crime Science uses the APA reference style, the journal welcomes submissions using alternative reference styles on a case-by-case basis.