{"title":"An Open Source Replication of a Winning Recidivism Prediction Model.","authors":"Giovanni M Circo, Andrew P Wheeler","doi":"10.1177/0306624X221133004","DOIUrl":null,"url":null,"abstract":"<p><p>We present results of our winning solution to the National Institute of Justice recidivism forecasting challenge. Our team, \"MCHawks,\" placed highly in both terms of accuracy (as measured via the Brier score), as well as the fairness criteria (weighted by differences in false positive rates between White and Black parolees). We used a non-linear machine learning model, XGBoost, although we detail our search of different model specifications, as many different models' predictive performance is very similar. Our solution to balancing false positive rates is trivial; we bias predictions to always be \"low risk\" so false positive rates for each racial group are zero. We discuss changes to the fairness metric to promote non-trivial solutions. By providing open-source replication materials, it is within the capabilities of others to build just as accurate models without extensive statistical expertise or computational resources.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":" ","pages":"438-453"},"PeriodicalIF":16.4000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1177/0306624X221133004","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/11/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
We present results of our winning solution to the National Institute of Justice recidivism forecasting challenge. Our team, "MCHawks," placed highly in both terms of accuracy (as measured via the Brier score), as well as the fairness criteria (weighted by differences in false positive rates between White and Black parolees). We used a non-linear machine learning model, XGBoost, although we detail our search of different model specifications, as many different models' predictive performance is very similar. Our solution to balancing false positive rates is trivial; we bias predictions to always be "low risk" so false positive rates for each racial group are zero. We discuss changes to the fairness metric to promote non-trivial solutions. By providing open-source replication materials, it is within the capabilities of others to build just as accurate models without extensive statistical expertise or computational resources.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.