{"title":"Comparative Analysis of Machine Learning Models for Recidivism Prediction Based on Chi-square Test","authors":"Zhihao Zhang, Zhaohua Huang, Zhongbao Wan, Lingci Meng","doi":"10.1109/ICAA53760.2021.00012","DOIUrl":null,"url":null,"abstract":"In order to excavate the influencing factors of recidivsim of the prisoners so as to achieve the purpose of prevention and redction of crime. This article proposes a feature selection method based on the experience of field experts and chi-square test, and uses the data from 2004 survey of inmates in state and federal correctional facilities as source, through data cleaning and data discretizes, and select five machine learning models for training and prediction respectively. Taking the accuracy rate, recall rate and values as evaluation indicators, compared the recidivism prediction capabilities of the five models. The results show that the feature selection method proposed in this paper can greatly impove the accuracy and recall rate of each model, and the logisitc regression model has a strong comprehensive ability.","PeriodicalId":121879,"journal":{"name":"2021 International Conference on Intelligent Computing, Automation and Applications (ICAA)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Intelligent Computing, Automation and Applications (ICAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAA53760.2021.00012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In order to excavate the influencing factors of recidivsim of the prisoners so as to achieve the purpose of prevention and redction of crime. This article proposes a feature selection method based on the experience of field experts and chi-square test, and uses the data from 2004 survey of inmates in state and federal correctional facilities as source, through data cleaning and data discretizes, and select five machine learning models for training and prediction respectively. Taking the accuracy rate, recall rate and values as evaluation indicators, compared the recidivism prediction capabilities of the five models. The results show that the feature selection method proposed in this paper can greatly impove the accuracy and recall rate of each model, and the logisitc regression model has a strong comprehensive ability.