Understanding the development, performance, fairness, and transparency of machine learning models used in child protection prediction: A systematic review.
{"title":"Understanding the development, performance, fairness, and transparency of machine learning models used in child protection prediction: A systematic review.","authors":"Claudia Bull, Steve Kisely, Kim Betts, Yanan Hu","doi":"10.1016/j.chiabu.2025.107630","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To understand the development and validation of contemporary machine learning (ML) models for child protection prediction, their performance evaluation, integration of fairness, and operationalisation of model explainability and transparency.</p><p><strong>Methods: </strong>This systematic review followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines. Model transparency was assessed against the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis + Artificial Intelligence (TRIPOD+AI) criteria, while study risk of bias and model applicability were evaluated using Prediction model Risk Of Bias ASsessment Tool (PROBAST) criteria.</p><p><strong>Results: </strong>Eleven studies were identified, employing various ML approaches such as supervised classification models (e.g., binary classification, decision trees, support vector machines), regression models, and ensemble methods. These models utilised administrative health, child welfare, and criminal/court data. Performance was evaluated using a range of discrimination, classification, and calibration metrics, yielding variable results. Only four models incorporated group fairness, focusing on race/ethnicity as the protected attribute. Explainability and transparency were enhanced through Receiver Operating Curves, Precision-Recall Curves, feature importance plots, and SHapley Additive exPlanations (SHAP) plots. According to TRIPOD+AI criteria, only four studies reported likely reproducible models. Based on PROBAST criteria, all studies had unclear or high risk of bias.</p><p><strong>Conclusions: </strong>This is the first review to use TRIPOD+AI and PROBAST criteria to assess the risk of bias and transparency of ML models in child protection prediction. The findings reveal that the field remains methodologically immature, with many models lacking fair, transparent, and reproducible methods. Adoption of advanced fairness techniques (beyond fairness-through-unawareness), stakeholder involvement in model development and validation, and transparency through data and code sharing will be essential for the ethical and effective design of ML models, ultimately improving decision-making processes and outcomes for vulnerable children and families.</p>","PeriodicalId":51343,"journal":{"name":"Child Abuse & Neglect","volume":"169 Pt 1","pages":"107630"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Child Abuse & Neglect","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1016/j.chiabu.2025.107630","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FAMILY STUDIES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To understand the development and validation of contemporary machine learning (ML) models for child protection prediction, their performance evaluation, integration of fairness, and operationalisation of model explainability and transparency.
Methods: This systematic review followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines. Model transparency was assessed against the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis + Artificial Intelligence (TRIPOD+AI) criteria, while study risk of bias and model applicability were evaluated using Prediction model Risk Of Bias ASsessment Tool (PROBAST) criteria.
Results: Eleven studies were identified, employing various ML approaches such as supervised classification models (e.g., binary classification, decision trees, support vector machines), regression models, and ensemble methods. These models utilised administrative health, child welfare, and criminal/court data. Performance was evaluated using a range of discrimination, classification, and calibration metrics, yielding variable results. Only four models incorporated group fairness, focusing on race/ethnicity as the protected attribute. Explainability and transparency were enhanced through Receiver Operating Curves, Precision-Recall Curves, feature importance plots, and SHapley Additive exPlanations (SHAP) plots. According to TRIPOD+AI criteria, only four studies reported likely reproducible models. Based on PROBAST criteria, all studies had unclear or high risk of bias.
Conclusions: This is the first review to use TRIPOD+AI and PROBAST criteria to assess the risk of bias and transparency of ML models in child protection prediction. The findings reveal that the field remains methodologically immature, with many models lacking fair, transparent, and reproducible methods. Adoption of advanced fairness techniques (beyond fairness-through-unawareness), stakeholder involvement in model development and validation, and transparency through data and code sharing will be essential for the ethical and effective design of ML models, ultimately improving decision-making processes and outcomes for vulnerable children and families.
期刊介绍:
Official Publication of the International Society for Prevention of Child Abuse and Neglect. Child Abuse & Neglect The International Journal, provides an international, multidisciplinary forum on all aspects of child abuse and neglect, with special emphasis on prevention and treatment; the scope extends further to all those aspects of life which either favor or hinder child development. While contributions will primarily be from the fields of psychology, psychiatry, social work, medicine, nursing, law enforcement, legislature, education, and anthropology, the Journal encourages the concerned lay individual and child-oriented advocate organizations to contribute.