Ryan Crowley, Katherine Parkin, Emma Rocheteau, Efthalia Massou, Yasmin Friedmann, Ann John, Rachel Sippy, Pietro Liò, Anna Moore
{"title":"Machine learning for prediction of childhood mental health problems in social care.","authors":"Ryan Crowley, Katherine Parkin, Emma Rocheteau, Efthalia Massou, Yasmin Friedmann, Ann John, Rachel Sippy, Pietro Liò, Anna Moore","doi":"10.1192/bjo.2025.32","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Rates of childhood mental health problems are increasing in the UK. Early identification of childhood mental health problems is challenging but critical to children's future psychosocial development. This is particularly important for children with social care contact because earlier identification can facilitate earlier intervention. Clinical prediction tools could improve these early intervention efforts.</p><p><strong>Aims: </strong>Characterise a novel cohort consisting of children in social care and develop effective machine learning models for prediction of childhood mental health problems.</p><p><strong>Method: </strong>We used linked, de-identified data from the Secure Anonymised Information Linkage Databank to create a cohort of 26 820 children in Wales, UK, receiving social care services. Integrating health, social care and education data, we developed several machine learning models aimed at predicting childhood mental health problems. We assessed the performance, interpretability and fairness of these models.</p><p><strong>Results: </strong>Risk factors strongly associated with childhood mental health problems included age, substance misuse and being a looked after child. The best-performing model, a gradient boosting classifier, achieved an area under the receiver operating characteristic curve of 0.75 (95% CI 0.73-0.78). Assessments of algorithmic fairness showed potential biases within these models.</p><p><strong>Conclusions: </strong>Machine learning performance on this prediction task was promising. Predictive performance in social care settings can be bolstered by linking diverse routinely collected data-sets, making available a range of heterogenous risk factors relating to clinical, social and environmental exposures.</p>","PeriodicalId":9038,"journal":{"name":"BJPsych Open","volume":"11 3","pages":"e86"},"PeriodicalIF":3.9000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12052593/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BJPsych Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1192/bjo.2025.32","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Rates of childhood mental health problems are increasing in the UK. Early identification of childhood mental health problems is challenging but critical to children's future psychosocial development. This is particularly important for children with social care contact because earlier identification can facilitate earlier intervention. Clinical prediction tools could improve these early intervention efforts.
Aims: Characterise a novel cohort consisting of children in social care and develop effective machine learning models for prediction of childhood mental health problems.
Method: We used linked, de-identified data from the Secure Anonymised Information Linkage Databank to create a cohort of 26 820 children in Wales, UK, receiving social care services. Integrating health, social care and education data, we developed several machine learning models aimed at predicting childhood mental health problems. We assessed the performance, interpretability and fairness of these models.
Results: Risk factors strongly associated with childhood mental health problems included age, substance misuse and being a looked after child. The best-performing model, a gradient boosting classifier, achieved an area under the receiver operating characteristic curve of 0.75 (95% CI 0.73-0.78). Assessments of algorithmic fairness showed potential biases within these models.
Conclusions: Machine learning performance on this prediction task was promising. Predictive performance in social care settings can be bolstered by linking diverse routinely collected data-sets, making available a range of heterogenous risk factors relating to clinical, social and environmental exposures.
背景:在英国,儿童心理健康问题的比率正在上升。早期发现儿童心理健康问题具有挑战性,但对儿童未来的社会心理发展至关重要。这对有社会关怀接触的儿童尤其重要,因为早期识别有助于早期干预。临床预测工具可以改善这些早期干预工作。目的:描述一个由社会护理儿童组成的新队列,并开发有效的机器学习模型来预测儿童心理健康问题。方法:我们使用来自安全匿名信息链接数据库的链接的、去识别的数据来创建一个来自英国威尔士的26820名接受社会护理服务的儿童队列。整合健康、社会关怀和教育数据,我们开发了几个旨在预测儿童心理健康问题的机器学习模型。我们评估了这些模型的性能、可解释性和公平性。结果:与儿童心理健康问题密切相关的风险因素包括年龄、药物滥用和被照顾的儿童。表现最好的模型是梯度增强分类器,其在接收者工作特征曲线下的面积为0.75 (95% CI 0.73-0.78)。对算法公平性的评估显示了这些模型中潜在的偏见。结论:机器学习在这个预测任务上的表现是有希望的。通过将各种常规收集的数据集联系起来,提供与临床、社会和环境暴露有关的一系列异质性风险因素,可以加强社会护理环境中的预测绩效。
期刊介绍:
Announcing the launch of BJPsych Open, an exciting new open access online journal for the publication of all methodologically sound research in all fields of psychiatry and disciplines related to mental health. BJPsych Open will maintain the highest scientific, peer review, and ethical standards of the BJPsych, ensure rapid publication for authors whilst sharing research with no cost to the reader in the spirit of maximising dissemination and public engagement. Cascade submission from BJPsych to BJPsych Open is a new option for authors whose first priority is rapid online publication with the prestigious BJPsych brand. Authors will also retain copyright to their works under a creative commons license.