Development and Validation of Prediction Models for the Diagnosis of Autism Spectrum Disorder in a Korean General Population

Hyelee Kim MD, MAS, MS , Bennett L. Leventhal MD , Yun-Joo Koh PhD , Efstathios D. Gennatas MBBS, PhD , Young Shin Kim MD, MPH, MS, PhD
{"title":"Development and Validation of Prediction Models for the Diagnosis of Autism Spectrum Disorder in a Korean General Population","authors":"Hyelee Kim MD, MAS, MS ,&nbsp;Bennett L. Leventhal MD ,&nbsp;Yun-Joo Koh PhD ,&nbsp;Efstathios D. Gennatas MBBS, PhD ,&nbsp;Young Shin Kim MD, MPH, MS, PhD","doi":"10.1016/j.jaacop.2024.03.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Delays in autism spectrum disorder (ASD) diagnosis and treatment are significant clinical problems that can be addressed by timely, community-based assessment. This study examined tools for identifying ASD in community settings using machine learning (ML) models.</div></div><div><h3>Method</h3><div>This study analyzed population-based cross-sectional studies (2005-2017) of ASD in South Korea. A community sample of 62,083 children was screened using the Autism Spectrum Screening Questionnaire (ASSQ) and teacher/caregiver referrals. Caregivers completed the Behavior Assessment System for Children–2nd Edition (BASC-2) and the Social Responsiveness Scale (SRS). Screen positives were offered a comprehensive clinical evaluation. Among the first-graders in regular elementary schools who completed the diagnostic evaluation (N = 746), supervised ML models (generalized linear model with elastic net regularization [GLMNET], classification and regression tree, random forest, and gradient boosting [GB]) were developed and validated for classification of ASD. Models were developed in the single questionnaire and combined questionnaire datasets, using questionnaire responses and demographic and developmental information.</div></div><div><h3>Results</h3><div>ASD was diagnosed in 46.2% of children (median age, 6.8 years [interquartile range, 6.5-7.1 years]; 71.7% boys). Among single questionnaire models, the BASC GB model demonstrated the best discrimination ability (area under the curve 0.80, 95% CI 0.75-0.83). Area under the curve of the GLMNET model with combined ASSQ, BASC-2, and SRS was the highest, 0.82 (95% CI 0.77-0.89); the predicted risk of ASD by the GB model of combined questionnaires agreed the best with the observed risk of ASD compared with other ML models.</div></div><div><h3>Conclusion</h3><div>Caregiver questionnaire ML models showed future promise for identifying children with ASD in community settings.</div></div><div><h3>Plain language summary</h3><div>To tackle the problem of delayed autism diagnosis, a study in South Korea used machine learning tools to identify autism spectrum disorder (ASD) from a community sample of over 62,000 children. By analyzing questionnaire responses along with developmental data, researchers developed models to classify ASD, with the best model achieving accuracy with an area under the curve (AUC) statistic of 0.82. The findings suggest that machine learning models based on caregiver questionnaires have significant potential for early identification of ASD in community settings. This could lead to more timely interventions for affected children.</div></div>","PeriodicalId":73525,"journal":{"name":"JAACAP open","volume":"3 2","pages":"Pages 302-312"},"PeriodicalIF":0.0000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAACAP open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949732924000425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Objective

Delays in autism spectrum disorder (ASD) diagnosis and treatment are significant clinical problems that can be addressed by timely, community-based assessment. This study examined tools for identifying ASD in community settings using machine learning (ML) models.

Method

This study analyzed population-based cross-sectional studies (2005-2017) of ASD in South Korea. A community sample of 62,083 children was screened using the Autism Spectrum Screening Questionnaire (ASSQ) and teacher/caregiver referrals. Caregivers completed the Behavior Assessment System for Children–2nd Edition (BASC-2) and the Social Responsiveness Scale (SRS). Screen positives were offered a comprehensive clinical evaluation. Among the first-graders in regular elementary schools who completed the diagnostic evaluation (N = 746), supervised ML models (generalized linear model with elastic net regularization [GLMNET], classification and regression tree, random forest, and gradient boosting [GB]) were developed and validated for classification of ASD. Models were developed in the single questionnaire and combined questionnaire datasets, using questionnaire responses and demographic and developmental information.

Results

ASD was diagnosed in 46.2% of children (median age, 6.8 years [interquartile range, 6.5-7.1 years]; 71.7% boys). Among single questionnaire models, the BASC GB model demonstrated the best discrimination ability (area under the curve 0.80, 95% CI 0.75-0.83). Area under the curve of the GLMNET model with combined ASSQ, BASC-2, and SRS was the highest, 0.82 (95% CI 0.77-0.89); the predicted risk of ASD by the GB model of combined questionnaires agreed the best with the observed risk of ASD compared with other ML models.

Conclusion

Caregiver questionnaire ML models showed future promise for identifying children with ASD in community settings.

Plain language summary

To tackle the problem of delayed autism diagnosis, a study in South Korea used machine learning tools to identify autism spectrum disorder (ASD) from a community sample of over 62,000 children. By analyzing questionnaire responses along with developmental data, researchers developed models to classify ASD, with the best model achieving accuracy with an area under the curve (AUC) statistic of 0.82. The findings suggest that machine learning models based on caregiver questionnaires have significant potential for early identification of ASD in community settings. This could lead to more timely interventions for affected children.
韩国普通人群自闭症谱系障碍诊断预测模型的建立与验证
目的自闭症谱系障碍(ASD)的诊断和治疗延迟是一个重要的临床问题,可以通过及时的、基于社区的评估来解决。本研究使用机器学习(ML)模型检查了在社区环境中识别ASD的工具。方法本研究分析了韩国基于人群的ASD横断面研究(2005-2017)。使用自闭症谱系筛查问卷(ASSQ)和教师/护理人员推荐对62,083名儿童进行了社区样本筛查。照顾者完成儿童行为评估系统-第二版(BASC-2)和社会反应量表(SRS)。筛查阳性者给予全面的临床评价。在完成诊断评估的普通小学一年级学生(N = 746)中,我们开发并验证了监督ML模型(弹性网络正则化广义线性模型[GLMNET]、分类与回归树、随机森林和梯度增强模型[GB])对ASD的分类。模型是在单一问卷和组合问卷数据集中开发的,使用问卷回答和人口统计和发展信息。结果46.2%的儿童被诊断为自闭症谱系障碍(年龄中位数为6.8岁[四分位数间距为6.5 ~ 7.1岁];71.7%的男孩)。在单问卷模型中,BASC GB模型的判别能力最好(曲线下面积0.80,95% CI 0.75 ~ 0.83)。结合ASSQ、BASC-2和SRS的GLMNET模型曲线下面积最大,为0.82 (95% CI 0.77 ~ 0.89);与其他ML模型相比,联合问卷的GB模型预测的ASD风险与观察到的ASD风险一致性最好。结论照顾者问卷ML模型在社区环境中识别ASD儿童具有良好的应用前景。为了解决自闭症诊断延迟的问题,韩国的一项研究使用机器学习工具从超过6.2万名儿童的社区样本中识别自闭症谱系障碍(ASD)。研究人员通过分析问卷回答和发育数据,建立了ASD分类模型,最佳模型的曲线下面积(AUC)统计量为0.82。研究结果表明,基于护理人员问卷的机器学习模型在社区环境中具有早期识别ASD的巨大潜力。这可能导致对受影响儿童进行更及时的干预。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
JAACAP open
JAACAP open Psychiatry and Mental Health
自引率
0.00%
发文量
0
审稿时长
16 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信