Titus Hou , Daniel An , Caitlin W. Hicks , Elliott Haut , Isam W. Nasr
{"title":"使用监督机器学习和ICD10在马里兰州卫生服务成本审查委员会数据集中识别儿科创伤患者的非意外创伤。","authors":"Titus Hou , Daniel An , Caitlin W. Hicks , Elliott Haut , Isam W. Nasr","doi":"10.1016/j.chiabu.2024.107228","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Identifying non-accidental trauma (NAT) in pediatric trauma patients is challenging. We developed a machine learning model that uses demographic characteristics and ICD10 codes to detect the first diagnosis of NAT.</div></div><div><h3>Methods</h3><div>We analyzed data from the Maryland Health Services Cost Review Commission (2015–2020) for patients aged 0–19 years. Relevant ICD10 codes associated with NAT and trauma were identified. Health records preceding the patients' first trauma diagnosis were analyzed. Random forest models were built using covariates selected through penalized regularization. Models were developed for confirmed and suspected NAT. Data was divided into 80/20 split for model training and testing. We conducted analysis in R.</div></div><div><h3>Results</h3><div>We analyzed 128,351 non-NAT trauma patients, 522 confirmed NAT patients, and 2128 suspected NAT patients totaling 364,217 encounters. Variable selection identified 55 covariates for confirmed NAT and 65 for suspected NAT for model development. These covariates were primarily musculoskeletal injuries of the head and extremities. Model testing results are summarized in Table 1.</div></div><div><h3>Conclusion</h3><div>Our study uses machine learning to identify NAT within the pediatric trauma cohort. Analyzing ICD10 categories before the first traumatic diagnosis may allow for earlier detection of NAT. Additional research in building learning models with ICD10 codes is needed to better understand how clinician and billing biases may impact predictive models. Supervised machine learning can potentially augment clinical decision-making and enhance pediatric trauma care.</div></div>","PeriodicalId":51343,"journal":{"name":"Child Abuse & Neglect","volume":"160 ","pages":"Article 107228"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using supervised machine learning and ICD10 to identify non-accidental trauma in pediatric trauma patients in the Maryland Health Services Cost Review Commission dataset\",\"authors\":\"Titus Hou , Daniel An , Caitlin W. Hicks , Elliott Haut , Isam W. Nasr\",\"doi\":\"10.1016/j.chiabu.2024.107228\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Identifying non-accidental trauma (NAT) in pediatric trauma patients is challenging. We developed a machine learning model that uses demographic characteristics and ICD10 codes to detect the first diagnosis of NAT.</div></div><div><h3>Methods</h3><div>We analyzed data from the Maryland Health Services Cost Review Commission (2015–2020) for patients aged 0–19 years. Relevant ICD10 codes associated with NAT and trauma were identified. Health records preceding the patients' first trauma diagnosis were analyzed. Random forest models were built using covariates selected through penalized regularization. Models were developed for confirmed and suspected NAT. Data was divided into 80/20 split for model training and testing. We conducted analysis in R.</div></div><div><h3>Results</h3><div>We analyzed 128,351 non-NAT trauma patients, 522 confirmed NAT patients, and 2128 suspected NAT patients totaling 364,217 encounters. Variable selection identified 55 covariates for confirmed NAT and 65 for suspected NAT for model development. These covariates were primarily musculoskeletal injuries of the head and extremities. Model testing results are summarized in Table 1.</div></div><div><h3>Conclusion</h3><div>Our study uses machine learning to identify NAT within the pediatric trauma cohort. Analyzing ICD10 categories before the first traumatic diagnosis may allow for earlier detection of NAT. Additional research in building learning models with ICD10 codes is needed to better understand how clinician and billing biases may impact predictive models. Supervised machine learning can potentially augment clinical decision-making and enhance pediatric trauma care.</div></div>\",\"PeriodicalId\":51343,\"journal\":{\"name\":\"Child Abuse & Neglect\",\"volume\":\"160 \",\"pages\":\"Article 107228\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Child Abuse & Neglect\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0145213424006215\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"FAMILY STUDIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Child Abuse & Neglect","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0145213424006215","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FAMILY STUDIES","Score":null,"Total":0}
Using supervised machine learning and ICD10 to identify non-accidental trauma in pediatric trauma patients in the Maryland Health Services Cost Review Commission dataset
Background
Identifying non-accidental trauma (NAT) in pediatric trauma patients is challenging. We developed a machine learning model that uses demographic characteristics and ICD10 codes to detect the first diagnosis of NAT.
Methods
We analyzed data from the Maryland Health Services Cost Review Commission (2015–2020) for patients aged 0–19 years. Relevant ICD10 codes associated with NAT and trauma were identified. Health records preceding the patients' first trauma diagnosis were analyzed. Random forest models were built using covariates selected through penalized regularization. Models were developed for confirmed and suspected NAT. Data was divided into 80/20 split for model training and testing. We conducted analysis in R.
Results
We analyzed 128,351 non-NAT trauma patients, 522 confirmed NAT patients, and 2128 suspected NAT patients totaling 364,217 encounters. Variable selection identified 55 covariates for confirmed NAT and 65 for suspected NAT for model development. These covariates were primarily musculoskeletal injuries of the head and extremities. Model testing results are summarized in Table 1.
Conclusion
Our study uses machine learning to identify NAT within the pediatric trauma cohort. Analyzing ICD10 categories before the first traumatic diagnosis may allow for earlier detection of NAT. Additional research in building learning models with ICD10 codes is needed to better understand how clinician and billing biases may impact predictive models. Supervised machine learning can potentially augment clinical decision-making and enhance pediatric trauma care.
期刊介绍:
Official Publication of the International Society for Prevention of Child Abuse and Neglect. Child Abuse & Neglect The International Journal, provides an international, multidisciplinary forum on all aspects of child abuse and neglect, with special emphasis on prevention and treatment; the scope extends further to all those aspects of life which either favor or hinder child development. While contributions will primarily be from the fields of psychology, psychiatry, social work, medicine, nursing, law enforcement, legislature, education, and anthropology, the Journal encourages the concerned lay individual and child-oriented advocate organizations to contribute.