{"title":"Prediction of non-suicidal self-injury (NSSI) among rural Chinese junior high school students: a machine learning approach.","authors":"Zhongliang Jiang, Yonghua Cui, Hui Xu, Cody Abbey, Wenjian Xu, Weitong Guo, Dongdong Zhang, Jintong Liu, Jingwen Jin, Ying Li","doi":"10.1186/s12991-024-00534-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Non-suicidal self-injury (NSSI) is a serious issue that is increasingly prevalent among children and adolescents, especially in rural areas. Developing a suitable predictive model for NSSI is crucial for early identification and intervention.</p><p><strong>Methods: </strong>This study included 2090 Chinese rural children and adolescents. Participants' sociodemographic information, symptoms of anxiety as well as depression, personality traits, family environment and NSSI behaviors were collected through a questionnaire survey. Gender, age, grade, and all survey results except sociodemographic information were used as relevant factors for prediction. Support vector machines, decision tree and random forest models were trained and validated by the train set and valid set, respectively. The metrics of each model were tested and compared to select the most suitable one. Furthermore, the mean decrease Gini index was calculated to measure the importance of relevant factors.</p><p><strong>Results: </strong>The prevalence of NSSI was 38.3%. Out of the 6 models assessed, the random forest model demonstrated the highest suitability in predicting the prevalence of NSSI. It achieved sensitivity, specificity, AUC, accuracy, precision, and F1 scores of 0.65, 0.72, 0.76, 0.70, 0.57, and 0.61, respectively. Anxiety and depression were the top two contributing factors in the prediction model. Neuroticism and conflict were the factors that contributed the most to personality traits and family environment, respectively, in terms of prediction. In addition, demographic factors contributed little to the prediction in this study.</p><p><strong>Conclusion: </strong>This study focused on Chinese children and adolescents in rural areas and demonstrated the potential of using machine learning approaches in predicting NSSI. Our research complements the application of machine learning methods to psychiatric and psychological problems.</p>","PeriodicalId":7942,"journal":{"name":"Annals of General Psychiatry","volume":"23 1","pages":"48"},"PeriodicalIF":3.6000,"publicationDate":"2024-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622475/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of General Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12991-024-00534-w","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Aims: Non-suicidal self-injury (NSSI) is a serious issue that is increasingly prevalent among children and adolescents, especially in rural areas. Developing a suitable predictive model for NSSI is crucial for early identification and intervention.
Methods: This study included 2090 Chinese rural children and adolescents. Participants' sociodemographic information, symptoms of anxiety as well as depression, personality traits, family environment and NSSI behaviors were collected through a questionnaire survey. Gender, age, grade, and all survey results except sociodemographic information were used as relevant factors for prediction. Support vector machines, decision tree and random forest models were trained and validated by the train set and valid set, respectively. The metrics of each model were tested and compared to select the most suitable one. Furthermore, the mean decrease Gini index was calculated to measure the importance of relevant factors.
Results: The prevalence of NSSI was 38.3%. Out of the 6 models assessed, the random forest model demonstrated the highest suitability in predicting the prevalence of NSSI. It achieved sensitivity, specificity, AUC, accuracy, precision, and F1 scores of 0.65, 0.72, 0.76, 0.70, 0.57, and 0.61, respectively. Anxiety and depression were the top two contributing factors in the prediction model. Neuroticism and conflict were the factors that contributed the most to personality traits and family environment, respectively, in terms of prediction. In addition, demographic factors contributed little to the prediction in this study.
Conclusion: This study focused on Chinese children and adolescents in rural areas and demonstrated the potential of using machine learning approaches in predicting NSSI. Our research complements the application of machine learning methods to psychiatric and psychological problems.
期刊介绍:
Annals of General Psychiatry considers manuscripts on all aspects of psychiatry, including neuroscience and psychological medicine. Both basic and clinical neuroscience contributions are encouraged.
Annals of General Psychiatry emphasizes a biopsychosocial approach to illness and health and strongly supports and follows the principles of evidence-based medicine. As an open access journal, Annals of General Psychiatry facilitates the worldwide distribution of high quality psychiatry and mental health research. The journal considers submissions on a wide range of topics including, but not limited to, psychopharmacology, forensic psychiatry, psychotic disorders, psychiatric genetics, and mood and anxiety disorders.