预测自杀行为的结果:关键因素和机器学习模型分析。

IF 3.4 2区 医学 Q2 PSYCHIATRY
Mohammad Bazrafshan, Kourosh Sayehmiri
{"title":"预测自杀行为的结果:关键因素和机器学习模型分析。","authors":"Mohammad Bazrafshan, Kourosh Sayehmiri","doi":"10.1186/s12888-024-06273-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the outcomes of suicidal behaviors can help identify individuals at higher risk of death, enabling timely and targeted interventions. This study aimed to determine the critical risk factors associated with suicidal behavior mortality and identify an effective classification model for predicting suicidal behavior outcomes.</p><p><strong>Materials and methods: </strong>This study utilized data recorded in the suicidal behavior registry system of hospitals in Ilam Province. In the first phase, duplicate records were removed, and the data was numerically encoded via Python version 3.11; then, the data was analyzed using chi-square and Fisher's exact tests in SPSS version 22 software to identify the factors influencing suicidal behavior mortality. In the second phase, missing data were removed, and the dataset was standardized. Five binary classification algorithms were utilized, including Random Forest, Logistic Regression, and Decision Trees, with hyperparameters optimized using the area under the receiver operating characteristic curve (AUC) and F1 score metrics. These models were compared based on accuracy, recall, precision, F1 score, and AUC.</p><p><strong>Results: </strong>Among 3833 cases of suicidal behavior in various hospitals in Ilam Province, the results indicated that the method of suicidal behavior (P < 0.001), reason for suicidal behavior (P < 0.001), age group (P < 0.001), education level (P < 0.001), marital status (P = 0.004), and employment status (P = 0.042) were significantly associated with suicide. Variables such as the season of suicidal behavior, gender, father's education, and mother's education were not significantly related to suicidal behavior mortality. Furthermore, the random forest model demonstrated the highest area under the ROC curve (0.79) and the highest classification accuracy and F1 score on both the training data (0.85 and 0.2, respectively) and test data (0.86 and 0.31, respectively) for predicting suicidal behaviors outcomes among the models tested.</p><p><strong>Conclusion: </strong>This study identified key factors such as older age, lower education, divorce or widowhood, employment, physical methods, and socioeconomic issues as significant predictors of suicidal behavior outcomes. A combination of statistical models for feature selection and machine learning algorithms for prediction was used, with Random Forest showing the best performance. This approach highlights the potential of integrating statistical methods with machine learning to improve suicide risk prediction and intervention strategies.</p>","PeriodicalId":9029,"journal":{"name":"BMC Psychiatry","volume":"24 1","pages":"841"},"PeriodicalIF":3.4000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11583731/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models.\",\"authors\":\"Mohammad Bazrafshan, Kourosh Sayehmiri\",\"doi\":\"10.1186/s12888-024-06273-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the outcomes of suicidal behaviors can help identify individuals at higher risk of death, enabling timely and targeted interventions. This study aimed to determine the critical risk factors associated with suicidal behavior mortality and identify an effective classification model for predicting suicidal behavior outcomes.</p><p><strong>Materials and methods: </strong>This study utilized data recorded in the suicidal behavior registry system of hospitals in Ilam Province. In the first phase, duplicate records were removed, and the data was numerically encoded via Python version 3.11; then, the data was analyzed using chi-square and Fisher's exact tests in SPSS version 22 software to identify the factors influencing suicidal behavior mortality. In the second phase, missing data were removed, and the dataset was standardized. Five binary classification algorithms were utilized, including Random Forest, Logistic Regression, and Decision Trees, with hyperparameters optimized using the area under the receiver operating characteristic curve (AUC) and F1 score metrics. These models were compared based on accuracy, recall, precision, F1 score, and AUC.</p><p><strong>Results: </strong>Among 3833 cases of suicidal behavior in various hospitals in Ilam Province, the results indicated that the method of suicidal behavior (P < 0.001), reason for suicidal behavior (P < 0.001), age group (P < 0.001), education level (P < 0.001), marital status (P = 0.004), and employment status (P = 0.042) were significantly associated with suicide. Variables such as the season of suicidal behavior, gender, father's education, and mother's education were not significantly related to suicidal behavior mortality. Furthermore, the random forest model demonstrated the highest area under the ROC curve (0.79) and the highest classification accuracy and F1 score on both the training data (0.85 and 0.2, respectively) and test data (0.86 and 0.31, respectively) for predicting suicidal behaviors outcomes among the models tested.</p><p><strong>Conclusion: </strong>This study identified key factors such as older age, lower education, divorce or widowhood, employment, physical methods, and socioeconomic issues as significant predictors of suicidal behavior outcomes. A combination of statistical models for feature selection and machine learning algorithms for prediction was used, with Random Forest showing the best performance. This approach highlights the potential of integrating statistical methods with machine learning to improve suicide risk prediction and intervention strategies.</p>\",\"PeriodicalId\":9029,\"journal\":{\"name\":\"BMC Psychiatry\",\"volume\":\"24 1\",\"pages\":\"841\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11583731/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Psychiatry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12888-024-06273-2\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12888-024-06273-2","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

摘要

背景:自杀行为可能导致死亡(自杀)或存活(自杀未遂),而自杀行为受多种因素影响。确定自杀行为致死的具体风险因素对于改进预防策略和临床干预措施至关重要。预测自杀行为的结果有助于识别死亡风险较高的个体,从而及时采取有针对性的干预措施。本研究旨在确定与自杀行为死亡率相关的关键风险因素,并确定预测自杀行为结果的有效分类模型:本研究利用伊拉姆省医院自杀行为登记系统中记录的数据。第一阶段,删除重复记录,并通过 Python 3.11 版对数据进行数字编码;然后,在 SPSS 22 版软件中使用卡方检验和费雪精确检验对数据进行分析,以确定影响自杀行为死亡率的因素。第二阶段,剔除缺失数据,并对数据集进行标准化处理。使用了五种二元分类算法,包括随机森林、逻辑回归和决策树,并使用接收者操作特征曲线下面积(AUC)和 F1 分数指标对超参数进行了优化。根据准确率、召回率、精确度、F1 分数和 AUC 对这些模型进行了比较:在伊拉姆省多家医院的 3833 个自杀行为病例中,结果表明自杀行为的方法(P 结论:该研究发现了一些关键因素,如年龄较大、自杀行为的发生率较高,而年龄较小的自杀行为发生率较低:本研究发现,年龄较大、教育程度较低、离婚或丧偶、就业、身体方法和社会经济问题等关键因素是自杀行为结果的重要预测因素。研究结合使用了用于特征选择的统计模型和用于预测的机器学习算法,其中随机森林算法表现最佳。这种方法凸显了将统计方法与机器学习相结合以改进自杀风险预测和干预策略的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models.

Background: Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the outcomes of suicidal behaviors can help identify individuals at higher risk of death, enabling timely and targeted interventions. This study aimed to determine the critical risk factors associated with suicidal behavior mortality and identify an effective classification model for predicting suicidal behavior outcomes.

Materials and methods: This study utilized data recorded in the suicidal behavior registry system of hospitals in Ilam Province. In the first phase, duplicate records were removed, and the data was numerically encoded via Python version 3.11; then, the data was analyzed using chi-square and Fisher's exact tests in SPSS version 22 software to identify the factors influencing suicidal behavior mortality. In the second phase, missing data were removed, and the dataset was standardized. Five binary classification algorithms were utilized, including Random Forest, Logistic Regression, and Decision Trees, with hyperparameters optimized using the area under the receiver operating characteristic curve (AUC) and F1 score metrics. These models were compared based on accuracy, recall, precision, F1 score, and AUC.

Results: Among 3833 cases of suicidal behavior in various hospitals in Ilam Province, the results indicated that the method of suicidal behavior (P < 0.001), reason for suicidal behavior (P < 0.001), age group (P < 0.001), education level (P < 0.001), marital status (P = 0.004), and employment status (P = 0.042) were significantly associated with suicide. Variables such as the season of suicidal behavior, gender, father's education, and mother's education were not significantly related to suicidal behavior mortality. Furthermore, the random forest model demonstrated the highest area under the ROC curve (0.79) and the highest classification accuracy and F1 score on both the training data (0.85 and 0.2, respectively) and test data (0.86 and 0.31, respectively) for predicting suicidal behaviors outcomes among the models tested.

Conclusion: This study identified key factors such as older age, lower education, divorce or widowhood, employment, physical methods, and socioeconomic issues as significant predictors of suicidal behavior outcomes. A combination of statistical models for feature selection and machine learning algorithms for prediction was used, with Random Forest showing the best performance. This approach highlights the potential of integrating statistical methods with machine learning to improve suicide risk prediction and intervention strategies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Psychiatry
BMC Psychiatry 医学-精神病学
CiteScore
5.90
自引率
4.50%
发文量
716
审稿时长
3-6 weeks
期刊介绍: BMC Psychiatry is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of psychiatric disorders, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信