钢铁工人高血压风险预测模型研究

Q3 Medicine
K Y Guo, Y X Zhu, Y X Zhang, C Yang, H Zhao, Y L Jin
{"title":"钢铁工人高血压风险预测模型研究","authors":"K Y Guo, Y X Zhu, Y X Zhang, C Yang, H Zhao, Y L Jin","doi":"10.3760/cma.j.cn121094-20240517-00223","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> To identify risk factors influencing the incidence of hypertension among steelworkers (Homo sapiens) and establish an effective and easily implementable hypertension prediction model. <b>Methods:</b> In September 2023, 2214 steelworkers (Homo sapiens) were selected as study subjects. Basic demographic information, lifestyle, and occupational exposure data were collected, along with physiological measurements such as height, weight, and blood pressure. Multivariate unconditional logistic regression analysis was employed based on relevant literature to determine influencing factors for hypertension among steelworkers (Homo sapiens). Python 3.9 software was used to construct and compare logistic regression, support vector machine (SVM), random forest, extreme gradient boosting tree (XGBoost), and LGBM models. Model performance was evaluated using metrics such as receiver operating characteristic (ROC) curves, accuracy, calibration curves, and F1 scores. The Shapley Additive Explanations (SHAP) model was introduced for feature importance analysis to enhance the interpretability of the prediction model. <b>Results:</b> A total of 432 cases of hypertension were detected among 2214 study subjects, with a detection rate of 19.51%. Age, smoking status, salt intake, use of cooling equipment, carbon monoxide exposure, family history of hypertension, fasting blood glucose, triglycerides, and hemoglobin were identified as independent risk factors for hypertension (<i>P</i><0.05). A comparison of the five models revealed the following performance metrics: logistic regression achieved an accuracy of 0.853, F1 score of 0.680, Brier score of 0.108, and AUC of 0.907; SVM demonstrated an accuracy of 0.863, F1 score of 0.687, Brier score of 0.081, and AUC of 0.910; random forest showed an accuracy of 0.857, F1 score of 0.603, Brier score of 0.105, and AUC of 0.861; XGBoost yielded an accuracy of 0.850, F1 score of 0.684, Brier score of 0.117, and AUC of 0.899; and the LGBM model exhibited an accuracy of 0.838, F1 score of 0.625, Brier score of 0.112, and AUC of 0.870. <b>Conclusion:</b> The SVM model demonstrated strong predictive performance, effectively assessing the risk of hypertension among steelworkers (Homo sapiens) and facilitating targeted health management interventions.</p>","PeriodicalId":23958,"journal":{"name":"中华劳动卫生职业病杂志","volume":"43 8","pages":"573-579"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Study on risk prediction model of hypertension in steel workers].\",\"authors\":\"K Y Guo, Y X Zhu, Y X Zhang, C Yang, H Zhao, Y L Jin\",\"doi\":\"10.3760/cma.j.cn121094-20240517-00223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><b>Objective:</b> To identify risk factors influencing the incidence of hypertension among steelworkers (Homo sapiens) and establish an effective and easily implementable hypertension prediction model. <b>Methods:</b> In September 2023, 2214 steelworkers (Homo sapiens) were selected as study subjects. Basic demographic information, lifestyle, and occupational exposure data were collected, along with physiological measurements such as height, weight, and blood pressure. Multivariate unconditional logistic regression analysis was employed based on relevant literature to determine influencing factors for hypertension among steelworkers (Homo sapiens). Python 3.9 software was used to construct and compare logistic regression, support vector machine (SVM), random forest, extreme gradient boosting tree (XGBoost), and LGBM models. Model performance was evaluated using metrics such as receiver operating characteristic (ROC) curves, accuracy, calibration curves, and F1 scores. The Shapley Additive Explanations (SHAP) model was introduced for feature importance analysis to enhance the interpretability of the prediction model. <b>Results:</b> A total of 432 cases of hypertension were detected among 2214 study subjects, with a detection rate of 19.51%. Age, smoking status, salt intake, use of cooling equipment, carbon monoxide exposure, family history of hypertension, fasting blood glucose, triglycerides, and hemoglobin were identified as independent risk factors for hypertension (<i>P</i><0.05). A comparison of the five models revealed the following performance metrics: logistic regression achieved an accuracy of 0.853, F1 score of 0.680, Brier score of 0.108, and AUC of 0.907; SVM demonstrated an accuracy of 0.863, F1 score of 0.687, Brier score of 0.081, and AUC of 0.910; random forest showed an accuracy of 0.857, F1 score of 0.603, Brier score of 0.105, and AUC of 0.861; XGBoost yielded an accuracy of 0.850, F1 score of 0.684, Brier score of 0.117, and AUC of 0.899; and the LGBM model exhibited an accuracy of 0.838, F1 score of 0.625, Brier score of 0.112, and AUC of 0.870. <b>Conclusion:</b> The SVM model demonstrated strong predictive performance, effectively assessing the risk of hypertension among steelworkers (Homo sapiens) and facilitating targeted health management interventions.</p>\",\"PeriodicalId\":23958,\"journal\":{\"name\":\"中华劳动卫生职业病杂志\",\"volume\":\"43 8\",\"pages\":\"573-579\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"中华劳动卫生职业病杂志\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3760/cma.j.cn121094-20240517-00223\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华劳动卫生职业病杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn121094-20240517-00223","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

目的:探讨影响钢铁工人(智人)高血压发病的危险因素,建立有效且易于实现的高血压预测模型。方法:于2023年9月选取2214名钢铁工人(智人)作为研究对象。收集了基本的人口统计信息、生活方式和职业暴露数据,以及生理测量数据,如身高、体重和血压。基于相关文献,采用多因素无条件logistic回归分析确定钢铁工人(智人)高血压的影响因素。使用Python 3.9软件构建逻辑回归、支持向量机(SVM)、随机森林、极端梯度增强树(XGBoost)和LGBM模型并进行比较。使用受试者工作特征(ROC)曲线、准确性、校准曲线和F1分数等指标评估模型的性能。引入Shapley加性解释(SHAP)模型进行特征重要性分析,提高预测模型的可解释性。结果:2214名研究对象共检出高血压432例,检出率为19.51%。年龄、吸烟状况、盐摄入量、使用冷却设备、一氧化碳暴露、高血压家族史、空腹血糖、甘油三酯和血红蛋白被确定为高血压的独立危险因素(结论:支持向量机模型具有较强的预测性能,可以有效地评估钢铁工人(智人)的高血压风险,促进有针对性的健康管理干预。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
[Study on risk prediction model of hypertension in steel workers].

Objective: To identify risk factors influencing the incidence of hypertension among steelworkers (Homo sapiens) and establish an effective and easily implementable hypertension prediction model. Methods: In September 2023, 2214 steelworkers (Homo sapiens) were selected as study subjects. Basic demographic information, lifestyle, and occupational exposure data were collected, along with physiological measurements such as height, weight, and blood pressure. Multivariate unconditional logistic regression analysis was employed based on relevant literature to determine influencing factors for hypertension among steelworkers (Homo sapiens). Python 3.9 software was used to construct and compare logistic regression, support vector machine (SVM), random forest, extreme gradient boosting tree (XGBoost), and LGBM models. Model performance was evaluated using metrics such as receiver operating characteristic (ROC) curves, accuracy, calibration curves, and F1 scores. The Shapley Additive Explanations (SHAP) model was introduced for feature importance analysis to enhance the interpretability of the prediction model. Results: A total of 432 cases of hypertension were detected among 2214 study subjects, with a detection rate of 19.51%. Age, smoking status, salt intake, use of cooling equipment, carbon monoxide exposure, family history of hypertension, fasting blood glucose, triglycerides, and hemoglobin were identified as independent risk factors for hypertension (P<0.05). A comparison of the five models revealed the following performance metrics: logistic regression achieved an accuracy of 0.853, F1 score of 0.680, Brier score of 0.108, and AUC of 0.907; SVM demonstrated an accuracy of 0.863, F1 score of 0.687, Brier score of 0.081, and AUC of 0.910; random forest showed an accuracy of 0.857, F1 score of 0.603, Brier score of 0.105, and AUC of 0.861; XGBoost yielded an accuracy of 0.850, F1 score of 0.684, Brier score of 0.117, and AUC of 0.899; and the LGBM model exhibited an accuracy of 0.838, F1 score of 0.625, Brier score of 0.112, and AUC of 0.870. Conclusion: The SVM model demonstrated strong predictive performance, effectively assessing the risk of hypertension among steelworkers (Homo sapiens) and facilitating targeted health management interventions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
中华劳动卫生职业病杂志
中华劳动卫生职业病杂志 Medicine-Medicine (all)
CiteScore
1.00
自引率
0.00%
发文量
9764
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信