{"title":"Assessment of EMR ML Mining Methods for Measuring Association between Metal Mixture and Mortality for Hypertension.","authors":"Site Xu, Mu Sun","doi":"10.1007/s40292-024-00666-w","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>There are limited data available regarding the connection between heavy metal exposure and mortality among hypertension patients.</p><p><strong>Aim: </strong>We intend to establish an interpretable machine learning (ML) model with high efficiency and robustness that monitors mortality based on heavy metal exposure among hypertension patients.</p><p><strong>Methods: </strong>Our datasets were obtained from the US National Health and Nutrition Examination Survey (NHANES, 2013-2018). We developed 5 ML models for mortality prediction among hypertension patients by heavy metal exposure, and tested them by 10 discrimination characteristics. Further, we chose the optimally performing model after parameter adjustment by genetic algorithm (GA) for prediction. Finally, in order to visualize the model's ability to make decisions, we used SHapley Additive exPlanation (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME) algorithm to illustrate the features. The study included 2347 participants in total.</p><p><strong>Results: </strong>A best-performing eXtreme Gradient Boosting (XGB) with GA for mortality prediction among hypertension patients by 13 heavy metals was selected (AUC 0.959; 95% CI 0.953-0.965; accuracy 96.8%). According to sum of SHAP values, cadmium (0.094), cobalt (2.048), lead (1.12), tungsten (0.129) in urine, and lead (2.026), mercury (1.703) in blood positively influenced the model, while barium (- 0.001), molybdenum (- 2.066), antimony (- 0.398), tin (- 0.498), thallium (- 2.297) in urine, and selenium (- 0.842), manganese (- 1.193) in blood negatively influenced the model.</p><p><strong>Conclusions: </strong>Hypertension patients' mortality associated with heavy metal exposure was predicted by an efficient, robust, and interpretable GA-XGB model with SHAP and LIME. Cadmium, cobalt, lead, tungsten in urine, and mercury in blood are positively correlated with mortality, while barium, molybdenum, antimony, tin, thallium in urine, and lead, selenium, manganese in blood is negatively correlated with mortality.</p>","PeriodicalId":12890,"journal":{"name":"High Blood Pressure & Cardiovascular Prevention","volume":null,"pages":null},"PeriodicalIF":3.1000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11485017/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"High Blood Pressure & Cardiovascular Prevention","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s40292-024-00666-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"PERIPHERAL VASCULAR DISEASE","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: There are limited data available regarding the connection between heavy metal exposure and mortality among hypertension patients.
Aim: We intend to establish an interpretable machine learning (ML) model with high efficiency and robustness that monitors mortality based on heavy metal exposure among hypertension patients.
Methods: Our datasets were obtained from the US National Health and Nutrition Examination Survey (NHANES, 2013-2018). We developed 5 ML models for mortality prediction among hypertension patients by heavy metal exposure, and tested them by 10 discrimination characteristics. Further, we chose the optimally performing model after parameter adjustment by genetic algorithm (GA) for prediction. Finally, in order to visualize the model's ability to make decisions, we used SHapley Additive exPlanation (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME) algorithm to illustrate the features. The study included 2347 participants in total.
Results: A best-performing eXtreme Gradient Boosting (XGB) with GA for mortality prediction among hypertension patients by 13 heavy metals was selected (AUC 0.959; 95% CI 0.953-0.965; accuracy 96.8%). According to sum of SHAP values, cadmium (0.094), cobalt (2.048), lead (1.12), tungsten (0.129) in urine, and lead (2.026), mercury (1.703) in blood positively influenced the model, while barium (- 0.001), molybdenum (- 2.066), antimony (- 0.398), tin (- 0.498), thallium (- 2.297) in urine, and selenium (- 0.842), manganese (- 1.193) in blood negatively influenced the model.
Conclusions: Hypertension patients' mortality associated with heavy metal exposure was predicted by an efficient, robust, and interpretable GA-XGB model with SHAP and LIME. Cadmium, cobalt, lead, tungsten in urine, and mercury in blood are positively correlated with mortality, while barium, molybdenum, antimony, tin, thallium in urine, and lead, selenium, manganese in blood is negatively correlated with mortality.
期刊介绍:
High Blood Pressure & Cardiovascular Prevention promotes knowledge, update and discussion in the field of hypertension and cardiovascular disease prevention, by providing a regular programme of independent review articles covering key aspects of the management of hypertension and cardiovascular diseases. The journal includes: Invited ''State of the Art'' reviews. Expert commentaries on guidelines, major trials, technical advances.Presentation of new intervention trials design.''Pros and Cons'' or round tables on controversial issues.Statements on guidelines from hypertension and cardiovascular scientific societies.Socio-economic issues.Cost/benefit in prevention of cardiovascular diseases.Monitoring of healthcare systems.News and views from the Italian Society of Hypertension (including abstracts).All manuscripts are subject to peer review by international experts. Letters to the editor are welcomed and will be considered for publication.