{"title":"各种重金属暴露对非糖尿病人群胰岛素抵抗的影响:从机器学习建模的角度分析可解释性。","authors":"Jun Liu, Xingyu Li, Peng Zhu","doi":"10.1007/s12011-024-04126-3","DOIUrl":null,"url":null,"abstract":"<p><p>Increasing and compelling evidence has been proved that heavy metal exposure is involved in the development of insulin resistance (IR). We trained an interpretable predictive machine learning (ML) model for IR in the non-diabetic populations based on levels of heavy metal exposure. A total of 4354 participants from the NHANES (2003-2020) with complete information were randomly divided into a training set and a test set. Twelve ML algorithms, including random forest (RF), XGBoost (XGB), logistic regression (LR), GaussianNB (GNB), ridge regression (RR), support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), AdaBoost (AB), Gradient Boosting Decision Tree (GBDT), Voting Classifier (VC), and K-Nearest Neighbour (KNN), were constructed for IR prediction using the training set. Among these models, the RF algorithm had the best predictive performance, showing an accuracy of 80.14%, an AUC of 0.856, and an F1 score of 0.74 in the test set. We embedded three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) in RF model for model interpretation. Urinary Ba, urinary Mo, blood Pb, and blood Cd levels were identified as the main influencers of IR. Within a specific range, urinary Ba (0.56-3.56 µg/L) and urinary Mo (1.06-20.25 µg/L) levels exhibited the most pronounced upwards trend with the risk of IR, while blood Pb (0.05-2.81 µg/dL) and blood Cd (0.24-0.65 µg/L) levels showed a declining trend with IR. The findings on the synergistic effects demonstrated that controlling urinary Ba levels might be more crucial for the management of IR. The SHAP decision plot offered personalized care for IR based on heavy metal control. In conclusion, by utilizing interpretable ML approaches, we emphasize the predictive value of heavy metals for IR, especially Ba, Mo, Pb, and Cd.</p>","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effects of Various Heavy Metal Exposures on Insulin Resistance in Non-diabetic Populations: Interpretability Analysis from Machine Learning Modeling Perspective.\",\"authors\":\"Jun Liu, Xingyu Li, Peng Zhu\",\"doi\":\"10.1007/s12011-024-04126-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Increasing and compelling evidence has been proved that heavy metal exposure is involved in the development of insulin resistance (IR). We trained an interpretable predictive machine learning (ML) model for IR in the non-diabetic populations based on levels of heavy metal exposure. A total of 4354 participants from the NHANES (2003-2020) with complete information were randomly divided into a training set and a test set. Twelve ML algorithms, including random forest (RF), XGBoost (XGB), logistic regression (LR), GaussianNB (GNB), ridge regression (RR), support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), AdaBoost (AB), Gradient Boosting Decision Tree (GBDT), Voting Classifier (VC), and K-Nearest Neighbour (KNN), were constructed for IR prediction using the training set. Among these models, the RF algorithm had the best predictive performance, showing an accuracy of 80.14%, an AUC of 0.856, and an F1 score of 0.74 in the test set. We embedded three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) in RF model for model interpretation. Urinary Ba, urinary Mo, blood Pb, and blood Cd levels were identified as the main influencers of IR. Within a specific range, urinary Ba (0.56-3.56 µg/L) and urinary Mo (1.06-20.25 µg/L) levels exhibited the most pronounced upwards trend with the risk of IR, while blood Pb (0.05-2.81 µg/dL) and blood Cd (0.24-0.65 µg/L) levels showed a declining trend with IR. The findings on the synergistic effects demonstrated that controlling urinary Ba levels might be more crucial for the management of IR. The SHAP decision plot offered personalized care for IR based on heavy metal control. In conclusion, by utilizing interpretable ML approaches, we emphasize the predictive value of heavy metals for IR, especially Ba, Mo, Pb, and Cd.</p>\",\"PeriodicalId\":3,\"journal\":{\"name\":\"ACS Applied Electronic Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Electronic Materials\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s12011-024-04126-3\",\"RegionNum\":3,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s12011-024-04126-3","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
摘要
越来越多令人信服的证据证明,重金属暴露与胰岛素抵抗(IR)的形成有关。我们根据重金属暴露水平,在非糖尿病人群中训练了一个可解释的胰岛素抵抗预测机器学习(ML)模型。我们将 NHANES(2003-2020 年)中信息完整的 4354 名参与者随机分为训练集和测试集。利用训练集构建了 12 种 ML 算法,包括随机森林 (RF)、XGBoost (XGB)、逻辑回归 (LR)、GaussianNB (GNB)、脊回归 (RR)、支持向量机 (SVM)、多层感知器 (MLP)、决策树 (DT)、AdaBoost (AB)、梯度提升决策树 (GBDT)、投票分类器 (VC) 和 K-Nearest Neighbour (KNN),用于 IR 预测。在这些模型中,RF 算法的预测性能最好,在测试集中的准确率为 80.14%,AUC 为 0.856,F1 得分为 0.74。我们在 RF 模型中嵌入了三种可解释的方法,即排列特征重要性分析、部分依赖图(PDP)和夏普利加法解释(SHAP),用于模型解释。尿钡、尿钼、血铅和血镉水平被确定为影响 IR 的主要因素。在特定范围内,尿钡(0.56-3.56 µg/L)和尿钼(1.06-20.25 µg/L)水平与 IR 风险呈最明显的上升趋势,而血铅(0.05-2.81 µg/dL)和血镉(0.24-0.65 µg/L)水平与 IR 呈下降趋势。协同效应的研究结果表明,控制尿钡水平可能对IR的治疗更为重要。SHAP决策图根据重金属控制情况为IR提供个性化治疗。总之,通过利用可解释的多变量方法,我们强调了重金属对 IR 的预测价值,尤其是钡、钼、铅和镉。
Effects of Various Heavy Metal Exposures on Insulin Resistance in Non-diabetic Populations: Interpretability Analysis from Machine Learning Modeling Perspective.
Increasing and compelling evidence has been proved that heavy metal exposure is involved in the development of insulin resistance (IR). We trained an interpretable predictive machine learning (ML) model for IR in the non-diabetic populations based on levels of heavy metal exposure. A total of 4354 participants from the NHANES (2003-2020) with complete information were randomly divided into a training set and a test set. Twelve ML algorithms, including random forest (RF), XGBoost (XGB), logistic regression (LR), GaussianNB (GNB), ridge regression (RR), support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), AdaBoost (AB), Gradient Boosting Decision Tree (GBDT), Voting Classifier (VC), and K-Nearest Neighbour (KNN), were constructed for IR prediction using the training set. Among these models, the RF algorithm had the best predictive performance, showing an accuracy of 80.14%, an AUC of 0.856, and an F1 score of 0.74 in the test set. We embedded three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) in RF model for model interpretation. Urinary Ba, urinary Mo, blood Pb, and blood Cd levels were identified as the main influencers of IR. Within a specific range, urinary Ba (0.56-3.56 µg/L) and urinary Mo (1.06-20.25 µg/L) levels exhibited the most pronounced upwards trend with the risk of IR, while blood Pb (0.05-2.81 µg/dL) and blood Cd (0.24-0.65 µg/L) levels showed a declining trend with IR. The findings on the synergistic effects demonstrated that controlling urinary Ba levels might be more crucial for the management of IR. The SHAP decision plot offered personalized care for IR based on heavy metal control. In conclusion, by utilizing interpretable ML approaches, we emphasize the predictive value of heavy metals for IR, especially Ba, Mo, Pb, and Cd.