各种重金属暴露对非糖尿病人群胰岛素抵抗的影响:从机器学习建模的角度分析可解释性。

IF 3.4 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Biological Trace Element Research Pub Date : 2024-12-01 Epub Date: 2024-02-26 DOI:10.1007/s12011-024-04126-3
Jun Liu, Xingyu Li, Peng Zhu
{"title":"各种重金属暴露对非糖尿病人群胰岛素抵抗的影响:从机器学习建模的角度分析可解释性。","authors":"Jun Liu, Xingyu Li, Peng Zhu","doi":"10.1007/s12011-024-04126-3","DOIUrl":null,"url":null,"abstract":"<p><p>Increasing and compelling evidence has been proved that heavy metal exposure is involved in the development of insulin resistance (IR). We trained an interpretable predictive machine learning (ML) model for IR in the non-diabetic populations based on levels of heavy metal exposure. A total of 4354 participants from the NHANES (2003-2020) with complete information were randomly divided into a training set and a test set. Twelve ML algorithms, including random forest (RF), XGBoost (XGB), logistic regression (LR), GaussianNB (GNB), ridge regression (RR), support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), AdaBoost (AB), Gradient Boosting Decision Tree (GBDT), Voting Classifier (VC), and K-Nearest Neighbour (KNN), were constructed for IR prediction using the training set. Among these models, the RF algorithm had the best predictive performance, showing an accuracy of 80.14%, an AUC of 0.856, and an F1 score of 0.74 in the test set. We embedded three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) in RF model for model interpretation. Urinary Ba, urinary Mo, blood Pb, and blood Cd levels were identified as the main influencers of IR. Within a specific range, urinary Ba (0.56-3.56 µg/L) and urinary Mo (1.06-20.25 µg/L) levels exhibited the most pronounced upwards trend with the risk of IR, while blood Pb (0.05-2.81 µg/dL) and blood Cd (0.24-0.65 µg/L) levels showed a declining trend with IR. The findings on the synergistic effects demonstrated that controlling urinary Ba levels might be more crucial for the management of IR. The SHAP decision plot offered personalized care for IR based on heavy metal control. In conclusion, by utilizing interpretable ML approaches, we emphasize the predictive value of heavy metals for IR, especially Ba, Mo, Pb, and Cd.</p>","PeriodicalId":8917,"journal":{"name":"Biological Trace Element Research","volume":" ","pages":"5438-5452"},"PeriodicalIF":3.4000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effects of Various Heavy Metal Exposures on Insulin Resistance in Non-diabetic Populations: Interpretability Analysis from Machine Learning Modeling Perspective.\",\"authors\":\"Jun Liu, Xingyu Li, Peng Zhu\",\"doi\":\"10.1007/s12011-024-04126-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Increasing and compelling evidence has been proved that heavy metal exposure is involved in the development of insulin resistance (IR). We trained an interpretable predictive machine learning (ML) model for IR in the non-diabetic populations based on levels of heavy metal exposure. A total of 4354 participants from the NHANES (2003-2020) with complete information were randomly divided into a training set and a test set. Twelve ML algorithms, including random forest (RF), XGBoost (XGB), logistic regression (LR), GaussianNB (GNB), ridge regression (RR), support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), AdaBoost (AB), Gradient Boosting Decision Tree (GBDT), Voting Classifier (VC), and K-Nearest Neighbour (KNN), were constructed for IR prediction using the training set. Among these models, the RF algorithm had the best predictive performance, showing an accuracy of 80.14%, an AUC of 0.856, and an F1 score of 0.74 in the test set. We embedded three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) in RF model for model interpretation. Urinary Ba, urinary Mo, blood Pb, and blood Cd levels were identified as the main influencers of IR. Within a specific range, urinary Ba (0.56-3.56 µg/L) and urinary Mo (1.06-20.25 µg/L) levels exhibited the most pronounced upwards trend with the risk of IR, while blood Pb (0.05-2.81 µg/dL) and blood Cd (0.24-0.65 µg/L) levels showed a declining trend with IR. The findings on the synergistic effects demonstrated that controlling urinary Ba levels might be more crucial for the management of IR. The SHAP decision plot offered personalized care for IR based on heavy metal control. In conclusion, by utilizing interpretable ML approaches, we emphasize the predictive value of heavy metals for IR, especially Ba, Mo, Pb, and Cd.</p>\",\"PeriodicalId\":8917,\"journal\":{\"name\":\"Biological Trace Element Research\",\"volume\":\" \",\"pages\":\"5438-5452\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biological Trace Element Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s12011-024-04126-3\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/2/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biological Trace Element Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s12011-024-04126-3","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

越来越多令人信服的证据证明,重金属暴露与胰岛素抵抗(IR)的形成有关。我们根据重金属暴露水平,在非糖尿病人群中训练了一个可解释的胰岛素抵抗预测机器学习(ML)模型。我们将 NHANES(2003-2020 年)中信息完整的 4354 名参与者随机分为训练集和测试集。利用训练集构建了 12 种 ML 算法,包括随机森林 (RF)、XGBoost (XGB)、逻辑回归 (LR)、GaussianNB (GNB)、脊回归 (RR)、支持向量机 (SVM)、多层感知器 (MLP)、决策树 (DT)、AdaBoost (AB)、梯度提升决策树 (GBDT)、投票分类器 (VC) 和 K-Nearest Neighbour (KNN),用于 IR 预测。在这些模型中,RF 算法的预测性能最好,在测试集中的准确率为 80.14%,AUC 为 0.856,F1 得分为 0.74。我们在 RF 模型中嵌入了三种可解释的方法,即排列特征重要性分析、部分依赖图(PDP)和夏普利加法解释(SHAP),用于模型解释。尿钡、尿钼、血铅和血镉水平被确定为影响 IR 的主要因素。在特定范围内,尿钡(0.56-3.56 µg/L)和尿钼(1.06-20.25 µg/L)水平与 IR 风险呈最明显的上升趋势,而血铅(0.05-2.81 µg/dL)和血镉(0.24-0.65 µg/L)水平与 IR 呈下降趋势。协同效应的研究结果表明,控制尿钡水平可能对IR的治疗更为重要。SHAP决策图根据重金属控制情况为IR提供个性化治疗。总之,通过利用可解释的多变量方法,我们强调了重金属对 IR 的预测价值,尤其是钡、钼、铅和镉。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Effects of Various Heavy Metal Exposures on Insulin Resistance in Non-diabetic Populations: Interpretability Analysis from Machine Learning Modeling Perspective.

Increasing and compelling evidence has been proved that heavy metal exposure is involved in the development of insulin resistance (IR). We trained an interpretable predictive machine learning (ML) model for IR in the non-diabetic populations based on levels of heavy metal exposure. A total of 4354 participants from the NHANES (2003-2020) with complete information were randomly divided into a training set and a test set. Twelve ML algorithms, including random forest (RF), XGBoost (XGB), logistic regression (LR), GaussianNB (GNB), ridge regression (RR), support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), AdaBoost (AB), Gradient Boosting Decision Tree (GBDT), Voting Classifier (VC), and K-Nearest Neighbour (KNN), were constructed for IR prediction using the training set. Among these models, the RF algorithm had the best predictive performance, showing an accuracy of 80.14%, an AUC of 0.856, and an F1 score of 0.74 in the test set. We embedded three interpretable methods, the permutation feature importance analysis, partial dependence plot (PDP), and Shapley additive explanations (SHAP) in RF model for model interpretation. Urinary Ba, urinary Mo, blood Pb, and blood Cd levels were identified as the main influencers of IR. Within a specific range, urinary Ba (0.56-3.56 µg/L) and urinary Mo (1.06-20.25 µg/L) levels exhibited the most pronounced upwards trend with the risk of IR, while blood Pb (0.05-2.81 µg/dL) and blood Cd (0.24-0.65 µg/L) levels showed a declining trend with IR. The findings on the synergistic effects demonstrated that controlling urinary Ba levels might be more crucial for the management of IR. The SHAP decision plot offered personalized care for IR based on heavy metal control. In conclusion, by utilizing interpretable ML approaches, we emphasize the predictive value of heavy metals for IR, especially Ba, Mo, Pb, and Cd.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biological Trace Element Research
Biological Trace Element Research 生物-内分泌学与代谢
CiteScore
8.70
自引率
10.30%
发文量
459
审稿时长
2 months
期刊介绍: Biological Trace Element Research provides a much-needed central forum for the emergent, interdisciplinary field of research on the biological, environmental, and biomedical roles of trace elements. Rather than confine itself to biochemistry, the journal emphasizes the integrative aspects of trace metal research in all appropriate fields, publishing human and animal nutritional studies devoted to the fundamental chemistry and biochemistry at issue as well as to the elucidation of the relevant aspects of preventive medicine, epidemiology, clinical chemistry, agriculture, endocrinology, animal science, pharmacology, microbiology, toxicology, virology, marine biology, sensory physiology, developmental biology, and related fields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信