开发一种可解释的机器学习模型,用于轻松检测乳腺癌幸存者中的胰岛素抵抗:一项横断面研究。

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Mengxia Fu, Zhiming Peng, Xue Yu, Dapeng Lv, Min Wu
{"title":"开发一种可解释的机器学习模型,用于轻松检测乳腺癌幸存者中的胰岛素抵抗:一项横断面研究。","authors":"Mengxia Fu, Zhiming Peng, Xue Yu, Dapeng Lv, Min Wu","doi":"10.1186/s12911-025-03189-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop and validate a classification model for insulin resistance in female individuals who have survived breast cancer using easily obtainable clinical and demographic features.</p><p><strong>Methods: </strong>Data were obtained from the U.S. National Health and Nutrition Examination Survey (NHANES) spanning 1999 to March 2020. A total of 340 female individuals who have survived breast cancer were included, and participants were randomly assigned to a training set (n = 239) and a testing set (n = 101). Multiple machine learning algorithms were trained, including Logistic Regression, Random Forest, and Support Vector Machine. Model performance was evaluated using area under the receiver operating characteristic curve (AUC) and decision curve analysis (DCA).</p><p><strong>Results: </strong>All models demonstrated strong classification performance in the testing set, with AUC values exceeding 0.87. Among them, the Random Forest and Support Vector Machine models showed superior performance in DCA. Of the seven input features-body mass index, fasting blood glucose, triglyceride, HDL cholesterol, poverty income ratio, race, and education-fasting blood glucose had the highest positive feature importance for classifying insulin resistance.</p><p><strong>Conclusions: </strong>This study demonstrates the feasibility of using machine learning algorithms to accurately predict insulin resistance in individuals who have survived breast cancer with a limited set of clinical and demographic variables. The Random Forest and Support Vector Machine models, in particular, offer strong classification performance and may support clinicians in early identification and management of insulin resistance among individuals in this high-risk population.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"341"},"PeriodicalIF":3.8000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465519/pdf/","citationCount":"0","resultStr":"{\"title\":\"Developing an interpretable machine learning model for easily detecting insulin resistance among breast cancer survivors: a cross-sectional study.\",\"authors\":\"Mengxia Fu, Zhiming Peng, Xue Yu, Dapeng Lv, Min Wu\",\"doi\":\"10.1186/s12911-025-03189-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To develop and validate a classification model for insulin resistance in female individuals who have survived breast cancer using easily obtainable clinical and demographic features.</p><p><strong>Methods: </strong>Data were obtained from the U.S. National Health and Nutrition Examination Survey (NHANES) spanning 1999 to March 2020. A total of 340 female individuals who have survived breast cancer were included, and participants were randomly assigned to a training set (n = 239) and a testing set (n = 101). Multiple machine learning algorithms were trained, including Logistic Regression, Random Forest, and Support Vector Machine. Model performance was evaluated using area under the receiver operating characteristic curve (AUC) and decision curve analysis (DCA).</p><p><strong>Results: </strong>All models demonstrated strong classification performance in the testing set, with AUC values exceeding 0.87. Among them, the Random Forest and Support Vector Machine models showed superior performance in DCA. Of the seven input features-body mass index, fasting blood glucose, triglyceride, HDL cholesterol, poverty income ratio, race, and education-fasting blood glucose had the highest positive feature importance for classifying insulin resistance.</p><p><strong>Conclusions: </strong>This study demonstrates the feasibility of using machine learning algorithms to accurately predict insulin resistance in individuals who have survived breast cancer with a limited set of clinical and demographic variables. The Random Forest and Support Vector Machine models, in particular, offer strong classification performance and may support clinicians in early identification and management of insulin resistance among individuals in this high-risk population.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"341\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465519/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03189-z\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03189-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

目的:利用易于获得的临床和人口学特征,开发并验证乳腺癌女性患者胰岛素抵抗的分类模型。方法:数据来自1999年至2020年3月的美国国家健康与营养检查调查(NHANES)。共有340名乳腺癌幸存者被纳入研究,参与者被随机分配到一个训练组(n = 239)和一个测试组(n = 101)。训练了多种机器学习算法,包括逻辑回归、随机森林和支持向量机。采用受试者工作特征曲线下面积(AUC)和决策曲线分析(DCA)对模型性能进行评价。结果:所有模型在测试集中表现出较强的分类性能,AUC值均超过0.87。其中,随机森林模型和支持向量机模型在DCA中表现出较好的性能。在体重指数、空腹血糖、甘油三酯、高密度脂蛋白胆固醇、贫困收入比、种族和教育程度这七个输入特征中,空腹血糖对分类胰岛素抵抗具有最高的正向特征重要性。结论:本研究证明了使用机器学习算法在有限的临床和人口变量下准确预测乳腺癌患者胰岛素抵抗的可行性。特别是随机森林和支持向量机模型,提供了强大的分类性能,可以支持临床医生在这一高危人群中早期识别和管理胰岛素抵抗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Developing an interpretable machine learning model for easily detecting insulin resistance among breast cancer survivors: a cross-sectional study.

Developing an interpretable machine learning model for easily detecting insulin resistance among breast cancer survivors: a cross-sectional study.

Developing an interpretable machine learning model for easily detecting insulin resistance among breast cancer survivors: a cross-sectional study.

Developing an interpretable machine learning model for easily detecting insulin resistance among breast cancer survivors: a cross-sectional study.

Objective: To develop and validate a classification model for insulin resistance in female individuals who have survived breast cancer using easily obtainable clinical and demographic features.

Methods: Data were obtained from the U.S. National Health and Nutrition Examination Survey (NHANES) spanning 1999 to March 2020. A total of 340 female individuals who have survived breast cancer were included, and participants were randomly assigned to a training set (n = 239) and a testing set (n = 101). Multiple machine learning algorithms were trained, including Logistic Regression, Random Forest, and Support Vector Machine. Model performance was evaluated using area under the receiver operating characteristic curve (AUC) and decision curve analysis (DCA).

Results: All models demonstrated strong classification performance in the testing set, with AUC values exceeding 0.87. Among them, the Random Forest and Support Vector Machine models showed superior performance in DCA. Of the seven input features-body mass index, fasting blood glucose, triglyceride, HDL cholesterol, poverty income ratio, race, and education-fasting blood glucose had the highest positive feature importance for classifying insulin resistance.

Conclusions: This study demonstrates the feasibility of using machine learning algorithms to accurately predict insulin resistance in individuals who have survived breast cancer with a limited set of clinical and demographic variables. The Random Forest and Support Vector Machine models, in particular, offer strong classification performance and may support clinicians in early identification and management of insulin resistance among individuals in this high-risk population.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信