使用常规血液生物标志物和衍生指标预测易损性颈动脉斑块的机器学习模型的开发和验证:对性别相关风险模式的见解。

IF 10.6 1区 医学 Q1 CARDIAC & CARDIOVASCULAR SYSTEMS
Yimin E, Zhichao Yao, Maolin Ge, Guijun Huo, Jian Huang, Yao Tang, Zhanao Liu, Ziyi Tan, Yuqi Zeng, Junjie Cao, Dayong Zhou
{"title":"使用常规血液生物标志物和衍生指标预测易损性颈动脉斑块的机器学习模型的开发和验证:对性别相关风险模式的见解。","authors":"Yimin E, Zhichao Yao, Maolin Ge, Guijun Huo, Jian Huang, Yao Tang, Zhanao Liu, Ziyi Tan, Yuqi Zeng, Junjie Cao, Dayong Zhou","doi":"10.1186/s12933-025-02867-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Early detection of vulnerable carotid plaques is critical for stroke prevention. This study aimed to develop a machine learning model based on routine blood tests and derived indices to predict plaque vulnerability and assess sex-specific risk patterns across biomarker value ranges.</p><p><strong>Methods: </strong>We retrospectively included 1701 hospitalized patients from Suzhou Municipal Hospital (2019-2020), selected from an initial cohort of 10,028 individuals. All patients underwent carotid ultrasound, with vulnerable plaques identified using predefined imaging criteria. A total of 30 laboratory variables-including blood count, coagulation, and biochemistry-were extracted, alongside derived indices such as triglyceride-glucose index (TyG), atherogenic index of plasma (AIP), neutrophil-to-lymphocyte ratio (NLR) and others. Features were standardized and selected based on statistical and clinical relevance. Five machine learning models were trained using a 7:3 train-test split and evaluated by cross-validation. Model performance was assessed using AUC, sensitivity, and specificity. The best model was interpreted using SHapley Additive exPlanations (SHAP) analysis. Sex differences were explored using Mann-Whitney U tests and restricted cubic spline (RCS) modeling across value intervals.</p><p><strong>Results: </strong>The Random Forest model showed the highest predictive performance (AUC = 0.847; 95% CI 0.791-0.895; specificity = 89.4%; sensitivity = 64.2%). SHAP analysis identified gender, age, fibrinogen, NLR, creatinine, fasting blood glucose, uric acid to high-density lipoprotein ratio (UHR), TyG, systemic inflammation response index (SIRI), and lymphocyte count as top predictors. Significant sex-specific differences in SHAP values were observed for key biomarkers, including age, UHR, TyG, SIRI, and others. RCS modeling further revealed distinct sex-related patterns in plaque vulnerability across biomarker value ranges.</p><p><strong>Conclusion: </strong>A Random Forest model integrating routine blood markers and derived indices accurately predicted vulnerable carotid plaques. The results underscore the importance of sex-specific risk assessment, highlighting differential effects of key biomarkers across genders and value intervals.</p>","PeriodicalId":9374,"journal":{"name":"Cardiovascular Diabetology","volume":"24 1","pages":"326"},"PeriodicalIF":10.6000,"publicationDate":"2025-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337436/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of a machine learning model for predicting vulnerable carotid plaques using routine blood biomarkers and derived indicators: insights into sex-related risk patterns.\",\"authors\":\"Yimin E, Zhichao Yao, Maolin Ge, Guijun Huo, Jian Huang, Yao Tang, Zhanao Liu, Ziyi Tan, Yuqi Zeng, Junjie Cao, Dayong Zhou\",\"doi\":\"10.1186/s12933-025-02867-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Early detection of vulnerable carotid plaques is critical for stroke prevention. This study aimed to develop a machine learning model based on routine blood tests and derived indices to predict plaque vulnerability and assess sex-specific risk patterns across biomarker value ranges.</p><p><strong>Methods: </strong>We retrospectively included 1701 hospitalized patients from Suzhou Municipal Hospital (2019-2020), selected from an initial cohort of 10,028 individuals. All patients underwent carotid ultrasound, with vulnerable plaques identified using predefined imaging criteria. A total of 30 laboratory variables-including blood count, coagulation, and biochemistry-were extracted, alongside derived indices such as triglyceride-glucose index (TyG), atherogenic index of plasma (AIP), neutrophil-to-lymphocyte ratio (NLR) and others. Features were standardized and selected based on statistical and clinical relevance. Five machine learning models were trained using a 7:3 train-test split and evaluated by cross-validation. Model performance was assessed using AUC, sensitivity, and specificity. The best model was interpreted using SHapley Additive exPlanations (SHAP) analysis. Sex differences were explored using Mann-Whitney U tests and restricted cubic spline (RCS) modeling across value intervals.</p><p><strong>Results: </strong>The Random Forest model showed the highest predictive performance (AUC = 0.847; 95% CI 0.791-0.895; specificity = 89.4%; sensitivity = 64.2%). SHAP analysis identified gender, age, fibrinogen, NLR, creatinine, fasting blood glucose, uric acid to high-density lipoprotein ratio (UHR), TyG, systemic inflammation response index (SIRI), and lymphocyte count as top predictors. Significant sex-specific differences in SHAP values were observed for key biomarkers, including age, UHR, TyG, SIRI, and others. RCS modeling further revealed distinct sex-related patterns in plaque vulnerability across biomarker value ranges.</p><p><strong>Conclusion: </strong>A Random Forest model integrating routine blood markers and derived indices accurately predicted vulnerable carotid plaques. The results underscore the importance of sex-specific risk assessment, highlighting differential effects of key biomarkers across genders and value intervals.</p>\",\"PeriodicalId\":9374,\"journal\":{\"name\":\"Cardiovascular Diabetology\",\"volume\":\"24 1\",\"pages\":\"326\"},\"PeriodicalIF\":10.6000,\"publicationDate\":\"2025-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337436/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cardiovascular Diabetology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12933-025-02867-6\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiovascular Diabetology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12933-025-02867-6","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

背景:早期发现颈动脉易损斑块是预防脑卒中的关键。本研究旨在开发一种基于常规血液检查和衍生指数的机器学习模型,以预测斑块易损性并评估跨生物标志物值范围的性别特异性风险模式。方法:我们回顾性地纳入了苏州市立医院(2019-2020)的1701例住院患者,这些患者是从10028名初始队列中选择的。所有患者均行颈动脉超声检查,使用预先确定的成像标准识别易损斑块。总共提取了30个实验室变量,包括血细胞计数、凝血和生物化学,以及衍生指标,如甘油三酯-葡萄糖指数(TyG)、血浆动脉粥样硬化指数(AIP)、中性粒细胞与淋巴细胞比率(NLR)等。根据统计学和临床相关性对特征进行标准化和选择。五个机器学习模型使用7:3训练-测试分割进行训练,并通过交叉验证进行评估。采用AUC、敏感性和特异性评估模型性能。最佳模型采用SHapley加性解释(SHAP)分析进行解释。使用Mann-Whitney U检验和限制性三次样条(RCS)模型跨值区间探讨性别差异。结果:随机森林模型的预测性能最高(AUC = 0.847;95% ci 0.791-0.895;特异性= 89.4%;灵敏度= 64.2%)。SHAP分析发现,性别、年龄、纤维蛋白原、NLR、肌酐、空腹血糖、尿酸与高密度脂蛋白比(UHR)、TyG、全身炎症反应指数(SIRI)和淋巴细胞计数是最重要的预测因子。关键生物标志物(包括年龄、UHR、TyG、SIRI等)的SHAP值存在显著的性别差异。RCS模型进一步揭示了不同生物标志物值范围内斑块易损性的不同性别相关模式。结论:结合常规血液指标和衍生指标的随机森林模型能准确预测颈动脉易损斑块。结果强调了性别特异性风险评估的重要性,强调了关键生物标志物在性别和值区间的差异效应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Development and validation of a machine learning model for predicting vulnerable carotid plaques using routine blood biomarkers and derived indicators: insights into sex-related risk patterns.

Background: Early detection of vulnerable carotid plaques is critical for stroke prevention. This study aimed to develop a machine learning model based on routine blood tests and derived indices to predict plaque vulnerability and assess sex-specific risk patterns across biomarker value ranges.

Methods: We retrospectively included 1701 hospitalized patients from Suzhou Municipal Hospital (2019-2020), selected from an initial cohort of 10,028 individuals. All patients underwent carotid ultrasound, with vulnerable plaques identified using predefined imaging criteria. A total of 30 laboratory variables-including blood count, coagulation, and biochemistry-were extracted, alongside derived indices such as triglyceride-glucose index (TyG), atherogenic index of plasma (AIP), neutrophil-to-lymphocyte ratio (NLR) and others. Features were standardized and selected based on statistical and clinical relevance. Five machine learning models were trained using a 7:3 train-test split and evaluated by cross-validation. Model performance was assessed using AUC, sensitivity, and specificity. The best model was interpreted using SHapley Additive exPlanations (SHAP) analysis. Sex differences were explored using Mann-Whitney U tests and restricted cubic spline (RCS) modeling across value intervals.

Results: The Random Forest model showed the highest predictive performance (AUC = 0.847; 95% CI 0.791-0.895; specificity = 89.4%; sensitivity = 64.2%). SHAP analysis identified gender, age, fibrinogen, NLR, creatinine, fasting blood glucose, uric acid to high-density lipoprotein ratio (UHR), TyG, systemic inflammation response index (SIRI), and lymphocyte count as top predictors. Significant sex-specific differences in SHAP values were observed for key biomarkers, including age, UHR, TyG, SIRI, and others. RCS modeling further revealed distinct sex-related patterns in plaque vulnerability across biomarker value ranges.

Conclusion: A Random Forest model integrating routine blood markers and derived indices accurately predicted vulnerable carotid plaques. The results underscore the importance of sex-specific risk assessment, highlighting differential effects of key biomarkers across genders and value intervals.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Cardiovascular Diabetology
Cardiovascular Diabetology 医学-内分泌学与代谢
CiteScore
12.30
自引率
15.10%
发文量
240
审稿时长
1 months
期刊介绍: Cardiovascular Diabetology is a journal that welcomes manuscripts exploring various aspects of the relationship between diabetes, cardiovascular health, and the metabolic syndrome. We invite submissions related to clinical studies, genetic investigations, experimental research, pharmacological studies, epidemiological analyses, and molecular biology research in this field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信