Interpretable machine learning for predicting isolated basal septal hypertrophy.

IF 2.6 3区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
PLoS ONE Pub Date : 2025-06-30 eCollection Date: 2025-01-01 DOI:10.1371/journal.pone.0325992
Lei Gao, Boyan Tian, Qiqi Jia, Xingyu He, Guannan Zhao, Yueheng Wang
{"title":"Interpretable machine learning for predicting isolated basal septal hypertrophy.","authors":"Lei Gao, Boyan Tian, Qiqi Jia, Xingyu He, Guannan Zhao, Yueheng Wang","doi":"10.1371/journal.pone.0325992","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The basal septal hypertrophy(BSH) is an often under-recognized morphological change in the left ventricle. This is a common echocardiographic finding with a prevalence of approximately 7-20%, which may indicate early structural and functional remodeling of the left ventricle in certain pathologies. It also poses a risk of severe left ventricular outflow tract obstruction and is a significant cause of postoperative complications in patients undergoing transcatheter aortic valve implantation (TAVI). Compared to traditional algorithms, machine learning algorithms are more effective at capturing nonlinear relationships and developing more accurate diagnostic and predictive models. However, no predictive models for BSH have been developed using machine learning algorithms.</p><p><strong>Objective: </strong>To evaluate the effectiveness of five machine learning algorithms in predicting thickening of the basal segment of the interventricular septum and to develop a simple, yet efficient, prediction model for BSH.</p><p><strong>Methods: </strong>Echocardiographic and clinical data from 902 patients were collected from the First Central Hospital of Baoding City, including 91 BSH patients and 811 non-BSH patients. The data were divided into training and test sets in a 7:3 ratio. Five machine learning algorithms -XGBoost, Random Forest(RF), Dicision tree(DT), K-Nearest Neighbor classification(KNN), and Naive Bayes(NB) were applied to construct the models, combined with logistic regression (LR) based on Lasso regression. The performance of each model was evaluated using Receiver Operating Characteristic curve (ROC),calibration curves and Decision Curve Analysis (DCA)curve, with the model demonstrating the best performance being selected. The shapley additive explanation (SHAP) method was employed to interpret the XBoost and RF models.</p><p><strong>Results: </strong>The logistic regression (LR) of the Lasso regression model showed that IVS-AO Angle, Left Ventricular Mass Index (LVMI), Diastolic Left Ventricular Internal Diameter Index (LVIDdI), Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Distance from mitral valve closure point to basal segment of interventricular septum (MVCP-Sd), GLU, and Mitral Valve peak A (MV-A) were associated with BSH, with odds ratios (OR) of 0.86 (0.831-0.888), 1.034 (1.018-1.052), 0.104 (0.023-0.403), 1.041 (1.021-1.064), 0.964 (0.93-0.998), 0.852 (0.764-0.949), 1.146 (1.023-1.281), and 0.967 (0.947-0.987), respectively. The area under the ROC curve (AUC) for Model-relevant variable IVS-AO Angle, MVCP_Sd,LVMI, GLU, LVIDdI, SBP,DBP,LVIDdI,MV_A were 0.87,0.68,0.66,0.55,0.56,0.67,0.75,0.75. The AUC for the algorithms (XGBoost, RF, DT, KNN, NB) in the test set were 0.92, 0.91, 0.85, 0.84, and 0.88, respectively. The SHAP method identified eight predictor variables for BSH based on importance rankings, with the top four being IVS-AO Angle, LVMI, LVIDdI, and SBP, with IVS-AO Angle emerging as the most important predictor. The external validation of the RF model yielded an AUC of 0.86.</p><p><strong>Conclusion: </strong>Machine learning can effectively predict BSH, with IVS-AO Angle identified as an independent predictor. The RF model, being simple to operate, can be applied to the risk management of BSH patients.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 6","pages":"e0325992"},"PeriodicalIF":2.6000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12208500/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0325992","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The basal septal hypertrophy(BSH) is an often under-recognized morphological change in the left ventricle. This is a common echocardiographic finding with a prevalence of approximately 7-20%, which may indicate early structural and functional remodeling of the left ventricle in certain pathologies. It also poses a risk of severe left ventricular outflow tract obstruction and is a significant cause of postoperative complications in patients undergoing transcatheter aortic valve implantation (TAVI). Compared to traditional algorithms, machine learning algorithms are more effective at capturing nonlinear relationships and developing more accurate diagnostic and predictive models. However, no predictive models for BSH have been developed using machine learning algorithms.

Objective: To evaluate the effectiveness of five machine learning algorithms in predicting thickening of the basal segment of the interventricular septum and to develop a simple, yet efficient, prediction model for BSH.

Methods: Echocardiographic and clinical data from 902 patients were collected from the First Central Hospital of Baoding City, including 91 BSH patients and 811 non-BSH patients. The data were divided into training and test sets in a 7:3 ratio. Five machine learning algorithms -XGBoost, Random Forest(RF), Dicision tree(DT), K-Nearest Neighbor classification(KNN), and Naive Bayes(NB) were applied to construct the models, combined with logistic regression (LR) based on Lasso regression. The performance of each model was evaluated using Receiver Operating Characteristic curve (ROC),calibration curves and Decision Curve Analysis (DCA)curve, with the model demonstrating the best performance being selected. The shapley additive explanation (SHAP) method was employed to interpret the XBoost and RF models.

Results: The logistic regression (LR) of the Lasso regression model showed that IVS-AO Angle, Left Ventricular Mass Index (LVMI), Diastolic Left Ventricular Internal Diameter Index (LVIDdI), Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Distance from mitral valve closure point to basal segment of interventricular septum (MVCP-Sd), GLU, and Mitral Valve peak A (MV-A) were associated with BSH, with odds ratios (OR) of 0.86 (0.831-0.888), 1.034 (1.018-1.052), 0.104 (0.023-0.403), 1.041 (1.021-1.064), 0.964 (0.93-0.998), 0.852 (0.764-0.949), 1.146 (1.023-1.281), and 0.967 (0.947-0.987), respectively. The area under the ROC curve (AUC) for Model-relevant variable IVS-AO Angle, MVCP_Sd,LVMI, GLU, LVIDdI, SBP,DBP,LVIDdI,MV_A were 0.87,0.68,0.66,0.55,0.56,0.67,0.75,0.75. The AUC for the algorithms (XGBoost, RF, DT, KNN, NB) in the test set were 0.92, 0.91, 0.85, 0.84, and 0.88, respectively. The SHAP method identified eight predictor variables for BSH based on importance rankings, with the top four being IVS-AO Angle, LVMI, LVIDdI, and SBP, with IVS-AO Angle emerging as the most important predictor. The external validation of the RF model yielded an AUC of 0.86.

Conclusion: Machine learning can effectively predict BSH, with IVS-AO Angle identified as an independent predictor. The RF model, being simple to operate, can be applied to the risk management of BSH patients.

Abstract Image

Abstract Image

Abstract Image

预测孤立性基底间隔肥大的可解释机器学习。
背景:基底间隔肥厚(BSH)是一种常被忽视的左心室形态学改变。这是一种常见的超声心动图发现,患病率约为7-20%,这可能表明在某些病理中左心室的早期结构和功能重构。它还会造成严重的左心室流出道梗阻,是经导管主动脉瓣植入术(TAVI)患者术后并发症的重要原因。与传统算法相比,机器学习算法在捕捉非线性关系和开发更准确的诊断和预测模型方面更有效。然而,目前还没有使用机器学习算法开发出BSH的预测模型。目的:评价五种机器学习算法在预测室间隔基底段增厚方面的有效性,并建立一种简单有效的室间隔基底段增厚预测模型。方法:收集保定市第一中心医院902例患者的超声心动图及临床资料,其中BSH患者91例,非BSH患者811例。数据以7:3的比例分为训练集和测试集。采用xgboost、随机森林(RF)、决策树(DT)、k近邻分类(KNN)和朴素贝叶斯(NB)五种机器学习算法构建模型,并结合基于Lasso回归的逻辑回归(LR)。采用受试者工作特征曲线(ROC)、校正曲线和决策曲线分析(DCA)对各模型的性能进行评价,选择性能最佳的模型。采用shapley加性解释(SHAP)方法对XBoost和RF模型进行解释。结果:Lasso回归模型的logistic回归(LR)显示,IVS-AO角、左室质量指数(LVMI)、左室舒张内径指数(LVIDdI)、收缩压(SBP)、舒张压(DBP)、二尖瓣关闭点至室间隔基段距离(MVCP-Sd)、GLU、二尖瓣峰值A (MV-A)与BSH相关,比值比(OR)分别为0.86(0.831-0.888)、1.034(1.018-1.052)、0.104(0.023-0.403)。分别为1.041(1.021-1.064)、0.964(0.93-0.998)、0.852(0.764-0.949)、1.146(1.023-1.281)、0.967(0.947-0.987)。模型相关变量IVS-AO角、MVCP_Sd、LVMI、GLU、LVIDdI、SBP、DBP、LVIDdI、MV_A的ROC曲线下面积(AUC)分别为0.87、0.68、0.66、0.55、0.56、0.67、0.75、0.75。测试集中算法(XGBoost、RF、DT、KNN、NB)的AUC分别为0.92、0.91、0.85、0.84和0.88。SHAP方法根据重要性排序确定了8个预测变量,其中前四个是IVS-AO角度、LVMI、LVIDdI和SBP,其中IVS-AO角度是最重要的预测变量。外部验证RF模型的AUC为0.86。结论:机器学习可有效预测BSH, IVS-AO角度可作为独立预测因子。RF模型操作简单,可应用于BSH患者的风险管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
PLoS ONE
PLoS ONE 生物-生物学
CiteScore
6.20
自引率
5.40%
发文量
14242
审稿时长
3.7 months
期刊介绍: PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides: * Open-access—freely accessible online, authors retain copyright * Fast publication times * Peer review by expert, practicing researchers * Post-publication tools to indicate quality and impact * Community-based dialogue on articles * Worldwide media coverage
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信