Lei Gao, Boyan Tian, Qiqi Jia, Xingyu He, Guannan Zhao, Yueheng Wang
{"title":"Interpretable machine learning for predicting isolated basal septal hypertrophy.","authors":"Lei Gao, Boyan Tian, Qiqi Jia, Xingyu He, Guannan Zhao, Yueheng Wang","doi":"10.1371/journal.pone.0325992","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The basal septal hypertrophy(BSH) is an often under-recognized morphological change in the left ventricle. This is a common echocardiographic finding with a prevalence of approximately 7-20%, which may indicate early structural and functional remodeling of the left ventricle in certain pathologies. It also poses a risk of severe left ventricular outflow tract obstruction and is a significant cause of postoperative complications in patients undergoing transcatheter aortic valve implantation (TAVI). Compared to traditional algorithms, machine learning algorithms are more effective at capturing nonlinear relationships and developing more accurate diagnostic and predictive models. However, no predictive models for BSH have been developed using machine learning algorithms.</p><p><strong>Objective: </strong>To evaluate the effectiveness of five machine learning algorithms in predicting thickening of the basal segment of the interventricular septum and to develop a simple, yet efficient, prediction model for BSH.</p><p><strong>Methods: </strong>Echocardiographic and clinical data from 902 patients were collected from the First Central Hospital of Baoding City, including 91 BSH patients and 811 non-BSH patients. The data were divided into training and test sets in a 7:3 ratio. Five machine learning algorithms -XGBoost, Random Forest(RF), Dicision tree(DT), K-Nearest Neighbor classification(KNN), and Naive Bayes(NB) were applied to construct the models, combined with logistic regression (LR) based on Lasso regression. The performance of each model was evaluated using Receiver Operating Characteristic curve (ROC),calibration curves and Decision Curve Analysis (DCA)curve, with the model demonstrating the best performance being selected. The shapley additive explanation (SHAP) method was employed to interpret the XBoost and RF models.</p><p><strong>Results: </strong>The logistic regression (LR) of the Lasso regression model showed that IVS-AO Angle, Left Ventricular Mass Index (LVMI), Diastolic Left Ventricular Internal Diameter Index (LVIDdI), Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Distance from mitral valve closure point to basal segment of interventricular septum (MVCP-Sd), GLU, and Mitral Valve peak A (MV-A) were associated with BSH, with odds ratios (OR) of 0.86 (0.831-0.888), 1.034 (1.018-1.052), 0.104 (0.023-0.403), 1.041 (1.021-1.064), 0.964 (0.93-0.998), 0.852 (0.764-0.949), 1.146 (1.023-1.281), and 0.967 (0.947-0.987), respectively. The area under the ROC curve (AUC) for Model-relevant variable IVS-AO Angle, MVCP_Sd,LVMI, GLU, LVIDdI, SBP,DBP,LVIDdI,MV_A were 0.87,0.68,0.66,0.55,0.56,0.67,0.75,0.75. The AUC for the algorithms (XGBoost, RF, DT, KNN, NB) in the test set were 0.92, 0.91, 0.85, 0.84, and 0.88, respectively. The SHAP method identified eight predictor variables for BSH based on importance rankings, with the top four being IVS-AO Angle, LVMI, LVIDdI, and SBP, with IVS-AO Angle emerging as the most important predictor. The external validation of the RF model yielded an AUC of 0.86.</p><p><strong>Conclusion: </strong>Machine learning can effectively predict BSH, with IVS-AO Angle identified as an independent predictor. The RF model, being simple to operate, can be applied to the risk management of BSH patients.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 6","pages":"e0325992"},"PeriodicalIF":2.6000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12208500/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0325992","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The basal septal hypertrophy(BSH) is an often under-recognized morphological change in the left ventricle. This is a common echocardiographic finding with a prevalence of approximately 7-20%, which may indicate early structural and functional remodeling of the left ventricle in certain pathologies. It also poses a risk of severe left ventricular outflow tract obstruction and is a significant cause of postoperative complications in patients undergoing transcatheter aortic valve implantation (TAVI). Compared to traditional algorithms, machine learning algorithms are more effective at capturing nonlinear relationships and developing more accurate diagnostic and predictive models. However, no predictive models for BSH have been developed using machine learning algorithms.
Objective: To evaluate the effectiveness of five machine learning algorithms in predicting thickening of the basal segment of the interventricular septum and to develop a simple, yet efficient, prediction model for BSH.
Methods: Echocardiographic and clinical data from 902 patients were collected from the First Central Hospital of Baoding City, including 91 BSH patients and 811 non-BSH patients. The data were divided into training and test sets in a 7:3 ratio. Five machine learning algorithms -XGBoost, Random Forest(RF), Dicision tree(DT), K-Nearest Neighbor classification(KNN), and Naive Bayes(NB) were applied to construct the models, combined with logistic regression (LR) based on Lasso regression. The performance of each model was evaluated using Receiver Operating Characteristic curve (ROC),calibration curves and Decision Curve Analysis (DCA)curve, with the model demonstrating the best performance being selected. The shapley additive explanation (SHAP) method was employed to interpret the XBoost and RF models.
Results: The logistic regression (LR) of the Lasso regression model showed that IVS-AO Angle, Left Ventricular Mass Index (LVMI), Diastolic Left Ventricular Internal Diameter Index (LVIDdI), Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Distance from mitral valve closure point to basal segment of interventricular septum (MVCP-Sd), GLU, and Mitral Valve peak A (MV-A) were associated with BSH, with odds ratios (OR) of 0.86 (0.831-0.888), 1.034 (1.018-1.052), 0.104 (0.023-0.403), 1.041 (1.021-1.064), 0.964 (0.93-0.998), 0.852 (0.764-0.949), 1.146 (1.023-1.281), and 0.967 (0.947-0.987), respectively. The area under the ROC curve (AUC) for Model-relevant variable IVS-AO Angle, MVCP_Sd,LVMI, GLU, LVIDdI, SBP,DBP,LVIDdI,MV_A were 0.87,0.68,0.66,0.55,0.56,0.67,0.75,0.75. The AUC for the algorithms (XGBoost, RF, DT, KNN, NB) in the test set were 0.92, 0.91, 0.85, 0.84, and 0.88, respectively. The SHAP method identified eight predictor variables for BSH based on importance rankings, with the top four being IVS-AO Angle, LVMI, LVIDdI, and SBP, with IVS-AO Angle emerging as the most important predictor. The external validation of the RF model yielded an AUC of 0.86.
Conclusion: Machine learning can effectively predict BSH, with IVS-AO Angle identified as an independent predictor. The RF model, being simple to operate, can be applied to the risk management of BSH patients.
期刊介绍:
PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides:
* Open-access—freely accessible online, authors retain copyright
* Fast publication times
* Peer review by expert, practicing researchers
* Post-publication tools to indicate quality and impact
* Community-based dialogue on articles
* Worldwide media coverage