Anca Chiriac MD, PhD , Che Ngufor PhD , Holly K. van Houten BA , Raphael Mwangi MS , Malini Madhavan MBBS , Peter A. Noseworthy MD , Samuel J. Asirvatham MD , Sabrina D. Phillips MD , Christopher J. McLeod MB ChB, PhD
{"title":"超越心房颤动:机器学习算法可预测先天性心脏病成人患者的中风情况","authors":"Anca Chiriac MD, PhD , Che Ngufor PhD , Holly K. van Houten BA , Raphael Mwangi MS , Malini Madhavan MBBS , Peter A. Noseworthy MD , Samuel J. Asirvatham MD , Sabrina D. Phillips MD , Christopher J. McLeod MB ChB, PhD","doi":"10.1016/j.mcpdig.2023.12.002","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>To develop and validate a robust risk prediction model for stroke and systemic embolism (SSE) in adult patients with congenital heart disease (ACHD), using artificial intelligence.</p></div><div><h3>Patients and Methods</h3><p>Deidentified insurance claims from the Optum Labs Data Warehouse, including enrollment records and medical and pharmacy claims for commercial and Medicare Advantage enrollees, were used to identify 49,276 patients with ACHD, followed between January 1, 2009, and December 31, 2014. The group was randomly divided into development (70%) and validation (30%) cohorts. The development cohort was used to train 2 machine learning (ML) algorithms, regularized Cox regression (RegCox), and extreme gradient boosting (XGBoost) to predict SSE at 1, 2, and 5 years. The Shapley additive explanations (SHAP) model was used to identify the variables particularly driving the SSE risk.</p></div><div><h3>Results</h3><p>Within this large and diverse cohort of patients with ACHD (mean age, 59 ± 19 years; 25,390 (51.5%) female, 35,766 [77.6%]) white), 1756 (3.6%) patients experienced SSE during follow-up. In the Validation cohort, CHA<sub>2</sub>DS<sub>2</sub>-VASC had an area under the receiver operating characteristics curve (AUC) of 0.66 for predicting SSE at 1-,2, and 5-years. RegCox had the best predictive performance, with AUCs of 0.82,.81, and.80 at 1-, 2, and 5-years. XGBoost had AUCs of 0.81, 0.80, and 0.79 respectively. Atrial septal defect (ASD) emerged as an important predictor for SSE uncovered by the unbiased ML algorithms. A new clinical risk score, the CHA<sub>2</sub>DS<sub>2</sub>-VASC-ASD<sub>2</sub> score, provides improved SSE prediction in ACHD. Yet, the ML models still outperformed this.</p></div><div><h3>Conclusion</h3><p>ML models significantly outperformed the clinical risk scores in patients with ACHD.</p></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"2 1","pages":"Pages 92-103"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949761224000026/pdfft?md5=c34fed3977be03552486d0740a93fe5f&pid=1-s2.0-S2949761224000026-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Beyond Atrial Fibrillation: Machine Learning Algorithm Predicts Stroke in Adult Patients With Congenital Heart Disease\",\"authors\":\"Anca Chiriac MD, PhD , Che Ngufor PhD , Holly K. van Houten BA , Raphael Mwangi MS , Malini Madhavan MBBS , Peter A. Noseworthy MD , Samuel J. Asirvatham MD , Sabrina D. Phillips MD , Christopher J. McLeod MB ChB, PhD\",\"doi\":\"10.1016/j.mcpdig.2023.12.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><p>To develop and validate a robust risk prediction model for stroke and systemic embolism (SSE) in adult patients with congenital heart disease (ACHD), using artificial intelligence.</p></div><div><h3>Patients and Methods</h3><p>Deidentified insurance claims from the Optum Labs Data Warehouse, including enrollment records and medical and pharmacy claims for commercial and Medicare Advantage enrollees, were used to identify 49,276 patients with ACHD, followed between January 1, 2009, and December 31, 2014. The group was randomly divided into development (70%) and validation (30%) cohorts. The development cohort was used to train 2 machine learning (ML) algorithms, regularized Cox regression (RegCox), and extreme gradient boosting (XGBoost) to predict SSE at 1, 2, and 5 years. The Shapley additive explanations (SHAP) model was used to identify the variables particularly driving the SSE risk.</p></div><div><h3>Results</h3><p>Within this large and diverse cohort of patients with ACHD (mean age, 59 ± 19 years; 25,390 (51.5%) female, 35,766 [77.6%]) white), 1756 (3.6%) patients experienced SSE during follow-up. In the Validation cohort, CHA<sub>2</sub>DS<sub>2</sub>-VASC had an area under the receiver operating characteristics curve (AUC) of 0.66 for predicting SSE at 1-,2, and 5-years. RegCox had the best predictive performance, with AUCs of 0.82,.81, and.80 at 1-, 2, and 5-years. XGBoost had AUCs of 0.81, 0.80, and 0.79 respectively. Atrial septal defect (ASD) emerged as an important predictor for SSE uncovered by the unbiased ML algorithms. A new clinical risk score, the CHA<sub>2</sub>DS<sub>2</sub>-VASC-ASD<sub>2</sub> score, provides improved SSE prediction in ACHD. Yet, the ML models still outperformed this.</p></div><div><h3>Conclusion</h3><p>ML models significantly outperformed the clinical risk scores in patients with ACHD.</p></div>\",\"PeriodicalId\":74127,\"journal\":{\"name\":\"Mayo Clinic Proceedings. Digital health\",\"volume\":\"2 1\",\"pages\":\"Pages 92-103\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2949761224000026/pdfft?md5=c34fed3977be03552486d0740a93fe5f&pid=1-s2.0-S2949761224000026-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mayo Clinic Proceedings. Digital health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949761224000026\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mayo Clinic Proceedings. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949761224000026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Beyond Atrial Fibrillation: Machine Learning Algorithm Predicts Stroke in Adult Patients With Congenital Heart Disease
Objective
To develop and validate a robust risk prediction model for stroke and systemic embolism (SSE) in adult patients with congenital heart disease (ACHD), using artificial intelligence.
Patients and Methods
Deidentified insurance claims from the Optum Labs Data Warehouse, including enrollment records and medical and pharmacy claims for commercial and Medicare Advantage enrollees, were used to identify 49,276 patients with ACHD, followed between January 1, 2009, and December 31, 2014. The group was randomly divided into development (70%) and validation (30%) cohorts. The development cohort was used to train 2 machine learning (ML) algorithms, regularized Cox regression (RegCox), and extreme gradient boosting (XGBoost) to predict SSE at 1, 2, and 5 years. The Shapley additive explanations (SHAP) model was used to identify the variables particularly driving the SSE risk.
Results
Within this large and diverse cohort of patients with ACHD (mean age, 59 ± 19 years; 25,390 (51.5%) female, 35,766 [77.6%]) white), 1756 (3.6%) patients experienced SSE during follow-up. In the Validation cohort, CHA2DS2-VASC had an area under the receiver operating characteristics curve (AUC) of 0.66 for predicting SSE at 1-,2, and 5-years. RegCox had the best predictive performance, with AUCs of 0.82,.81, and.80 at 1-, 2, and 5-years. XGBoost had AUCs of 0.81, 0.80, and 0.79 respectively. Atrial septal defect (ASD) emerged as an important predictor for SSE uncovered by the unbiased ML algorithms. A new clinical risk score, the CHA2DS2-VASC-ASD2 score, provides improved SSE prediction in ACHD. Yet, the ML models still outperformed this.
Conclusion
ML models significantly outperformed the clinical risk scores in patients with ACHD.