Qiying Dai MD , Akil A. Sherif MD , Chengyue Jin MD , Yongbin Chen MD, PhD , Peng Cai MS , Pengyang Li MD
{"title":"Machine learning predicting mortality in sarcoidosis patients admitted for acute heart failure","authors":"Qiying Dai MD , Akil A. Sherif MD , Chengyue Jin MD , Yongbin Chen MD, PhD , Peng Cai MS , Pengyang Li MD","doi":"10.1016/j.cvdhj.2022.08.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Sarcoidosis with cardiac involvement, although rare, has a worse prognosis than sarcoidosis involving other organ systems.</p></div><div><h3>Objective</h3><p>We used a large dataset to train machine learning models to predict in-hospital mortality among sarcoidosis patients admitted with heart failure (HF).</p></div><div><h3>Method</h3><p>Utilizing the National Inpatient Sample, we identified 4659 patients hospitalized with a primary diagnosis of HF. In this cohort, we identified patients with a secondary diagnosis of sarcoidosis using <em>International Statistical Classification of Disease, Tenth Revision</em> (ICD-10) codes. Patients were separated into a training group and a testing group in a 7:3 ratio. Least absolute shrinkage and selection operator regression was used to select variables to prevent model overfitting or underfitting. For machine learning models, logistic regression, random forest, and XGBoosting were applied in the training group. Parameters in each of the models were tuned using the GridSearchCV function. After training, all models were further validated in the testing group. Models were then evaluated using the area under curve (AUC) score, sensitivity, and specificity.</p></div><div><h3>Results</h3><p>A total of 2.3% of sarcoidosis patients died in HF admission. Our machine learning model analysis found the RF model to have the highest AUC score and sensitivity. Feature analysis found that comorbid arrhythmias and fluid electrolyte disorders were the strongest factors in predicting in-hospital mortality.</p></div><div><h3>Conclusion</h3><p>Machine learning methods can be useful in identifying predictors of in-hospital mortality in a given dataset.</p></div>","PeriodicalId":72527,"journal":{"name":"Cardiovascular digital health journal","volume":"3 6","pages":"Pages 297-304"},"PeriodicalIF":2.6000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/4f/35/main.PMC9795270.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cardiovascular digital health journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666693622001505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 1
Abstract
Background
Sarcoidosis with cardiac involvement, although rare, has a worse prognosis than sarcoidosis involving other organ systems.
Objective
We used a large dataset to train machine learning models to predict in-hospital mortality among sarcoidosis patients admitted with heart failure (HF).
Method
Utilizing the National Inpatient Sample, we identified 4659 patients hospitalized with a primary diagnosis of HF. In this cohort, we identified patients with a secondary diagnosis of sarcoidosis using International Statistical Classification of Disease, Tenth Revision (ICD-10) codes. Patients were separated into a training group and a testing group in a 7:3 ratio. Least absolute shrinkage and selection operator regression was used to select variables to prevent model overfitting or underfitting. For machine learning models, logistic regression, random forest, and XGBoosting were applied in the training group. Parameters in each of the models were tuned using the GridSearchCV function. After training, all models were further validated in the testing group. Models were then evaluated using the area under curve (AUC) score, sensitivity, and specificity.
Results
A total of 2.3% of sarcoidosis patients died in HF admission. Our machine learning model analysis found the RF model to have the highest AUC score and sensitivity. Feature analysis found that comorbid arrhythmias and fluid electrolyte disorders were the strongest factors in predicting in-hospital mortality.
Conclusion
Machine learning methods can be useful in identifying predictors of in-hospital mortality in a given dataset.