Toru Shirahata, Pietro Nardelli, Sirus Jesudasen, Ruben San José Estépar, Ariel H Curiale, Badar Patel, Eileen Harder, Rajan Saggar, Aaron B Waxman, Rebecca R Vanderpool, George R Washko, Sydney B Montesi, Raúl San José Estépar, Farbod N Rahaghi
{"title":"使用随机森林模型和中央计算机断层扫描结构的自动测量检测特发性肺纤维化中的肺动脉高压。","authors":"Toru Shirahata, Pietro Nardelli, Sirus Jesudasen, Ruben San José Estépar, Ariel H Curiale, Badar Patel, Eileen Harder, Rajan Saggar, Aaron B Waxman, Rebecca R Vanderpool, George R Washko, Sydney B Montesi, Raúl San José Estépar, Farbod N Rahaghi","doi":"10.1183/23120541.01057-2024","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Since pulmonary hypertension (PH) worsens the prognosis of idiopathic pulmonary fibrosis (IPF), early prediction of PH is crucial for timely intervention. This study aims to develop and validate a machine learning model to predict PH using automated computed tomography (CT)-based 3D measurements, particularly central cardiovascular structures, segmented by a publicly available tool.</p><p><strong>Methods: </strong>We retrospectively studied 163 IPF patients who underwent both thin-section chest CT (contrast-enhanced and non-contrast) and right heart catheterisation within 2 years (78.5% within 6 months). Central CT structures were segmented using the TotalSegmentator Neural Network Version 1. We also manually measured pulmonary artery (PA) and ascending aorta (A) diameters. Random forest (RF) and logistic regression (LR) models were created and the model's reliability was assessed with 10-fold cross-validation. Shapley additive explanation (SHAP) analysis was performed to understand the contribution of each variable to the RF model.</p><p><strong>Results: </strong>Of the 163 patients, 75 had PH (46.0%). Significant differences were found in race, body mass index, right atrial (RA) volume, and PA volume between PH and non-PH patients. The RF model outperformed the LR model, showing higher area under the curve (AUC) (0.87 <i>versus</i> 0.82). Replacing PA volume with the PA/A ratio in the RF model decreased performance (AUC: 0.87 <i>versus</i> 0.79). SHAP identified PA and RA volumes as key features. No significant differences were observed between mean pulmonary arterial pressure and RA or PA volume in non-contrast CT compared to contrast-enhanced CT.</p><p><strong>Conclusion: </strong>The RF model with volumetric measures showed superior predictive performance for PH. Notably, both the RF model and segmentations of central CT structures are automated, facilitating seamless integration into clinical practice.</p>","PeriodicalId":11739,"journal":{"name":"ERJ Open Research","volume":"11 5","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12451577/pdf/","citationCount":"0","resultStr":"{\"title\":\"Detection of pulmonary hypertension in idiopathic pulmonary fibrosis using random forest models and automated measures of central computed tomography structures.\",\"authors\":\"Toru Shirahata, Pietro Nardelli, Sirus Jesudasen, Ruben San José Estépar, Ariel H Curiale, Badar Patel, Eileen Harder, Rajan Saggar, Aaron B Waxman, Rebecca R Vanderpool, George R Washko, Sydney B Montesi, Raúl San José Estépar, Farbod N Rahaghi\",\"doi\":\"10.1183/23120541.01057-2024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Since pulmonary hypertension (PH) worsens the prognosis of idiopathic pulmonary fibrosis (IPF), early prediction of PH is crucial for timely intervention. This study aims to develop and validate a machine learning model to predict PH using automated computed tomography (CT)-based 3D measurements, particularly central cardiovascular structures, segmented by a publicly available tool.</p><p><strong>Methods: </strong>We retrospectively studied 163 IPF patients who underwent both thin-section chest CT (contrast-enhanced and non-contrast) and right heart catheterisation within 2 years (78.5% within 6 months). Central CT structures were segmented using the TotalSegmentator Neural Network Version 1. We also manually measured pulmonary artery (PA) and ascending aorta (A) diameters. Random forest (RF) and logistic regression (LR) models were created and the model's reliability was assessed with 10-fold cross-validation. Shapley additive explanation (SHAP) analysis was performed to understand the contribution of each variable to the RF model.</p><p><strong>Results: </strong>Of the 163 patients, 75 had PH (46.0%). Significant differences were found in race, body mass index, right atrial (RA) volume, and PA volume between PH and non-PH patients. The RF model outperformed the LR model, showing higher area under the curve (AUC) (0.87 <i>versus</i> 0.82). Replacing PA volume with the PA/A ratio in the RF model decreased performance (AUC: 0.87 <i>versus</i> 0.79). SHAP identified PA and RA volumes as key features. No significant differences were observed between mean pulmonary arterial pressure and RA or PA volume in non-contrast CT compared to contrast-enhanced CT.</p><p><strong>Conclusion: </strong>The RF model with volumetric measures showed superior predictive performance for PH. Notably, both the RF model and segmentations of central CT structures are automated, facilitating seamless integration into clinical practice.</p>\",\"PeriodicalId\":11739,\"journal\":{\"name\":\"ERJ Open Research\",\"volume\":\"11 5\",\"pages\":\"\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12451577/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERJ Open Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1183/23120541.01057-2024\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/9/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"RESPIRATORY SYSTEM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERJ Open Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1183/23120541.01057-2024","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
Detection of pulmonary hypertension in idiopathic pulmonary fibrosis using random forest models and automated measures of central computed tomography structures.
Objectives: Since pulmonary hypertension (PH) worsens the prognosis of idiopathic pulmonary fibrosis (IPF), early prediction of PH is crucial for timely intervention. This study aims to develop and validate a machine learning model to predict PH using automated computed tomography (CT)-based 3D measurements, particularly central cardiovascular structures, segmented by a publicly available tool.
Methods: We retrospectively studied 163 IPF patients who underwent both thin-section chest CT (contrast-enhanced and non-contrast) and right heart catheterisation within 2 years (78.5% within 6 months). Central CT structures were segmented using the TotalSegmentator Neural Network Version 1. We also manually measured pulmonary artery (PA) and ascending aorta (A) diameters. Random forest (RF) and logistic regression (LR) models were created and the model's reliability was assessed with 10-fold cross-validation. Shapley additive explanation (SHAP) analysis was performed to understand the contribution of each variable to the RF model.
Results: Of the 163 patients, 75 had PH (46.0%). Significant differences were found in race, body mass index, right atrial (RA) volume, and PA volume between PH and non-PH patients. The RF model outperformed the LR model, showing higher area under the curve (AUC) (0.87 versus 0.82). Replacing PA volume with the PA/A ratio in the RF model decreased performance (AUC: 0.87 versus 0.79). SHAP identified PA and RA volumes as key features. No significant differences were observed between mean pulmonary arterial pressure and RA or PA volume in non-contrast CT compared to contrast-enhanced CT.
Conclusion: The RF model with volumetric measures showed superior predictive performance for PH. Notably, both the RF model and segmentations of central CT structures are automated, facilitating seamless integration into clinical practice.
期刊介绍:
ERJ Open Research is a fully open access original research journal, published online by the European Respiratory Society. The journal aims to publish high-quality work in all fields of respiratory science and medicine, covering basic science, clinical translational science and clinical medicine. The journal was created to help fulfil the ERS objective to disseminate scientific and educational material to its members and to the medical community, but also to provide researchers with an affordable open access specialty journal in which to publish their work.