{"title":"A three-gene random forest model for diagnosing idiopathic pulmonary fibrosis based on circadian rhythm-related genes in lung tissue.","authors":"Jie He, Jun Hu, Hairong Liu","doi":"10.1080/17476348.2024.2311262","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The disorder of circadian rhythm could be a key factor mediating fibrotic lung disease Therefore, our study aims to determine the diagnostic value of circadian rhythm-related genes (CRRGs) in IPF.</p><p><strong>Methods: </strong>We retrieved the data on CRRGs from previous studies and the GSE150910 dataset. The participants from the GSE150910 dataset were divided into training and internal validation sets. Next, we used several various bioinformatics methods and machine learning algorithms to screen genes. Next, we identified SEMA5A, COL7A1, and TUBB3, which were included in the random forest (RF) diagnostic model. Finally, external validation was conducted on data retrieved from the GSE184316 datasets.</p><p><strong>Results: </strong>The results revealed that the RF diagnostic model could diagnose patients with IPF in the internal validation set with the area under the ROC curve (AUC) value of 0.905 and in the external validation with the AUC value of 0.767. Furthermore, real-time quantitative PCR and western blotting results revealed a significant decrease in SEMA5A (<i>p</i> < 0.05) expression level and an increase in COL7A1 and TUBB3 expression levels in TGF-β1-treated normal human lung fibroblasts.</p><p><strong>Conclusion: </strong>We constructed an RF diagnostic model based on SEMA5A, COL7A1, and TUBB3 expression in lung tissue for diagnosing patients with IPF.</p>","PeriodicalId":94007,"journal":{"name":"Expert review of respiratory medicine","volume":" ","pages":"1307-1320"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert review of respiratory medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17476348.2024.2311262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/31 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: The disorder of circadian rhythm could be a key factor mediating fibrotic lung disease Therefore, our study aims to determine the diagnostic value of circadian rhythm-related genes (CRRGs) in IPF.
Methods: We retrieved the data on CRRGs from previous studies and the GSE150910 dataset. The participants from the GSE150910 dataset were divided into training and internal validation sets. Next, we used several various bioinformatics methods and machine learning algorithms to screen genes. Next, we identified SEMA5A, COL7A1, and TUBB3, which were included in the random forest (RF) diagnostic model. Finally, external validation was conducted on data retrieved from the GSE184316 datasets.
Results: The results revealed that the RF diagnostic model could diagnose patients with IPF in the internal validation set with the area under the ROC curve (AUC) value of 0.905 and in the external validation with the AUC value of 0.767. Furthermore, real-time quantitative PCR and western blotting results revealed a significant decrease in SEMA5A (p < 0.05) expression level and an increase in COL7A1 and TUBB3 expression levels in TGF-β1-treated normal human lung fibroblasts.
Conclusion: We constructed an RF diagnostic model based on SEMA5A, COL7A1, and TUBB3 expression in lung tissue for diagnosing patients with IPF.